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PREFACE 


Introduction and apologia 


Four men, who had been blind from birth, wanted to know what an elephant was 
like; so they asked an elephant-driver for information. He led them to an elephant, 
and invited them to examine it; so one man felt the elephant’s leg, another its trunk, 
another its tail and the fourth its ear. Then they attempted to describe the elephant 
to one another. The first man said ‘The elephant is like a tree’. ‘No,’ said the second, 
‘the elephant is like a snake’. ‘Nonsense!’ said the third, ‘the elephant is like a broom’. 
“You are all wrong,’ said the fourth, ‘the elephant is like a fan’. And so they went on 
arguing amongst themselves, while the elephant stood watching them quietly.' 


The notion of topos resembles an elephant in that it is possible to come up 
with very different descriptions of what topos theory is about, depending on 
the direction from which you approach the subject. It was André Joyal, in his 
lecture at the 1981 Cambridge Summer Meeting on Category Theory, who first 
drew attention to the parallel, and I am grateful to him for allowing me to use it 
as the basis for the title of the present work. He listed seven different descriptions 
of ‘what a topos is like’: 


(i) ‘A topos is a category of sheaves on a site’ 
(ii) ‘A topos is a category with finite limits and power-objects’ 
(iii) ‘A topos is (the embodiment of) an intuitionistic higher-order theory’ 


(iv) ‘A topos is (the extensional essence of) a first-order (infinitary) geometric 
theory’ 


(v) ‘A topos is a totally cocomplete object in the meta-2-category CART of 
cartesian (i.e., finitely complete) categories’ 


(vi) ‘A topos is a generalized space’ 

(vii) ‘A topos is a semantics for intuitionistic formal systems’ 
In the 20 years that have passed since that lecture, the category-theory commu- 
nity has added a few more descriptions to the list (and the theoretical computer 
scientists have contributed yet others); for example, 

(viii) ‘A topos is a Morita equivalence class of continuous groupoids’ 

(ix) ‘A topos is the category of maps of a power allegory’ 

l The story of the blind men and the elephant exists in numerous versions in Indian folklore. 


This version is freely adapted from E. J. Robinson, Tales and Poems of South India (London, 
1885). 
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(x) ʻA topos is a category whose canonical indexing over itself is complete 
and well-powered’ 


(xi) ‘A topos is the spatial manifestation of a Giraud frame’ 
(xii) ‘A topos is a setting for synthetic differential geometry’ 
(xiii) ‘A topos is a setting for synthetic domain theory’, 


and so on. But the important thing about the elephant is that ‘however you 
approach it, it is still the same animal’; this book is an attempt to demonstrate 
that the same is true of topos theory. 


A brief bibliography of topos theory 


This book resembles an elephant in another respect: namely its gestation period. 
To explain (and apologize for) this, I have to go back still further into the history 
of the subject. I must also crave the reader’s indulgence for what may seem an 
excessively egocentric view of that history in the paragraphs which follow. 

The original notion of topos, as a ‘generalized space’ suitable for supporting 
the exotic cohomology theories required in algebraic geometry, sprang from the 
fertile brain of Alexandre Grothendieck in the early 1960s, and was developed in 
his ‘Séminaire de Géometrie Algébrique du Bois-Marie’ particularly during the 
academic year 1963-64. The duplicated notes of that seminar circulated widely 
among algebraic geometers and category-theorists over the next decade, until 
Springer-Verlag did the world a service by publishing a revised and expanded 
edition in three volumes of Lecture Notes in Mathematics in 1972 [36]. But 
by then the subject had already been ‘reborn’ in its second incarnation, as an 
elementary theory having links with higher-order intuitionistic logic, through 
the collaboration of Bill Lawvere and Myles Tierney during 1969-70 (which, 
in turn, built upon Lawvere’s work on providing a categorical foundation for 
mathematics, which had been developing since the early 1960s). 

As a graduate student in the early 1970s, I was fortunate to ‘get in on the 
ground floor’ of elementary topos theory, by attending Tierney’s summer school 
lectures [1167] at Varenna in 1971, and later Saunders Mac Lane’s lectures dur- 
ing his Cambridge sabbatical in autumn 1972. At that time the only general 
accounts of the subject in print were the lecture notes [649] by Anders Kock 
and Gavin Wraith, which were not all that widely available, and Peter Freyd’s 
survey article [371], whose view of the subject was idiosyncratic to say the least 
— though they were joined over the next two years by the notes of Tierney’s 
lectures, already cited, and by Wraith’s second account [1236]. 

More or less immediately after completing my Ph.D. in 1974, I embarked on 
the writing of what became the first account in book form of the elementary 
theory of toposes, published by Academic Press in 1977 [504]. With the brash 
self-confidence of youth, I attempted to provide between one pair of covers a full 
account of as many aspects of the elephant as I could muster: the result was 
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a book which, though universally acknowledged as ‘far too hard to read, and not 
for the faint-hearted’,? remained in demand for a long time — and continues to 
be widely cited even today, despite being nearly 25 years out of date. 

For some time [504] was the only available book on elementary topos theory. 
But by the time of Joyal’s 1981 lecture it had been joined by a second, Robert 
Goldblatt’s Topoi: the categorial analysis of logic [411], and over the next decade 
or so a good many other books appeared which either were devoted to topos the- 
ory or contained a good deal of topos-theoretic material: Teoria toposurilor [997] 
by Alexandru Radu in 1981-2, Toposes, triples and theories [87] by Michael Barr 
and Charles Wells in 1985, Introduction to higher order categorical logic [682] by 
Joachim Lambek and Philip Scott in 1986, Toposes and local set theories [95] by 
John Bell in 1988, Categories, allegories [381] by Peter Freyd and Andre Scedrov 
in 1990, Models for smooth infinitesimal analysis [856] by Ieke Moerdijk and Gon- 
zalo Reyes in 1991, Lecture notes on topoi and quasitopot [1248] by Oswald Wyler, 
also in 1991, Sheaves in geometry and logic [751] by Saunders Mac Lane and 
Ieke Moerdijk in 1992, Elementary categories, elementary toposes [760] by Colin 
McLarty, also in 1992, Relative category theory and geometric morphisms [238| 
by Jonathan Chapman and Frederick Rowbottom, yet again in 1992, and finally 
the third volume of Francis Borceux’ remarkable magnum opus Handbook of 
categorical algebra [147] in 1994. 

Each of these books contributed a new view of the elephant, illuminating 
features not visible from other angles, and thus each of them was valuable. But 
I hope I shall not offend the authors listed above by saying that none of them, 
with the possible exception of the last, really attempted to give a picture of 
the whole animal: indeed, in several cases the concentration on one part of its 
anatomy is self-evident from the title. 

Meanwhile, my own book had gone out of print in 1987, despite a corrected 
reprint (some time after the original stock had run out) in 1985. There was some 
suggestion at that time that Academic Press might be willing to publish a revised 
second edition of the book, if I were willing to write it; but their procrastination 
over, and lack of enthusiasm for, the 1985 reprint did not fill me with enthusiasm 
for that course of action. Another reason for my reluctance was the extent to 
which the subject had grown since the mid-1970s: since the principal merit of 
my book, in comparison with the others by then available or forthcoming, was 
its comprehensiveness, there would be no point in replacing it by anything other 
than a comprehensive account of the subject as it stood by then, and such an 
account would inevitably be very much longer than the 1977 book. 


2 The quotation is from the comments of an anonymous referee consulted by OUP about 
the present book. 
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The three blind men 


By the mid-1980s Ieke Moerdijk had also become concerned (despite his involve- 
ment as co-author in two of the books mentioned above) about the lack of any 
single account of topos theory which could properly claim to give a comprehen- 
sive picture of the subject. However, like me he felt daunted by the prospect 
of writing such a book single-handed. Sometime early in 1986, he mentioned to 
me his idea that what was needed was a ‘handbook’ type publication in which a 
small team of experts would contribute extended essays on the different aspects 
of the subject, under the overall supervision of a co-ordinating editor — he felt 
that I was the ideal person to take on the latter rôle. I was not, at first, greatly 
taken with this idea: as I saw it, the chief problem with the ‘handbook’ style was 
that, although the elephant has many different aspects, their interconnectedness 
is such that it is impossible fully to appreciate any one of them without making 
some reference to most of the others. So I foresaw that the job of a co-ordinating 
editor, for such a volume, would be more than usually nightmarish. 

Nevertheless, discussions between Ieke and myself about the idea continued 
to take place on the odd occasions when we met, and they soon came to involve 
my former student Andrew Pitts as another possible author. The ‘handbook’ 
idea gradually metamorphosed into something more like a ‘Bourbaki’ volume or 
the Compendium of continuous lattices [404], in which each author would have 
individual responsibility for the first drafts of particular sections, but the final 
text would have to be agreed by all the authors (and the individual authorship of 
the different sections would not be publicly identified). At a meeting between the 
three of us in December 1987, we formally agreed to go ahead with a three-author 
work on that basis, and a rough table of contents — including the division of the 
book into six parts, although they did not correspond exactly to the six parts in 
the present table of contents - was worked out, together with the responsibilities 
of the three of us for drafting the more introductory sections. The working title 
Sketches of an elephant was also agreed at this meeting. 

By the late summer of 1988 we had enough material in draft to approach 
publishers with a definite proposal, and we signed an agreement with Oxford 
University Press in January 1989. At the time (I blush to recall it now) we 
hoped that we would be able to deliver a complete typescript to the publishers 
by the end of September 1991. We also estimated that the overall length of the 
book might be around 1000 pages; in the event, that estimate has proved more 
accurate than the temporal one. 

Over the next couple of years, a substantial quantity of draft material was 
written, with P. T. J. working mainly on Parts A and B, I. M. on Part C and 
A. M. P. on Part D. Ieke produced large quantities of draft material, principally 
on topics from Chapters C2 and C3, although virtually none of it has survived 
into the version that exists today; on the other hand, stylistic experts may 
just be able to detect the origins of the present Sections D1.1-D1.3 in a draft 
written by Andrew, even though it has been heavily rewritten by me. But it had 
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become clear by early 1991 that the project was a much larger one than we had 
realized: in particular, the problems of co-ordination between the three of us, 
and of reaching agreement on a text that all three could be happy with, seemed 
overwhelming. It was at least partly these problems that led Ieke and Andrew 
independently to decide to withdraw from the project in late 1992. 

I should make it clear that, despite their not seeing the project through to 
completion, I remain enormously indebted to both Ieke and Andrew. If it had not 
been for their support in the early stages, I should never have dared to embark 
on the writing of a work on this scale, and their encouragement kept me going 
until the book had progressed far enough to acquire its own momentum. Also, 
their constructive criticisms of my early drafts of Part A caught many infelicities 
that I should probably not have noticed myself. And on a more mundane level, 
Andrew taught me to use TFX,’ without which the project would have been 
not just foolhardy but downright impossible. 


One blind man 


I thus found myself in early 1993 faced with three options: to abandon the project 
myself, to try to find other co-authors, or to accept the burden of writing the 
whole book myself. The first of these was unattractive because it would mean 
abandoning the large amount of material (including a complete draft of Part A) 
that I had already written, and the second did not appeal since I could not think 
of any potential co-author who would be as easy to work with, or as productive, 
as I knew Ieke and Andrew to be. So I decided to go ahead with the third option, 
although I knew that that would involve a ‘long hard slog’ over several years — 
I found it impossible to estimate how many. 

With my other commitments to teaching, research and administration, 
progress was indeed slow: often, several months would pass without my find- 
ing time to open the files where the text of the book was stored. In addition, 
I often felt like Heracles wrestling with the Hydra: every attempt to draft a 
new section seemed to entail the rewriting of at least two existing ones, so that 
I appeared to come no nearer to the goal of a complete text. (There was also, at 
times, the fear that the subject was advancing faster than I could write it down.) 

Nevertheless, progress did sometimes get made. One thing which helped to 
spur the drafting, or revision, of certain sections was the chance to lecture about, 
the material in them, to audiences around the world, on occasions which included 
the following: 


e In June 1994 I was invited to lecture on classifying toposes at Shaanxi 
Teachers’ University, Xi’an, China, through the good offices of Wang Guo-Jun 


3 Whilst I am on that subject, I should also record my indebtedness to another of my former 
students, Paul Taylor, whose TeX diagram-drawing macros are used throughout the book. 
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and my former student Zhao Dongsheng. This involved the polishing of 
material from Chapters A1, D1 and D3. 

e In May 1995, Francis Borceux invited me to lecture on locales at a summer 
school on ‘Topological Applications of Category Theory’ in Alle-sur-Semois, 
Belgium: this involved material from Chapters C1 and C3. 


During my sabbatical visit (arranged by Max Kelly) to Sydney, Australia, 
during February and March 1996, I gave a number of talks in the Sydney 
Category Seminar covering material from Chapters B1 and B3. 


In July 1997, as an invited speaker at the ‘Workshop on Logic, Langage, 
Information and Computation’ (commonly known as WoLLIC) in Fortaleza, 
Brazil, I had the opportunity (thanks to Ruy de Queiroz) to give a tutorial 
on categorical logic before the main conference: once again, this provided an 
occasion to polish the material in Chapters D1 and D2. 


I mention these meetings in order publicly to record my thanks to the organizers 
named: though they were not aware of it, they helped to bring the project closer 
to completion. 

By the beginning of the third millennium, however, it had become clear to me 
that what was absolutely essential, if the writing of Parts A-D was ever to be fin- 
ished, was to find an extended period of time in which I could set all other work 
aside and concentrate full-time on the book. (Although I had at various times 
begun work on several sections from Part F, I had by then taken the decision that 
the only sensible route to completion was to concentrate on the first four parts, 
which were too strongly interconnected to be written separately, and to leave the 
last two — which, dealing as they do with applications, are less strongly tied to the 
others — for completion at a later date and publication as a separate volume.) By 
great good fortune, such an opportunity arose with the invitation to participate 
in the ‘Mathematical Logic Year’ at the Institut Mittag-Leffler in Djursholm, 
Sweden, for a two-month period in January-March 2001. My gratitude to the 
organizers of that programme (Dag Normann, Viggo Stoltenberg-Hansen and 
Jouko Väänänen), for providing me with this opportunity, is impossible to exag- 
gerate; and I am also deeply grateful to Kjell-Ove Widman and the staff of the 
Institut Mittag-Leffler for providing such a near-perfect working environment. 

The speed with which everything fell into place, once I had the opportunity 
for uninterrupted work on the book, greatly surprised me. Although the propor- 
tion of the total text which was written in Sweden is not so great, the number 
of sections which reached (more or less) their final form during my time there 
is very large: about half of those in Part B, almost all in Part C apart from 
Chapter C1, and most of those in Chapters D4 and D5. (In addition, a good 
deal of material was added to the supposedly complete draft of Part A.) Within 
a month of my return to Cambridge, I was able to produce a complete text of 
the first four parts, except for three sections in Part D. Copies of this draft were 
circulated to a number of colleagues (including Ieke Moerdijk and Andrew Pitts, 
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as well as Martin Hyland, Anders Kock and Gavin Wraith) for their comments 
and suggestions; the latter (which were, in many cases, of considerable value) 
were taken into account in the final revision of the text which took place in 
August-September 2001, after the writing of the last three sections had been 
completed. 

At the time of writing these words, Parts E and F of the original plan remain 
to be written. In view of my experience so far, I am making no promises about 
how long it will take to write them; but I hope that interested readers will not 
have to wait as long for the third volume as they have done for the first two. 


How to read this book 


‘Begin at the beginning,’ the King said gravely, ‘and go on till you come to the end; 
then stop.” 


Whilst the King of Hearts’s advice to the White Rabbit is generally sound, it is 
not particularly helpful in relation to this book. At the December 1987 meeting 
to which I have referred, the three of us took a number of decisions about the 
overall structure and content of the book: despite the many changes of detail 
which have taken place since then, the general principles which we agreed have 
by and large survived unchanged. Since some understanding of these principles 
is important for anyone who seeks to obtain information from this book, I shall 
describe them here. 

The first and most important is that the book is not a textbook. It is not 
addressed to those who are trying to learn about topos theory for the first time,° 
but rather to those who already have some acquaintance with the subject and 
who wish to deepen their understanding, or to learn about aspects of it which 
they have not previously encountered. In keeping with this, there are no ‘exercises 
for the reader’ (which are often the lazy author’s way of leaving out the proofs of 
results that he can’t be bothered to write out in full): instead, all results needed in 
the book (with the exception of a few results from other areas of mathematics, 
for which references to standard works are given) are proved in full, and all 
examples are fully worked out.® This is not to say that the book cannot be used 
as a course text: it would, for example, be possible to base an introductory course 
on the purely category-theoretic side of topos theory on Chapters Al, A2 and A4, 
although the instructor would have to be fairly selective about the specialized 
material which has been inserted in those chapters for future reference, and 
would also probably want to leaven the presentation with some material from 
Parts C and/or D. 


4 Lewis Carroll, Alice’s Adventures in Wonderland (London, 1865). 

5 For those who do seek an introduction to the subject, the one by Mac Lane and 
Moerdijk [751] remains (in the present writer’s opinion) the most recommendable. 

6 It should be understood that this policy does not debar the author from leaving routine 
‘diagram-chasing’ and other calculations to the reader. 
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Secondly, the book is not intended to be read sequentially. The whole point 
of the six-part structure is that the book is seeking to present several different 
approaches to topos theory, so where one starts reading will depend on the 
line of approach one wishes to follow. Most readers, however experienced, will 
probably wish to start by at least skimming through Chapter A1, in order to 
familiarize themselves with the book’s conventions on categorical notation and 
terminology (about which I shall have more to say below); but thereafter a reader 
who already knows the basic category theory of toposes may well decide to jump 
to Chapter B1, C1 or D1, depending on the approach he wishes to take. However, 
almost any reader will have to be prepared to refer backwards and/or forwards 
a good deal. Although it would have been theoretically possible to write entirely 
independent accounts of the geometric and logical aspects of topos theory in 
Parts C and D (and that was more or less our original intention), in fact the 
two are so thoroughly interconnected that such a course would have involved an 
immense amount of duplication (and of course increased the overall length of the 
book). 

There is some duplication, which is quite deliberate: for example, the 
treatment of natural number objects in Section A2.5 overlaps with that in 
Section D5.1, the construction of classifying toposes is treated in two differ- 
ent ways in Sections B4.2 and D3.1, and exponentiability of toposes is treated 
in Sections B4.3 and C4.4. And there are numerous smaller instances, where an 
individual result is proved in two different ways, using material from different 
areas of the book. But in general, if there is an optimal proof of a particular 
theorem, and it clearly belongs (say) in Part D, there does not seem to be any 
point in writing it out a second time in Part C if it happens to be needed there. 
So the reader who wishes to see the full picture must be prepared to skip around 
from one part to another, when necessary. (On the other hand, a reader who is 
willing to ‘take on trust’ the results quoted from other parts of the book should 
be able to follow any given chapter as a (more or less) connected narrative.) 

I have at least done my best to make the cross-references easy to use. All 
numbered references (definitions, lemmas, theorems, examples, etc.) within a 
given section are numbered in a single sequence: thus ‘B3.4.5’ denotes the fifth 
numbered reference (in fact a corollary) in the fourth section of the third chapter 
of Part B. For references within a given Part, the letter is omitted: thus if the 
corollary just mentioned is cited elsewhere in Part B, it appears as ‘3.4.5’. If a 
lemma, proposition, theorem, corollary or scholium’ is subdivided into parts, 
they are normally labelled (i), (ii), (iii) and so on, whereas definitions, examples 
and remarks are subdivided (a), (b), (c),.... The symbol O is used to denote 
the end of a proof; if it appears at the end of the statement of a result, it means 


7 I should perhaps explain that I use the term ‘scholium’ (literally, a marginal note) to 
denote something which follows directly from the proof of a preceding result, as opposed to a 
corollary which follows directly from the statement of the preceding result. 
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either that no proof is given, or that the proof is contained in the discussion 
which precedes the statement. 

At the December 1987 meeting we adopted the principle that, for every major 
result to be included in the book, we should seek to find the optimal proof, 
although that should not preclude us from giving alternative proofs if they helped 
to shed additional light on the result. (I hope it is unnecessary to add that I am 
conscious of having not always succeded in this aim.) Partly for this reason, 
we decided that we would not attempt to give detailed attributions for all the 
results in the book, since the optimal proof is often not the original one, but 
has been arrived at through successive polishing of the latter by several hands. 
Nevertheless, I have attempted to compile a comprehensive bibliography of topos 
theory, and to give attributions where credit is clearly due to a single source. 
(I considered attempting to classify the bibliography entries by subject area, as 
was done in [404], for example; but the task seemed too large and ultimately 
of doubtful value.) And in most sections I have provided a list of references at 
the end of the section for those who wish to pursue the topic(s) covered in the 
section in greater detail. 

We also recognized that the choice of terminology and notation, in a book of 
this nature, was of immense importance. It is impossible to be entirely consis- 
tent about notation, in a subject of this size: there simply aren’t enough different 
letters and typefaces for one to avoid re-using some of them in mutually contra- 
dictory ways. But I have done my best to ensure local consistency of notation, 
within each chapter. As regards terminology, I have consciously attempted to 
use this book to disparage some of the sloppier pieces of inappropriate usage 
which are common among category-theorists; whether it will have any influence 
in this respect of course remains to be seen, but readers who are familiar with 
the existing terminology may well find some of my choices rather surprising at 
first. I have, however, listed alternatives to my chosen terminology in the index, 
where they appear in italics, and in most cases my reasons for discarding them 
will be found in the text at the points to which the index entries refer. 

A comprehensive index is also of great. importance in a book of this nature. 
I have done my best to provide one: in addition to listing alternative terminology 
in italics, as already mentioned, it also contains entries for many of the principal 
theorems, particularly those which occur in several different versions in different 
places, and I hope that this will assist the specialist in finding his way quickly 
to the information which he wants. One point which should be mentioned about 
the index is that, where an index entry begins with a mathematical symbol, it 
is listed in the alphabetical order as if the name of the symbol had been spelled 
out: thus, for example, ‘o-pretopos’ will be found between ‘sifted coverage’ and 
‘signature’. In addition to the general index, there is also an index of standard 
notation, ordered alphabetically on the same principle. 


8 That is, the one which the late Paul Erdés would have described as belonging to 
‘The Book’. 
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Conclusion 


I have already mentioned my indebtedness to several colleagues who have played 
particular rôles in the story which led up to the writing of this book. But there 
are many other workers in topos theory to whom I am indebted, for their ideas 
which have contributed to my own understanding of topos theory, and which 
have in many cases found their way into the text which follows without specific 
acknowledgement. It would be quite impossible to thank them all individually, 
so I hope that they will accept these few words of collective thanks. 

Finally, I must thank the staff of Oxford University Press, and in particular 
Elizabeth Johnston and Alison Jones: not only for their enormous patience in 
waiting for this text ten years beyond the date when we originally promised to 
deliver it, but also for their enthusiastic and helpful response when I was finally 
able to let them know that its delivery was within sight. It has been a pleasure 
to work with them, and I look forward to doing so again when Parts E and F 
are ready for publication. 
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REGULAR AND CARTESIAN 
CLOSED CATEGORIES 


Al.1 Preliminary assumptions 


In this book, we start from the assumption that the reader is already familiar 
with the basics of category theory: specifically, we assume that he has already 
seen the definitions of the terms category, functor, natural transformation, 
adjunction, limit and monad. These definitions can, of course, be found in any 
of the standard textbooks on category theory (for example, [742], [381] or {147]). 
However, there are a number of conventions on which the standard texts differ, 
and we need to state our position on these before proceeding further. 

First, we (generally) write functions on the left of their arguments, and there- 
fore compose morphisms in a category ‘from right to left’: fg means ‘g followed 
by f’. (We normally denote composition, as here, simply by juxtaposition; but 
we Shall occasionally write f -g or f og in order to improve the readability of 
complicated expressions.) Next, we call a functor faithful if it is ‘injective on 
hom-sets’: that is, F: C — D is faithful iff, given morphisms f and g of C, the 
equations dom f = dom g, cod f = cod g and F(f) = F(g) together imply 
f = g. We do not require that F' should also reflect isomorphisms; if it does 
so (and is faithful), we call it conservative. (Of course, there are circumstances 
in which any faithful functor is automatically conservative: for example, if the 
domain category C is balanced (i.e. has the property that every morphism which 
is both monic and epic is an isomorphism), or if F is full as well as faithful.) 

A full subcategory, of course, is one whose inclusion functor is full; but when 
dealing with full subcategories we shall generally assume (sometimes without 
saying so explicitly) that they are also replete, i.e. that any object of the ambient 
category isomorphic to one in the subcategory is itself in the subcategory. Thus 
full subcategories of C correspond to classes of objects of C which are closed 
under isomorphism. In particular, for us a reflective subcategory will always 
mean a full, replete subcategory whose inclusion functor has a left adjoint. We 
use the term reflection for an adjunction whose right adjoint is full and faithful, 
and reflector for a monad which is idempotent (i.e. one whose multiplication is 
an isomorphism); it is well known that these three concepts are essentially the 
same. The following, related, result seems not to be widely known, however; and 
since we shall need it occasionally, we sketch its proof here. 
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Lemma 1.1.1 Let F:C — D be a functor having a right adjoint G. If there is 
any natural isomorphism (not necessarily the counit of the adjunction) between 
FG and the identity functor on D, then (F 1G) is a reflection. 


Proof Using the given isomorphism, we may transfer the comonad structure 
on FG arising from the adjunction to a comonad structure (€,5) on 1p. But the 
monoid of natural endomorphisms of the identity functor on a category is always 
commutative; so, since 6 is a one-sided inverse for €, we deduce that they are 
inverse isomorphisms. Transferring back again, we see that the counit of (F 4 G) 
is an isomorphism, which is well known to be equivalent to the adjunction being 
a reflection. a 


By an equivalence between categories C and D, we mean a pair of functors 
F:C + D, G: D — C equipped with (a choice of) natural isomorphisms between 
the composites F'G and GF and the respective identity functors. It is well known 
that, if such isomorphisms exist, they may be chosen to be the unit and counit of 
an adjunction (in either direction) between F and G; and we shall tacitly assume 
that this has been done whenever we need it. If F is one half of an equivalence 
of categories, then it is not only full and faithful but also essentially surjective 
on objects, that is, every object of its codomain D is isomorphic to one in the 
image of F. Conversely, if F has these three properties, and we are able to choose 
for each object B of D an object GB of C and an isomorphism ag: FGB — B, 
then G extends to a functor D — C such that F and G form an equivalence of 
categories {œ being one of the two natural isomorphisms required). We shall call 
F a weak equivalence if it is full, faithful and essentially surjective on objects. 

The terminology just introduced is in keeping with a general principle which 
will be present throughout this book, though not always explicitly stated: 
namely, that we should make the smallest possible demands on the ‘metatheory’ 
within which we interpret the theory of categories (and in particular we shall 
not assume that it satisfies any form of the axiom of choice, so that there is a 
distinction to be made between the assertion that something exists in all cases, 
and the assertion that we are given a function assigning to each case a choice of 
that thing). This is because we shall need, at various points, to re-interpret the 
results of elementary category theory in several different metatheories, and we 
do not want to have to go through the chore of re-proving these results in each 
new context. (However, for the present, readers who feel uncomfortable about 
this may interpret everything we say as applying to categories defined within 
some fixed universe of sets.) 

Thus, when we say ‘C is a category’, what we mean is that we have given 
interpretations (in some suitable metatheory) of the statements ‘A is an object 
of C’ and ‘f is a morphism of C’, together with interpretations of the notions 
‘domain’, ‘codomain’, ‘identity morphism’ and ‘composite’ which collectively 
satisfy the axioms of the first-order theory of categories. We do not assume that 
the objects or the morphisms of C form sets (in any formalized sense); nor even 
that, given a pair of objects A and B of C, the morphisms A — B in C form a 
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set. Whenever (in a context where we are given a particular model of set theory) 
we wish to make the first or the second of these assumptions, we shall draw 
attention to it by calling C a small category or a locally small category, as the 
case may be. 

As mentioned above, we assume familiarity with the notion of monad; in 
particular, we assume that the reader has seen the constructions of the Filenberg- 
Moore category CT and of the Kleisli category associated with a monad T on C. 
We shall have occasion to use the Crude Monadicity Theorem (sometimes known 
as the Crude Tripleability Theorem) in its ‘reflexive-coequalizer’ form; recall 
that a parallel pair of morphisms f,g: A = B is called reflerive if there exists 
s: B — A with fs = gs = 1g. 


Theorem 1.1.2 LetU:D —C be a functor, and suppose 
(i) U has a left adjoint, 
(ii) U is conservative, and 
(iii) D has and U preserves coequalizers of reflexive pairs. 


Then U is monadic, i.e. the comparison functor K: D —C" (where T is the 
monad on C induced by U and its left adjoint) is part of an equivalence of 
categories. 


Proof Most ofthe standard texts on category theory give this theorem without 
the restriction to reflexive pairs in condition (iii); but an examination of the proof 
will show that only coequalizers of reflexive pairs are used in it. E 


At various points, we shall need to appeal to the standard ‘adjoint lifting 
theorems’ for functors between categories of algebras. For convenience, we state 
them here: 


Proposition 1.1.3 Suppose given a diagram of categories and functors 
T 
D ———_> D 
U Uy’ 


Cc ——— C' 


such that U and U’ are monadic and the square commutes up to natural 
isomorphism. 


(i) If D has coequalizers of reflexive pairs and T has a left adjoint, then T has 
a left adjoint. 


(ii) If T andT also commute with the left adjoints F and F' of U and U' (more 
specifically, if the canonical natural transformation F'T — TF which is the 
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‘mate’ of the isomorphism TU = U'T is an isomorphism) and T has a right 
adjoint, then T has a right adjoint. 


Proof See [503], for example. o 


We shall occasionally wish to consider ‘T-algebras’ when T is a mere endo- 
functor of a category C; that is, pairs (A,a) where A € ob C and a: TA > A. 
(The context - and the typeface — will usually make it clear whether we are 
talking about ‘mere’ T-algebras in this sense, or about T-algebras in the sense 
of Eilenberg and Moore.) We recall the following useful fact about T-algebras, 
first observed by J. Lambek [665]: 


Lemma 1.1.4 Let T:C >C be a functor, and suppose the category CT of 
T-algebras has an initial object (A,a). Then a is an isomorphism. 


Proof (TA,Ta) is also a T-algebra, and so we have a unique morphism 
f: A-— TA such that the left-hand square in 


T 
TA mae TTA 1° > TA 


a Ta a 


Ae Tha 
commutes. But the right-hand square trivially also commutes, so the composite 
af (is a T-algebra homomorphism, and hence) must be the identity on A. Now 
the commutativity of the left-hand square yields fa = T(af) = lra; so f is a 
two-sided inverse for a. o 


In the definition of a category it is normally insisted that ‘hom-sets are dis- 
joint’, i.e. (in elementary terms) that each morphism of a category has a uniquely 
specified domain and codomain. The actual practice of mathematics frequently 
falls short of this ideal (even in defining the category of sets, one has to adopt 
an ‘unnatural’ definition of function), and in order to deal with this we shall 
find it convenient to borrow the notion of protocategory introduced by Freyd 
and Scedrov [381]. A protocategory C consists of two types of entities, objects 
and protomorphisms; the latter may be thought of as possible names for mor- 
phisms in the category which C defines. These data come equipped with two 
primitive predicates, a source—target predicate relating a protomorphism f and a 
pair of objects A, B (written ‘f: A — B’ and read ‘f is a name for a morphism 
from A to B’) and a composition predicate relating three protomorphisms f, g, h 
(written ‘fg = h’ and read ‘h is a possible composite of f and g’; this second 
predicate is usually, but not necessarily, functional in the sense that, given f 
and g, there is at most one h satisfying fg = h). The key axiom states that, if 
g: A— Band f: B > C, there is exactly one h satisfying h: A — C and fg =h. 
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Of course, we also need axioms to express associativity of composition and exis- 
tence of identities; but we do not need to write these out in detail, since they 
may be inferred from the information that the category generated by a protocate- 
gory C has the same objects as C and the instances of the source-target predicate 
(i.e. the triples (f, A, B) satisfying f: A — B) as morphisms. We shall normally 
use the same symbol C for a protocategory and the category which it generates; 
since we shall only use protocategories as a quick way of specifying particular 
categories which we want to study, we do not need to develop an independent 
theory of protocategories. 
An example will help to clarify the ideas introduced in the last paragraph. 


Example 1.1.5 Given a category B, a category C is said to be structured over 
B if it can be generated by a protocategory whose objects come equipped with 
an underlying-object map A+ |A] to the objects of B, whose protomorphisms 
are the morphisms of B, whose source-target predicate is contained in the rela- 
tion ‘dom f = |A] and cod f = |B| (and has the property that 1,4;: A — A for 
all objects A), and whose composition predicate is the composition in B. C is 
thus specified completely by giving its objects, the underlying-object map and 
the source-target predicate. For example, the category Gp of groups is struc- 
tured over the category Set of sets: its objects are all groups, |A| is the (usual) 
underlying set of a group A and the source-target predicate is the relation ‘... is 
a homomorphism from ... to ...’. (Similarly for other categories of universal 
algebras, for categories of topological or uniform spaces, etc.; and for the cate- 
gory Gp(C) of group objects in a category C with finite products, regarded as a 
structured category over C.) 


If C is structured over B, then A +> |A| is the object-map of a functor 
U:C — B, whose action on morphisms sends an instance (f: A — B) of the 
source—target predicate to f; this functor is faithful, and conversely ifU:C — B 
is any faithful functor then C may be structured over B by setting |A| = U(A) 
and 


(f:A->B) © (Ag)(g: A> BinC and U(g) = f). 


A category structured over Set is called concrete (note that we consider concrete- 
ness as an extra structure rather than a property of a category), and a category 
structured over the category 1 with one object and one morphism (equivalently, a 
category definable by a protocategory with just one protomorphism — we usually 
denote the latter by <) is called a preorder. 

A construction which we shall use a great deal in this book is that of the slice 
category C/B, where C is a category and B is an object of C. This may be defined 
conveniently as a structured category over C: its objects are the morphisms of 
C with codomain B, the underlying object of f: A — B is A = dom f, and the 
source-target predicate is given by h: f — g iff gh = f, i.e. ‘the morphisms of 
C/B are commutative triangles’. We note that in this case the forgetful func- 
tor C/B — C is not merely faithful but conservative: if h: f — g and A is an 
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isomorphism in C then h~1: g — f. We shall write B\C for the co-slice cate- 
gory (C°P/B)°?; but co-slices will occur much less frequently than slices in what 
follows. 

Many of the categorical properties which we consider will be stable under 
slicing — that is, if a category C has the given property, then so do all its slice 
categories C/B. As a general rule, given a property P of categories, we shall say 
that C is locally P if all its slice categories have property P. (There is a geometric 
intuition behind this use of the word ‘locally’ — see Chapter C1 — but note that 
the case P = ‘small’ is an exception to our general scheme.) We observe that, 
if f: A — B is an object of C/B, then the slice category (C/B)/f is canonically 
isomorphic to C/A; so a property of the form ‘locally P’ is automatically stable 
under slicing. Note also that if B is a terminal object of C then the forgetful 
functor C/B — C is an isomorphism; and that a slice category C/B always 
has a terminal object, namely the identity morphism 1g. For this reason, we 
shall normally denote the terminal object of a category, if it has one, by 1; and 
if B is an object of C we shall tend to use B also as a name for the unique 
morphism B — 1. 


Example 1.1.6 If B is a set, then the slice category Set/B is equivalent 
(though not isomorphic) to the B-fold cartesian power Set”: in one direc- 
tion the equivalence sends an object f: A — B of Set/B to the family of fibres 
(Ap | b € B) where A, = f~'(b) C A, and in the other it sends a B-indexed 
family (A, | b € B) to the disjoint union 


[I 4 = Ute x {b} |b € B} 


bE B 


together with the obvious projection [] A, — B. We leave to the reader the easy 
verification that these constructions are functorial and inverse to each other up to 
natural isomorphism. 


We shall often ‘extend’ the equivalence just described in an informal way, by 
thinking of objects of C/B (for an arbitrary C) as ‘B-indexed families of objects 
of C’; the formalism underlying this idea is developed in Chapters B1 and B2. 

If C and D are categories, we write |C, D} for the functor category whose 
objects are all functors C — D, and whose morphisms are natural transforma- 
tions between them. A further example of ‘stability under slicing’, which in fact 
extends the example above, and which will be of considerable importance later 
on, is the following. 


Proposition 1.1.7 LetC be a category, and F': C — Set a functor. Then there 
is a category F such that the slice category [C, Set] /F is equivalent to the functor 
category |F, Set]. Moreover, if C is small (resp. locally small), then so is F. 


Proof F is most conveniently defined as a category structured over C: its 
objects are the elements of the disjoint union of the sets F(A), A € ob C, the 
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underlying object of x being the unique A such that x € F(A), and the source- 
target predicate is given by 


f:ag—y iff F(f)(z)=y. 


The fact that F is small (resp. locally small) if C is follows immediately from the 
definition. 
Given an object a: G — F of [C, Set]/ F, we define a functor (a): F — Set 
by 
(a) (x) = {y € G(A) | aa(y) = 2} 


if x € F(A), with ®(a@)(f)(y) = G(f)(y) whenever this makes sense. It is clear 
that a morphism y: (a: G > F) — (8: H — F) in |C, Set]/F induces a natural 
transformation (y): ®(a) — (8), so that ® becomes a functor [C,Set]/F — 
[F, Set]. 

In the opposite direction, given a functor H: F — Set, let Y(H) denote the 
functor C — Set defined by 


W(H)(A)= || He), 


x€FA 


with U(H)(f), for f: A— B in C, defined as the union of the functions 
H(f: x — F(f)(x)) over all x € F(A). &(H) comes equipped with an obvi- 
ous natural transformation to F, and © itself is a functor [F,Set] — [C,Set]/F. 
We leave to the reader the remaining details of the verification of this fact, and 
of the fact that @ and W are inverse to each other up to natural isomorphism. O 


We assume familiarity with the process of ‘Karoubian completion’ or ‘Cauchy 
completion’, that is the splitting of idempotents. An idempotent in a category C 
is of course a morphism e: A — A such that ee = e; e is said to split if it can 
be factored as fg where gf is an identity morphism. (If such a splitting exists, 
then f is necessarily an equalizer of e and 14, and g is a coequalizer of the same 
pair; conversely, if C has either equalizers or coequalizers then all its idempotents 
have splittings. For the same reason, the splitting of an idempotent, if it exists, 
is unique up to canonical isomorphism.) If C is an arbitrary category and € is 
a class of idempotents of C (which we assume contains all identity morphisms 
of C), we define C[E] to be the category generated by the protocategory whose 
objects are the members of £, whose protomorphisms are the morphisms of C, 
and whose source-target predicate is given by 


f:e—e iff (e'fe is defined and) e'fe= f. 


The composition predicate of C[E] is that of C; however, C[E] is not a category 
structured over C as we defined it in 1.1.5, since the identity morphism on an 
object e is e itself, not ldom e- 
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Lemma 1.1.8 There is a full and faithful functor I: C — C[E], which is uni- 
versal amongst functors defined on C and mapping all elements of E to split 
idempotents. 


Proof We define J(A) = 14 and I(f) = f: laom f > leoa f; this is clearly a 
functor, and is full and faithful. If e: A A is a member of €, then 


€ € 
la ————> e —— la 


is a splitting of I(e) in E[E]; and if T: C — D is any functor sending the mem- 
bers of £ to split idempotents, we may (modulo some use of choice!) extend it, 
uniquely up to natural isomorphism, to a functor Ñ: C[E] — D, by defining T(e) 
to be the image of a splitting of T(e). o 


Corollary 1.1.9 For any category C, the functor categories [C, Set] and 
[CIE], Set] are equivalent, where E is the class of all idempotents of C. 


Proof Since Set has (canonical) equalizers, all its idempotents split; so 
every functor C — Set extends canonically to a functor defined on C[E]. It 
is straightforward to verify that the extension also works for arbitrary natural 
transformations between such functors. o 


Thus, when considering functor categories of the form [C, Set} (as we shall do 
frequently throughout this book), there is no loss of generality in assuming that 
C is a category in which all idempotents split. Such categories C are commonly 
called Cauchy-complete or Karoubian. In fact, if C is Cauchy-complete, it can be 
recovered up to equivalence from [C, Set]: recall that it follows from the Yoneda 
lemma that the representable functors C (A, —) are projective in [C, Set], and 
they are also indecomposable in the sense that they have no nontrivial coproduct 
(= disjoint union) decompositions. 


Lemma 1.1.10 JfC is a small Cauchy-complete category, then C° is (weakly) 
equivalent to the full subcategory of |C, Set] whose objects are indecomposable 
projectives. 


Proof The Yoneda embedding yields a weak equivalence from C°? to the full 
subcategory of representable functors in [C, Set], and we have just observed that 
the latter are all indecomposable projectives. Conversely, let F be an indecom- 
posable projective. The family of all natural transformations from representable 
functors to F is jointly epic, and so yields an epimorphism e: G — F from a 
coproduct of representable functors to F. Since F is projective, this epimor- 
phism is split, say by d: F > G. Pulling back the coproduct decomposition of 
G along d yields a coproduct decomposition of F, which must be trivial; so F is 
a retract of a single one of the representable functors whose coproduct is G, i.e. 
it is the image of an idempotent endomorphism of some C (A, —). But since the 
Yoneda embedding is full and faithful, this idempotent derives from a (unique) 
idempotent endomorphism of A. Since C is Cauchy-complete, this endomorphism 
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splits (with image B, say); and since any functor preserves images of idempo- 
tents we have F ~ C (B, -—). Thus the indecomposable projectives in [C, Set] are 
(up to isomorphism) exactly the representable functors. a 


A1.2 Cartesian categories 


In much of this book we shall be concerned with particular classes of categories 
having certain ‘set-like’ categorical properties. (Note, however, that by ‘set-like’ 
we do not mean ‘set-theoretic’; we are here talking about elementary properties, 
ones which may be formulated in the first-order language of categories.) The 
simplest of these, and the one on which (almost) all the others will depend, is 
that of having finite limits; so this section is devoted to recalling some facts 
about it. 
Of course, having finite limits is an elementary property: 


Lemma 1.2.1 For a category C, the following are equivalent: 


(i) Every functor D — C, where D is a finite category, has a limit. 
(ii) C has finite products and equalizers of pairs of morphisms. 


(iii) C has a terminal object, products of pairs of objects and equalizers of pairs 
of morphisms. 


(iv) C has a terminal object and pullbacks of pairs of morphisms. 


Proof See any good textbook on category theory. o 


Lemma 1.2.1 may be read in two different ways: as establishing the equiv- 
alence of statements of the form ‘For all diagrams of type... in C, there exists 
a limit’, or (more constructively) as saying that if we are given an operation 
assigning a particular choice of limit to each diagram of one of certain specified 
kinds, then we can use it to construct an operation which does the same for 
diagrams of certain other kinds. In keeping with our desire to demand as little 
as possible of our metatheory, we shall generally take the latter view: when we 
say that C has limits of a particular kind, we mean that we are given an oper- 
ation assigning limits to all diagrams of this kind, unless we explicitly indicate 
otherwise. This means that the existence of finite limits is, strictly speaking, an 
additional structure rather than a property of a category C; but it does little 
harm to think of it as a property, because the structure — if it exists — is unique 
up to unique isomorphism (unlike the structure of concreteness, for example). In 
fact, most of the ‘properties’ we shall consider in the next few sections are really 
structures, but they all have the same degree of uniqueness about them. 

We call a category cartesian if it has the structure indicated in (iii) or (iv) of 
Lemma 1.2.1. A functor fF’: C — D between cartesian categories is called carte- 
sian if it preserves this structure (but only in the ‘up-to-isomorphism’ sense; it 
is clearly unreasonable to expect that we can make a canonical choice of limits 


12 REGULAR AND CARTESIAN CLOSED CATEGORIES Al.2 


in two categories C and D in such a way that all interesting cartesian functors 
between them will respect these choices). The term ‘left exact’ is often used 
for functors preserving finite limits, but the doubly dead metaphor contained 
in this term (it is derived from exact differentials, by way of exact sequences) 
means that it is best avoided, except possibly in the particular context of abelian 
categories where the notion of exact sequence still has a meaning. 

The category Set is cartesian: any one-element set, say {@}, gives us a choice 
of terminal object, the usual definition of cartesian product gives us a choice 
of binary products, and subsets give us a choice of equalizers. Many categories 
structured over Set (for example, Gp and the category Sp of topological spaces 
and continuous maps) are cartesian, and their cartesian structure may be chosen 
in such a way that the forgetful functor to Set preserves it strictly. If C is 
cartesian, then the category [D,C] of functors D — C is cartesian for any D, 
with structure ‘defined pointwise’ from that of C (that is, the functors ‘evaluate 
at B’: [D,C] — C are cartesian for each B € ob D, and the structure may be 
chosen so that they preserve it strictly). If C is a locally smail cartesian category, 
then for any A € ob C the representable functor C (A, —): C — Set is cartesian; 
hence 


Lemma 1.2.2 For any small cartesian category C, there exists a conservative 
cartesian functor C — Set? (equivalently, C — Set/B) for some set B. 


Proof Let B be the set of all objects of C, and consider the functor whose 
Ath coordinate is C (A, —). O 


Although conservative, the functor constructed in Lemma 1.2.2 is not (nor- 
mally) full; but it can be made so by considering its codomain to be a functor 
category rather than a slice category of Set: 


Lemma 1.2.3 For any locally small cartesian category C, there is a category D 
(which may be taken to be small if C is) and a full and faithful cartesian functor 
C — [D, Set}. 


Proof Take D = C° and use the Yoneda embedding. O 


Both 1.2.2 and 1.2.3 can be used as the basis for ‘metatheorems’ which assert 
that any statement (of a prescribed type) which can be expressed in the first- 
order language of cartesian categories, and which is true is all slices of Set (or 
in all Set-valued functor categories, as the case may be), is true in all cartesian 
categories. We shall not explore this aspect of category theory in detail here. 

The hypothesis of smallness in Lemma 1.2.2 can be relaxed: all we need is a 
set B C ob C such that the functors C (A, —), A € B, are jointly conservative. 
Such a set will be called a generating set for C; if B is a singleton {A}, we call A 
a generator for C. We shall also wish to consider the weaker condition that the 
functors C(A, —), A E€ B, are jointly faithful; in this case we call B a separating 
set (and, if B = {A}, we call A a separator) for C. 
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The notions of separating and generating set can be defined in elementary 
terms (and without assuming local smallness of C). First we note: 


Lemma 1.2.4 Suppose C has equalizers, and F:C — D preserves them. Then 
F is conservative iff it ‘preserves properness of subobjects’, i.e. iff, given a 
monomorphism f in C, F(f) invertible implies f invertible. 


Proof The condition is clearly necessary. Conversely, it implies that F is 
faithful, since given a parallel pair (f,g) in C we have f = g iff the equalizer 
of (f,g) is an isomorphism. Now if f is any morphism of C such that F(f) 
is an isomorphism, then given any parallel pair (g,h) with fg = fh we have 
F(g) = F(h), whence g = h; so f is monic, whence f is an isomorphism by the 
given condition. Q 


Thus, in the absence of any set-theoretic assumptions on a cartesian 
category C, we may define a family G of objects of C to be generating if, given 
a monomorphism f: A — B in C, the condition that every morphism G — B 
with G € G factors through f forces f to be an isomorphism. We leave to the 
reader the task of formulating an elementary definition of separating family (if he 
has not already seen it). In a category C with equalizers, a generating family is 
always a separating family, by the argument contained in the proof of 1.2.4; the 
converse holds if C is balanced, but not in general. 


Example 1.2.5 It is easy to see that the category Sp of topological spaces 
and continuous maps has a single separator, namely the 1-point space. However, 
it does not have any generating set of objects. To see this, note that for any 
infinite cardinal x we can find a set X and two topologies O,,O2 on X such that 
©, C Oz but their restrictions to any subspace of X of cardinality less than x 
coincide. (For example, we may take X to be the ordinal A+ 1, where A is a limit 
ordinal of cofinality at least x, ©, to be the usual order topology on X and © to 
be generated by ©, together with the singleton {A}.) Thus, if G = {G; | i € I} is 
any set of objects of Sp, then by taking « larger than the cardinalities of all the 
G; we obtain a monomorphism (namely the identity map (X,O2) —> (X,0,)) 
which is not an isomorphism, but such that this fact cannot be ‘detected’ by any 
of the Gj. 


The forgetful functor C/B — C is not cartesian (unless B is a terminal object 
of C), but it does preserve pullbacks (and, more generally, all limits of connected 
diagrams) and even creates them. Since C/B always has a terminal object, we 
deduce that it is cartesian provided C has pullbacks; but the converse is also true, 
since pullbacks over B in C are ‘the same thing as’ products in C/B. That is, 


Lemma 1.2.6 A category is locally cartesian iff it has pullbacks. o 


In particular, the property of cartesianness is stable under slicing. 


Example 1.2.7 There are not many (naturally arising) examples of locally 
cartesian categories which fail to be cartesian, but there is one which will be 
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of importance later on, namely the category LH of topological spaces and local 
homeomorphisms. This has the same objects as Sp, but its morphisms are those 
continuous maps f: X — Y such that each point x € X has an open neighbour- 
hood which is mapped homeomorphically by f onto an open neighbourhood of 
f(z). (An equivalent characterization is that both f and the diagonal embedding 
X — X xy X are open maps; cf. C3.1.15.) A pullback in Sp of a local homeo- 
morphism is a local homeomorphism (this is easily seen from the definition, but 
may also be proved using the second characterization above, and the fact that 
open maps are stable under pullback); from this it can be shown that LH has 
pullbacks, and the inclusion functor LH — Sp preserves them (cf. C1.3.2(iv)). 
So by Lemma 1.2.6 each slice category LH/X is cartesian, and the inclusion 
LH/X — Sp/X is a cartesian functor. However, LH does not have a terminal 
object, as may be seen by the following argument: for each infinite cardinal x, 
there exists a space X, (for example, the product of A copies of a discrete two- 
point space, where 2 > x) in which every nonempty open set has cardinality at 
least x; now any space to which X, maps by a local homeomorphism — in par- 
ticular, a terminal object of LH, if it exists — must have cardinality at least x. 
(In contrast to the above, LH does have binary products; hence it also has 
equalizers, but they are not preserved by the inclusion LH — Sp - to obtain 
the equalizer of a pair of morphisms in LH, one must take the interior of the 
subspace which is their equalizer in Sp. As an example of a pair of local home- 
omorphisms whose equalizer in Sp is not an open subspace, we may take the 
identity mapping on R and the map z > —z.) 


Let f: A — B be a morphism in an arbitrary category C. We have seen that 
C/A is isomorphic to a slice category of C/B, and so we have a forgetful functor 
C/A — C/B, whose effect on objects is simply the map g++ fg; we shall denote 
this functor by Ly. 


Lemma 1.2.8 C is locally cartesian iff Us has a right adjoint f*, for every 
morphism f of C. 


Proof If C has pullbacks then we define f*(g), for an object g of C/B, to be 
the left. vertical map in the pullback square 


PS C 


A ———— B 


It is straightforward to verify that this defines a functor C/B — C/A, and that 
it is right adjoint to Ef. Conversely, suppose f* exists for every f. Then, given 
a pair of morphisms (f, g) with common codomain, the counit of the adjunction 
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(Es 4 f*) yields a morphism from the domain of f*(g) to the domain of g making 


the square 
SSS 


f «| 
f 


e ——_> e 


commute, and the universal property of the adjoint says that this square is a 
pullback. o 


We note that the assignment f +> Ðp is itself functorial; that is, BB, = 
ÈX fg for any composable pair (f, g). In general, the assignment f +> f* is not 
functorial in this strict sense; we cannot expect our canonical choice of pullbacks 
in C to be such that the functors (fg)* and g* f* are exactly equal. However, they 
are canonically naturally isomorphic (being right adjoints to the same functor). 
We shall explore this point further in Chapter B1. 

Just as all finite limits can be constructed from pullbacks and a terminal 
object, we can construct more complicated finite limits from pullbacks alone. 
The class of finite limits which can be constructed from pullbacks has been 
characterized by R. Paré [931]: they are the limits over finite simply connected 
categories. (A category C is said to be simply connected if the groupoid reflection 
of C —i.e. the category obtained by freely adjoining inverses for all the morphisms 
of C — is a trivial connected groupoid; equivalently, C is simply connected if every 
functor from C to a groupoid is naturally isomorphic to a constant functor.) 
Clearly, this class does not include ail limits over finite connected categories — 
the ‘parallel-pair’ category which gives rise to equalizers is connected but not 
simply connected — but in fact we may construct all finite connected limits from 
pullbacks and equalizers. However, when we turn to the preservation of limits 
by functors, there is a strong tendency for those which preserve pullbacks to 
preserve equalizers as well: the following is a well-known folk-theorem, but does 
not often appear in print, so we feel it is worth stating explicitly. 


Lemma 1.2.9 Let C be a cartesian category, and T:C — D a functor. If T 
preserves pullbacks, then it preserves all finite connected limits. 


Proof We may factor T as 


5 
EE EL, 


where 1 is the terminal object of C, and T(A) is the image under T of the 
unique morphism A — 1. Since Er, creates pullbacks, T preserves them; but it 
also preserves the terminal object by construction, and hence preserves all finite 
limits. But © preserves all connected limits which exist in D/T1, and so T 
preserves finite connected limits. o 
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Note that Lemma 1.2.9 does not require any hypotheses on the existence of 
limits in D. However, the hypotheses on C cannot be weakened to the existence of 
finite connected limits: as we have already seen, the inclusion functor LH — Sp 
provides an example of a functor which preserves pullbacks but not equalizers, 
even though both exist in LH. 

Although, as already mentioned, not all equalizers are simply connected 
limits, there is an important class of them which are, namely coreflexive equal- 
izers. We have already met reflexive coequalizers in Theorem 1.1.2; dually, we 
say a parallel pair f,g: A 3 B is coreflexive if there exists d: B — A such that 
df = dg = 14. The equalizer of a coreflexive pair may also be regarded as the 
limit of a diagram of shape €, where € is the finite category with two objects A 
and B and five non-identity morphisms f,g: A 3 B, d: B — A and u, v: B 3 B 
satisfying the equations df = dg = 14, fd = u and gd = v (other composites 
being deducible from these). It is easily seen that £ is a simply connected cate- 
gory, since it has a terminal object A; hence, by Paré’s theorem quoted above, 
coreflexive equalizers may be constructed from pullbacks alone. But in fact we 
can be more explicit: 


Lemma 1.2.10 Let f,g: A= B be a coreflexive pair in a category C. Then a 
morphism e: E — A is an equalizer of f and g iff the square 


€ 


E ————> A 
| | 
f 
A ————> B 


is a pullback. 


Proof Given a parallel pair h,k: C = A, the equation fh = gk implies h = 
dfh = dgk = k, where d is a common splitting for f and g; so a pullback of f 
against g has the same universal property as an equalizer of f and g. Q 


Coreflexive equalizers plus finite products suffice to construct all finite limits: 
this may be proved by observing that the standard construction of an arbitrary 
finite limit as an equalizer of a pair of morphisms between products in fact 
requires the equalizer of a coreflexive pair. But we may also note that, if C has 
finite products, then the equalizer of f,g: A 33 B may also be obtained as the 
equalizer of (14, f), (14, g): A 3 A x B; and the latter pair is coreflexive, with 
common splitting given by the first projection. 

Another important property of coreflexive equalizers is that, because the 
diagonal functor € — € x £ is initial (cf. B2.5.12), we may reduce the 
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computation of limits of shape € x € to that of limits of shape €. Explicitly, 
this means the following: 


Lemma 1.2.11 Suppose given a diagram 


A, —— B, Z2 Cı 


Ag — B 2—2 C2 


As —— B; 23 0; 


which commutes in the appropriate sense (e.g. each of the four composites By > 
Cz — C3 is equal to the corresponding composite Bz —> B3 — C3), and in which 
the rows and columns are equalizers and the parallel pairs are all coreflexive. 
Then the diagonal A, — By = C3 is an equalizer. 


Proof It is clear that the two composites along the diagonal are equal; so 
suppose we are given a morphism D — Be equalizing B2 = C3. By composing 
with a common splitting for Cz = C3, we see that D — By also equalizes 
B2 =3 C2, and hence factors uniquely through A, — Bz. Similarly, D > Bə 
equalizes B2 — B3; so its factorization through Az equalizes Ap = A3, since 
A3 — B; is monic, and hence in turn factors uniquely through A; —> As. So we 
have shown that A; — Bz has the universal property of an equalizer. 0 


Corollary 1.2.12 Suppose given a bifunctor Q: C x C — C such that, for each 
A € ob C, the functors A ® (—) and (—) @ A preserve coreflerive equalizers. 
Then, if A; > B; 3 C; (i = 1,2) are corefierive equalizer diagrams in C, so is 
the diagram A; Q A2 > Bı ® Bg 3 C1 Q C2. 


Proof Apply 1.2.11 to the diagram 
A, ® Az —> A, ® Bo aS A; ® Ca 


B, 8 Az —> Bı ® B2 Z3 Bi 8 C2 


Cy, ® A —> C1 ® Bp Z3 C18 C2 
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In practice, we shall most commonly need the duals of 1.2.11 and 1.2.12, 
referring to reflexive coequalizers rather than coreflexive equalizers. Note in par- 
ticular that 1.2.12 contains the result that reflexive coequalizers commute with 
finite products in Set (or, more generally, in any cartesian closed category; cf. 
Section A1.5 below). 

Another class of colimits which are well known to commute with finite prod- 
ucts (and indeed all finite limits) in Set is the class of filtered colimits. We shall 
prove an ‘internal’ version of this commutation result in Section B2.6; but we 
note here that the class of all colimits which commute with finite products in Set 
has been characterized by C. Lair [660] as the class of sifted colimits. Moreover, 
all colimits in this class can be constructed from filtered colimits and reflexive 
coequalizers; see [26]. 


A1.3 Regular categories 


Like many writers on category theory, we shall be deliberately ambiguous in our 
use of the word ‘subobject’: according to the context, a subobject of an object A in 
a category C will mean either a particular monomorphism of C with codomain A, 
or an isomorphism class of such monomorphisms (the notion of isomorphism 
being that of the category C/A). For the present, let us adopt the former view, 
and write Sub(A) (or, if necessary, Sube(A)) for the full subcategory of C/A 
whose objects are the subobjects of A. This category is of course a preorder; we 
follow the usual custom of denoting its unique protomorphism by <. If C/A is 
cartesian, so is Sub(A) (and the inclusion Sub(A) — C/A is a cartesian functor); 
again, we shall follow tradition by calling products in Sub(A) intersections, and 
denoting them by N rather than x. (We shall also fall into the common abuse of 
denoting a subobject of A by the name of its domain, A’ say, leaving the reader’s 
imagination to supply a name for the actual monomorphism A’ — A.) 

A pullback of a monomorphism is a monomorphism; so if C is (locally) carte- 
sian the functors f*:C/B — C/A, defined in Lemma 1.2.8, restrict to functors 
Sub(B) — Sub(A), which we shall again denote by f* (or occasionally by f~'). 
However, the functor © does not similarly restrict, unless f itself happens to be 
a monomorphism. We shall say that C has images if we are given an operation 
assigning to each morphism f of C a subobject im f of its codomain, which is a 
least (in the sense of the preorder <) subobject of cod f through which f factors. 


Lemma 1.3.1 For a category C, the following are equivalent: 


(i) C has images. 
(ii) For each object A of C, the inclusion Sub(A) — C/A has a left adjoint im. 


(iii) (if C has pullbacks) For each morphism f:A—B of C, the pullback 
functor f*: Sub(.B) — Sub(A) has a left adjoint 3p. 
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Proof The equivalence of (i) and (ii) is immediate from the definition. For 
(ii) = (iii), we define 3p to be the composite 


> Š 
Sub(A) —> C/A —Ż> C/B —™s sub(B) 


and verify that it is left adjoint to f*. For (iii) => (ii), we note that applying 4, 
(if it exists) to the terminal object 14 of Sub(A) yields an image of f. O 


The canonical morphism dom f — dom(im f) which is the unit of the 
adjunction of 1.3.1(ii) has the property that it cannot factor through any proper 
subobject of its codomain. We call such a morphism a cover; more generally, a 
family of morphisms with common codomain (A, say) is called a covering family 
if the only subobjects of A through which every morphism in the family factors 
are isomorphisms. (We may use this notion to rephrase our definition of a gen- 
erating family, given in the last section: a family G of objects is generating iff, 
for each object B, the family of all morphisms with codomain B and domain in 
G is covering.) In diagrams and elsewhere, we shall use the notation f: A-> B 
to denote the fact that f is a cover. 


Lemma 1.3.2 Let C be a category with pullbacks, f: A— B a morphism inC. 
The following are equivalent: 


(i) f is a cover. 


(ii) f is orthogonal to the class of monomorphisms in C; i.e., given a 
commutative diagram 


f 


yt 
g h 
Cees) 


where m is monic, there is a unique k: B — C with kf =g and mk =h. 


(iii) (if covers in C are stable under pullback) The functor f*:C/B — C/A is 
conservative. 


Proof (i) < (ii): Condition (i) is a special case of (ii), in which the morphism 
h in the diagram is required to be the identity morphism on B. Conversely, the 
existence of the commutative square says that f factors through the subobject 
h*(m) of B; so if f is a cover this subobject must be an isomorphism, which 
means that h factors (uniquely, since m is monic) through m. Writing k: B > C 
for this factorization, we have mk f = hf = mg, whence kf = g since m is monic. 

(i) $ (iii): If m is any subobject of B through which f factors, then f*(m) = 
la = f*(1g), so if f* reflects isomorphisms then m must be an isomorphism. 
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Conversely, suppose f is a cover; let g: C — B be an object of C/B, and let 


h 
P —— C 
f 
A ——~+ B 
be a pullback. Since h is a cover, the pullback along it of any proper subobject of 
C is a proper subobject of P; but since the forgetful functor C/B — C preserves 
and reflects monomorphisms, this says that f* maps proper subobjects of g to 


proper subobjects of k. Since f* also preserves equalizers (having a left adjoint), 
it is conservative by 1.2.4. o 


Morphisms with the property described in 1.3.2(ii) are sometimes called 
strong epimorphisms or extremal epimorphisms; but we shall not need any special 
name for them, since we shall only consider this property in cartesian categories. 
Note that, in a category with equalizers, a cover f: A —> B is necessarily an 
epimorphism, since it cannot factor through the equalizer of any distinct pair 
of morphisms with domain B. In the opposite direction, a regular epimorphism 
(i.e. one which occurs as a coequalizer) is a cover, because any factorization of 
it through a subobject of its codomain can be shown to be a coequalizer of the 
same pair of morphisms. 

The mere existence of images in a category is not sufficient for many of the 
purposes for which we want to use them; we need to know that they are stable 
under pullback, i.e. that if f and g are two morphisms with common codomain 
then g*(im f) S im(g*(f)) in Sub(dom g). Since monomorphisms are always 
stable under pullback, this is clearly equivalent to saying that covers are stable 
under pullback, as in 1.3.2(iii). We say a category C is regular if it is cartesian 
and has images, and covers in C are stable under pullback. A functor between 
such categories is called regular if it preserves finite limits and covers (and hence 
also images, since a cartesian functor preserves monomorphisms). 

The following property of images in a regular category will be needed in 
Section D1.3. 


Lemma 1.3.3 Let f: A — B be a morphism in a regular category. Then, for 
any subobjects A’ — A and B' — B, we have 35(A'N f*(B’)) = 3p(A') NB’ in 
Sub(B). 
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Proof We may form the diagram 


A'N f*(B) ——— > 3, (A’) NB’ 


f*(B’) da 5 aui B’ 


f 


A ——— B 


in which the front, left and right faces are pullbacks; so the back face is also a 
pullback, and hence its top edge is a cover. But the diagonal of the right face is 
a monomorphism, so 3s(A’) N B’ is the image of the composite morphism from 
top left to bottom right. O 


The isomorphism in the statement of Lemma 1.3.3 is sometimes called 
Frobenius reciprocity; we shall meet it again, in a more general context, in 1.5.8 
below. 

Regularity, like cartesianness, has good stability properties: if C is regular, 
then so are any functor category [D,C] and any slice category C/B. Moreover, 
the functors ‘evaluate at B’: [D,C] — C are regular (which tells you how to prove 
the first assertion) and the forgetful functor C/B — C preserves covers (which 
tells you how to prove the second). A preorder is regular iff it is cartesian: 
every morphism in a preorder is monic (and epic), and so the only covers are 
isomorphisms. 

The category Set is regular: covers are surjective functions, and images are 
the usual set-theoretic ones. The same remarks apply to Gp or, more generally, 
to any category monadic over Set. Note that in Set and Gp the covers coin- 
cide with the epimorphisms, but in other cases they do not ~ for example in 
the category Mon of monoids, the inclusion of the additive monoid of natural 
numbers in the group of integers is an epimorphism which is not a cover. The 
category Sp has images: monomorphisms are injective continuous functions, and 
covers are surjections X — Y such that Y is topologized as a quotient space of 
X (these are exactly the regular epimorphisms in Sp, though they are not all 
the epimorphisms). But Sp is not regular, because the class of quotient maps 
is not stable under pullback (cf. [276]). However, LH is locally regular: because 
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local homeomorphisms are open maps, it is easily seen that any surjective local 
homeomorphism is a cover (in Sp, and hence also in LH); and any local home- 
omorphism can be factored as a surjective local homeomorphism followed by an 
open inclusion. 

The reader will have observed from the above examples that, whilst covers do 
not always coincide with epimorphisms, they have a strong tendency to coincide 
with regular epimorphisms. Indeed, we have 


Proposition 1.3.4 In a regular category, the covers are exactly the regular 
epimorphisms. 


Proof We have already noted that regular epimorphisms are always covers. 
Conversely, let f: A —> B be a cover in a regular category, and let (a,b): R 3 A 
be its kernel-pair (i.e. the pullback of f against itself). We shall show that f is 
a coequalizer of a and b. Let c: A — C be any morphism with ca = cb, and let 


„h 
TE EE TE E es 


be the image factorization of (f,c): A— B x C. We shall show that g is an 
isomorphism, so that hg™!: B — C is a factorization of c through f (which is 
clearly unique, since covers are epimorphisms). To this end, it suffices to show 
that g is monic, since the cover f factors through it. So suppose (k,l): E 3 D 
are such that gk = gl. Form the pullback 


P ps E 
(m,n) (k,l) 
dxd 


Ax A—>DxD 


Now fm = gdm = gkp = glp = gdn = fn, so (m,n) factors through (a,b) by a 
morphism q: P — R. Thus we have 


hkp = hdm = cm = caq = cbq = cn = hdn = hip. 


But d x d is a cover, since it is the composite of la xd: Ax A—~AxD 
and dx 1p: Ax D— DxD, both of which are pullbacks of d (it is easy 
to verify that a composite of covers is a cover, using 1.3.2(ii)), and hence 
p is a cover and in particular epic. So we deduce that hk = Al, and hence 
(g, h)k = (g, h)l: E — B x C. But (g,h) is monic by definition, so k = 1, which 
completes the proof. O 
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Earlier writers on regular categories (for example, [68], [419]) tended to use 
regular epimorphisms rather than covers in formulating the definition. The above 
lemma shows that their definition is equivalent to the one we have adopted: 


Scholium 1.3.5 Let C be a cartesian category with coequalizers. Then C is 
regular iff regular epimorphisms are stable under pullback in C. 


Proof One direction follows directly from 1.3.4. Conversely, suppose regular 
epimorphisms are stable under pullback. Given an arbitrary morphism f: A — B 
in C, form its kernel-pair (a,b): R = A and the coequalizer d: A — D of a and 
b; then since fa = fb we have an induced factorization f = gd. Arguing 
(almost) exactly as in the proof of 1.3.4, we may show that g is monic; thus 
we have factored f as a regular epimorphism (in particular, a cover) followed 
by a monomorphism, and so C has images. Moreover, if f itself is a cover, then 
the monic part g of the above factorization is an isomorphism, and so f is a 
coequalizer of a and b; hence covers are stable under pullback in C, since they 
coincide with regular epimorphisms. o 


Of course, a regular category need not have coequalizers for arbitrary pairs of 
morphisms; the most we can deduce from the definition is that it has coequalizers 
for those pairs which occur as kernel-pairs (and we note that the proof of 1.3.5 
still works if we assume only that these coequalizers exist). A pair of morphisms 
which occurs as a kernel-pair has a number of special properties of interest: 


Definition 1.3.6 Let (a,b): R=3 A be a parallel pair of morphisms in a 
cartesian category. 
(a) We say (a,b) is a relation if (a,b): R — A x A is monic. 


(b) We say (a,b) is refiexive if there exists r: A — R with ar = br = 14 
(cf. 1.2.2). 


(c) We say (a,b) is symmetric if there exists s: R — R with as = b and bs = a. 
(d) We say (a,b) is transitive if there exists t: P + R, where P is the pullback 


q 


P ———— R ; 
b 
R———> A 


such that at = ap and bt = bq. 


(e) We say (a,b) is an equivalence relation if it has all four of the above proper- 
ties. (Note that if (a,b) is a relation, then the morphisms r, s and t which 
verify the other three properties are unique if they exist.) 
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It is easily verified that the kernel-pair of any morphism is an equivalence 
relation. We say an arbitrary equivalence relation is effective if it occurs as a 
kernel-pair; in a regular category, an effective equivalence relation is necessarily 
the kernel-pair of its own coequalizer. By an effective regular category, we mean 
a regular category in which every equivalence relation is effective. (The old name 
for this is ‘exact category (in the sense of Barr)’, but the word ‘exact’ has been 
so badly overworked by category-theorists that it is now best avoided, except in 
the context of abelian categories.) 

Not every regular category is effective. Consider the category TF of torsion- 
free abelian groups; it is easily verified that the inclusion functor from TF to 
the category Ab of all abelian groups has a left adjoint and preserves regular 
epimorphisms, from which one may deduce that TF is regular and cocartesian. 
But the group 


R= {(a,b)€Zx Z|a=bmod 2}, 


together with its two projections to Z, is an equivalence relation in TF (because 
it is one in Ab) which is not effective; its coequalizer in TF is the unique 
homomorphism Z — 1, whose kernel-pair is the whole of Z x Z. The reason is, of 
course, that we have excluded the torsion group Z/2Z, which ‘ought to be’ the 
coequalizer of R = Z, from membership of the category TF. In Section A3.3 
we shall develop a technique for ‘effectivizing’ a regular category, essentially 
by adjoining the ‘missing quotients’ which should correspond to non-effective 
equivalence relations, which if applied to the particular category TF will produce 
Ab (at least up to equivalence). Note that Ab itself is effective regular, as 
is any category monadic over Set. Also, if C is effective regular, so are any 
functor category [D, C] and any slice category C/B: kernel-pairs and coequalizers 
of equivalence relations are both constructed pointwise in [D, C] and created by 
the forgetful functor C/B > C. 


Example 1.3.7 Not all equivalence relations in LH are effective. Let S! denote 
the unit circle, let R be the disjoint union of copies of S! indexed by the integers, 
let a: R — S! be the codiagonal map (the map whose restriction to each copy 
of S! in R is the identity) and b: R + S! the map which rotates the nth copy 
of S! in R by n@, where 8 is an irrational multiple of 7. Then it is easily verified 
that (a and b are local homeomorphisms and) (a,b) is an equivalence relation, 
but there is no local homeomorphism f: S! — X even satisfying fa = fb, let 
alone having (a,b) as its kernel-pair. However, this is all that can go wrong: if an 
equivalence relation (a,b) in LH is coequalized by some local homeomorphism, 
it is straightforward to verify that the coequalizer of (a, b) in Sp lies in LH, and 
that (a,b) is its kernel-pair (in Sp, and hence in LH). So we deduce that the 
slice categories LH/X are all effective regular. 
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In a regular category, we shall normally consider projectivity with respect to 
covers; that is, we shall say an object P is projective if, given any diagram 


P 


f 


A ——— B 


where f is a cover, there exists h: P— A with fh = g. Because covers 
are stable under pullback, this is equivalent to saying that every cover with 
codomain P is split epic; if C is locally small, it is also equivalent to saying that 
C (P, —): C > Set is a regular functor. 

The support of an object A in a regular category is the image of the unique 
morphism A — 1. We say A is well-supported if its support is 1, i.e. if A — 1 is 
a cover. And we say a regular category C is capital if 1 is a generator for the full 
subcategory of well-supported objects of C. 


Lemma 1.3.8 In a capital regular category, the terminal object is projective. 


Proof We have to show that every cover A —> 1 is split epic. If A & 1 there is 
nothing to prove; if not, then the two projections A x A= A are not equal and 
so (since A x A is well-supported if A is) there exists a morphism 1 — A x A 
not factoring through their equalizer A — A x A. Composing this with one of 
the projections yields a morphism 1 > A. E 


In Section D1.5 we shall develop a technique for embedding an arbitrary 
small regular category (conservatively) in a capital one; using this, and the 
Lemma just proved, we shall be able to show that every such category admits 
a conservative regular functor to Set/B for some B. (In fact this latter result 
has a strengthening, just as 1.2.2 was strengthened by 1.2.3: every small regular 
category admits a full and faithful regular functor to a functor category [D, Set] 
where D is small. But we shall not prove the stronger result in this book: it may 
be found in [68] or [145] — the latter gives a proof very similar in spirit to the 
arguments which we shall deploy in Section D1.5.) 

We conclude this section by describing a construction for embedding an 
arbitrary cartesian category C in a regular category Reg(C). 

The basic ideas underlying the construction are 


(i) there should be a full and faithful functor 7: C + Reg(C), preserving the 
cartesian structure of C (though not the regular structure, if C happens to 
have it already); 


(ii) each object of Reg(C) should be the image of a morphism of the form 
I(f); 
(iii) each object of the form I(A) should be projective in Reg(C). 
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(We shall give some justification for the third requirement in 1.3.10(a) below.) 
These ideas lead us to the following definition: the objects of Reg(C) are the 
morphisms of C, and morphisms (fı: Ai > Bı) > (f2: A2 —> B2) in Reg(C) 
are equivalence classes of morphisms g: A; — A2 in C such that fog coequalizes 
the kernel-pair of fı, two such morphisms gı and g> being equivalent if they 
have equal composites with f (equivalently, factor through its kernel-pair). We 
write [g] for the equivalence class of g. It is straightforward to verify that this 
equivalence relation is respected by composition in C, and hence Reg(C) is a 
category. (We could equivalently think of the morphisms fı — fz in Reg(C) 
as individual morphisms A, — Bəz in C which coequalize the kernel-pair of fı 
and admit at least one factorization through fo; but the description given first 
is more convenient for defining composition.) The functor 7: C — Reg(C) sends 
an object A to the identity morphism 14; it is again straightforward to verify 
that this is full and faithful. It is also easy to see that [(1¢) is a terminal object 
of Reg(C). 

To form the pullback of a pair of morphisms [gi]: fı > f3, [g2]: fo > f3 in 
Reg(C) (where fi: A; — B; in C), we form the pullback 


pee a 


p2 fag 


392 

Ag Le B3 

in C, and define f4 to be (fıpı, fop2): P — B, x Bo. (Note that this construction 
depends only on the equivalence classes [g1] and [g2], and not on the particular 
representatives gı and g2.) It is clear that pı and p2 represent morphisms f4 — fı 
and f4 — fo in Reg(C), and that the square 


[pı] 


jot 


[p2] [g1] 


[92] 
fg fs 
commutes; to show that the square has the universal property of a pull- 
back, suppose given [hi]: fs > fi and [h2]: fs > f2 with [gii] = [g2h2]. Then 
fagihi = fagohe, so the pair (hy, h2) factors through the pullback P (say by 
h3: As — P); and hg defines a morphism [h3]: fs — fa making the appropriate 
diagram commute. Moreover, it is easy to verify that the equivalence class [hg] is 
independent of the choice of representatives hı and hz, and that it is the unique 
factorization of ([h1], [k2]) through ({pi], [p2]) in Reg(C). 


Al1.3 REGULAR CATEGORIES 27 


Next, we note that a morphism [g]: fı — fz in Reg(C) is monic iff the kernel- 
pair of fog coincides with that of fı. We claim that if a morphism [e]: fı > fo 
is representable by a split epimorphism e in C, then it is a cover in Reg(C) (this 
is not a necessary condition for being a cover, but we shall see that it is satisfied 
by the first half of the image factorization of an arbitrary morphism of Reg(C), 
as constructed above); and this condition is stable under pullback, for if we have 
a pullback square 


(fa: P > By x B2) ua (fi: Ay > Bı) 


[p2] [e] 


(f3: Ag —_ B3) EE (fa: Ag = Bo) 


and e is split (in C) by m: Ap — Aj, then pı is split by the factorization of 
(mg, lA): A3 — A; x A3 through the pullback P. To prove the claim, suppose 
e is split epic in C and we have a diagram 


h 
(fi: Ay — By) 2 (f3: A3 > B3) 


[el [9] 


(f2: Ag = B2) 


where [g] is monic. Then we claim that the composite hm: Ag > A3 (where m 
is any splitting for e in C) defines a morphism fz — f3 in Reg(C). For if (a,b) 
denotes the kernel-pair of f2, then we have 


faghma — foema = fea = fod = foemb = foghmb 


since the commutativity of the diagram says that foe = fogh, and hence 
f3hma = f3hmb since [g] is monic, and so the kernel-pair of fag coincides with 
that of f3. It is now straightforward to verify that [g] and [hm] are inverse 
isomorphisms in Reg(C); so [e] is a cover, as claimed. 

Now we define the image factorization of an arbitrary [g]: fı — fo to be 


lA, 
(fi: Ay > Bi) Hal (fog: Ay > Be) ack (f2: A2 > B2) 


(again, note that this construction depends only on the equivalence class of g); 
it is clear from what we have said above that the first factor of this factorization 
is a cover, and the second factor is monic. So every morphism of Reg(C) admits 
a cover-monic factorization; and this factorization is stable under pullback. 
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We have thus proved most of 


Theorem 1.3.9 For any cartesian category C, there exists a regular category 
Reg(C) and a full cartesian embedding I: C — Reg(C) such that all objects I(A) 
are projective in Reg(C), and any cartesian functor F:C + D fromC to a reg- 
ular category D extends, uniquely up to canonical isomorphism, to a regular 
functor F: Reg(C) — D. 


Proof We have shown that Reg(C) is a regular category and J is a full embed- 
ding; also J preserves the terminal object by construction, and whilst it does not 
preserve pullbacks ‘on the nose’ as we constructed them in Reg(C), it is easy to 


see that if 
P ——> A, 


Ag ————> A3 


is a pullback in C, then Z(P) is isomorphic to the pullback (P — A; x A2) as 
constructed above in Reg(C) (the isomorphism being represented by the identity 
morphism on P, in both directions). For the projectivity assertion, suppose given 
a diagram 

I(Aj) 


[g] 


(f2: A2 > Bə) eh (f3: A3 — Bs) 


in Reg(C), where [e] is a cover. On replacing [e] by the first half of its cover-monic 
factorization if necessary, we may assume that e is actually a split epimorphism in 
C, with splitting m: A3 — Ag, say. Then mg represents a morphism J(A,) > f2 
in Reg(C), and we have [e][mg] = [g]. 

For the final assertion, we define F( f: A— B) to be the image of 
Ff: FA— FB in D, and if [9]: fı > fo is a morphism of Reg(C) we define 
F(|g]) to be the factorization of the composite 


PA Say eG 


through FA; > Ff, (which exists by the condition in the definition of morphisms 
of Reg(C), plus 1.3.4 and the fact that F preserves kernel-pairs). The fact that 
this definition is independent of the choice of the representative g follows easily 
from the fact that F( fz) — F.Bz is monic. The verification that F is a regular 
functor, and that it is (up to canonical isomorphism) the unique regular functor 
extending F, are easy exercises which we leave to the reader. E 
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Remarks 1.3.10 (a) If C is locally small, then the Yoneda embdding C — 
[C°P, Set] extends to a regular functor Reg(C) — [C°P, Set]. It is easy to see 
that this extension, like the Yoneda embedding itself, is full and faithful; and its 
image is weakly equivalent to the full subcategory of objects which are images 
in [C°P, Set] of morphisms between representable functors. So we could take 
the latter category as an alternative (non-elementary) definition of Reg(C) in 
the case when C is locally small. This also explains why it was reasonable to 
expect the objects in the image of J to be projective, since the representables 
are projective in [C°P, Set] (cf. 1.1.10). 

(b) It is easy to characterize the regular categories C which are equivalent to 
one of the form Reg(D) for some D; they are exactly those regular categories 
which ‘have enough projectives’ (that is, for every object A, there exists a cover 
P —> A with P projective), in which the full subcategory of projective objects 
is closed under finite limits, and in which every object admits a monomorphism 
into a projective. (The first condition does not imply the second; but it does 
imply that the full subcategory P of projective objects is weakly cartesian in the 
sense defined in (e) below, since for any finite diagram in P we may obtain a 
weak limit by taking a projective cover of the limit in C.) 

(c) If C is already regular, then 1.3.9 implies that it is a retract of Reg(C), 
since the identity functor on C extends to a unique regular functor Reg(C) —> C. 
Of course, the latter sends an object f: A — B of Reg(C) to the image of f inC. 
In general, this functor is very far from being an equivalence (cf. (d) below); but 
we note that it is so iff every object of C is projective (equivalently, every cover 
in C is split epic). For if the retraction Reg(C) — C is an equivalence, then its 
inverse J must preserve covers, whence all covers in C are split epic; conversely, 
if this holds, then an arbitrary object f: A— B of Reg(C) is isomorphic to 
I(B’), where B’ — B is the image of f in C. Thus, for example, we have 
Reg(Set,;) ~ Sets, and if we assume the axiom of choice (cf. Section D4.5), 
then Reg(Set) ~ Set. 

(d) The apparent simplicity of the construction of Reg(C) is somewhat decep- 
tive; it can produce a category very much ‘larger’ than the category C from 
which we start. As an example, we note that for any object A of C, the pre- 
order Subpeg(c)(I(A)) is equivalent to the preorder reflection of the slice category 
C/A, i.e. the quotient of the latter by the equivalence relation which identifies all 
parallel pairs of morphisms. For a morphism |g]: (f: B —> C) — I(A) is monic 
iff the kernel-pair of g coincides with that of f, and it is then isomorphic in 
Sub(I(A)) to [g]: g — I(A); conversely, every object of C/A gives rise to a sub- 
object of I(A) in this way, and we have [g] < [h] in Sub(/(A)) iff there exists 
a morphism g — h in C/A. Now if (for example) we take C to be the functor 
category [D, Set] where D is the finite category (e = e) (we may think of the 
objects of C as directed graphs (A = V), where V is the set of vertices of the 
graph and A the set of arrows, and the two functions assign to each arrow its 
source and its target), then it is not hard to see that, for any directed graph G, 
the preorder Subc(G) is equivalent to a small category (in the terminology to be 
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introduced in the next section, C is well-powered). But the preorder reflection 
of C contains a proper class of non-isomorphic objects, and thus Reg(C) is not 
well-powered. To see this, for each ordinal a define Ga to be the graph of the 
strict order-relation on a: that is, the vertices of Ga are the ordinals less than 
a and there is one arrow p — y iff 8 < y. Then it is easy to see that a graph 
morphism Ga — Gg corresponds to a strictly order-preserving function from a 
to 8; in particular, such morphisms exist iff a < 8. So Subgeg(cy(1) contains a 
copy of the ordered class On of all ordinals. 

(e) It is of interest that the construction of Reg(C) works (with only minor 
modifications to the description above), and produces a regular category, even if 
we merely assume that C is ‘weakly cartesian’ (i.e., for every finite diagram in C, 
there exists a ‘weak limit cone’ satisfying the existence clause in the definition 
of a limit, but not the uniqueness). We shall not pursue the matter here; the 
interested reader is referred to [231]. 

(f) It is also possible to combine the ‘regularization’ construction C > 
Reg(C) with the effectivization construction C ++ Eff(C) for regular categories, 
to be described in Section A3.3, so as to yield a ‘single-step’ construction of an 
effective regular category from a (weakly) cartesian category. Again, we shall not 
go into the details here, since it is sufficient to have the two separate construc- 
tions available (but see [221], and also 3.3.13(i) below). We note that there is 
a significant difference between the two constructions: regularization adds new 
images for all the morphisms of C (except for those which factor as a split epi- 
morphism followed by a monomorphism), and so even if C is already regular it 
may be very different from Reg(C) (cf. the example mentioned under (d) above), 
whereas effectivization preserves the coequalizers of all those equivalence rela- 
tions that are already effective, so that if C is effective regular then it is equivalent 
to Eff(C). 


Suggestions for further reading: Barr [68], Borceux [145], Carboni [220], 
Carboni & Celia Magno [221], Carboni & Vitale [231], Freyd & Scedrov [381], 
Pedicchio & Rosicky [944]. 


Al1.4 Coherent categories 


We have seen that the cartesianness of a category C implies the cartesianness of 
the preorders Sub(A), A € ob C; the next level of structure we require is that 
they should be cocartesian as well. Of course, for a preorder to be cocartesian, 
all we need is the existence of finite coproducts (which, as usual, we shall call 
unions in this context). For this, a sufficient condition (for a regular category C) 
is the existence of finite coproducts in C: if C has colimits of a particular kind, 
then so does C/A (and the forgetful functor C/A — C creates them), and if C is 
regular then Sub(A) is reflective in C/A (1.3.1(ii)) and thus inherits any colimits 
which exist in C/A. Explicitly, this says that we may form the union of two 
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subobjects m: A’ > A and m’: A” — A by forming the coproduct A’ + A” and 
then taking the image of the morphism A’ + A” — A induced by m and m’. 

However, as with image factorizations, the mere existence of unions in Sub(A) 
doesn’t get us very far; we want them to be stable under pullback. We define a 
coherent category to be a regular category in which each Sub(A) has finite unions 
and each f*: Sub(B) — Sub(A) preserves them. (Freyd and Scedrov [381] call 
such a category a pre-logos, and Makkai and Reyes [790] call it a logical category; 
our name is based on the fact that these categories have exactly the structure 
needed to interpret coherent logic — cf. D1.2.6.) 

Somewhat surprisingly, the assumption of coherence has implications for the 
existence and properties of colimits in C itself. We begin with the initial object: 
let 1 denote the terminal object of C, and let 0 be (the domain of) the smallest 
subobject of 1, i.e. the initial object of Sub(1). (This notation will be standard 
from now on.) 


Lemma 1.4.1 Jn a coherent category C, 


(i) any morphism with codomain 0 is an isomorphism; 
(ii) 0 is an initial object of C. 


Proof (i) By assumption, if A is any object of C, we have a pullback 


0 
1 
where 04 is (the domain of) the smallest subobject of A. But if A — 1 factors 
through 0 — 1, then the pullback of 0 — 1 along it is the whole of A; so this 
condition implies that A has no proper subobjects. And if A has a morphism to 
0, so does A x A; so the diagonal A — A x A is an isomorphism, or (equivalently) 
the two projections A x A — A are equal, or equivalently again A — 1 is monic. 
But this means that A is a subobject of 1 contained in 0; so A — 0 must be an 
isomorphism. 

(ii) From (i), it follows that the projection 0 x A — 0 is an isomorphism for 
any A, and hence there exists a morphism 0 — 0 x A — A for any A. But any 


two morphisms 0 = A must be equal, since their equalizer is an isomorphism. 
Oo 


04 ———-~> 


A ——> 


In any category, an initial object with the extra property described in 1.4.1(i) 
is called strict; thus Lemma 1.4.1 says that any coherent category has a strict 
initial object. (Note, incidentally, that the deduction of (ii) from (i) in 1.4.1 used 
only the cartesianness of C.) 
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Next, we look at binary unions. We shall frequently require 


Lemma 1.4.2 Let A, A2, A3 be three subobjects of an object A in a coherent 
category. The distributive law 


Ain (A2 U A3) x (A N Az) U (Ay N A3) 
holds in Sub( A). 


Proof The operation A; N (—) may be considered as the composite 


: in 
Sub(A) > Sub(A1) —“> Sub(A) 


where m is the name of the monomorphism A, — A (recall that Jm is just the 
restriction of Em if m is monic). But m* preserves binary unions by assumption, 
and Jm does so because it is left adjoint to m*. o 


The proof of the next result may appear somewhat complicated at first. It 
can be (at least formally) simplified by recasting in terms of categorical logic, 
once we know that coherent categories provide a sound interpretation of coherent 
logic (see D1.3.14). 


Proposition 1.4.3 Let mı: A, A and mz: A2— A be subobjects in a 
coherent category C. Then the square 


nı 
A, Ag >——> A; 


n2 pı 


Peres ee 


is both a pullback and a pushout in C. 


Proof That the square is a pullback is true in any cartesian category 
with unions, since it is obtained by applying the pullback-preserving functor 
Sub(A) — C/A — C to a pullback in Sub(A). The interest of the proposition 
thus lies in the fact that it is a pushout, since the inclusion Sub(A) — C/A does 
not generally preserve colimits. 

To prove the result, let us simplify our notation by writing B,C 
for A; N Ag, A, U Ag respectively, and suppose we are given a pair of 
morphisms fı: A > D, fz: A42 — D with fini = fong. The morphisms 
(pı, fi): A1 > C x D and (p2, f2): Ag ~ C x D are both monic, since p; and 
pz are; let (g, h): E + C x D be their union in Sub(C x D). We aim to show 
that g is an isomorphism, so that hg~!: C — D is a factorization of (fi, f2) 
through (pı, p2) — if so, it is clearly unique, since the equalizer of two distinct 
factorizations would be a proper subobject of C containing both A; and Ao. 
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Let m: C x D> C be the projection. Since 3+: Sub(C x D) — Sub(C) is left 
adjoint to 7*, it preserves unions, so 


4(9,h) = 4e(pi, fi) U dn (po, fe) = pi U p2 = lc; 


but this says exactly that g is a cover. So it suffices to show that g is monic. Let 
(k,l): F — E be two morphisms with gk = gl. Since E is the union of A; and 
Ag, we get two different decompositions of F as a union of subobjects: 


F = k* (A1) U k* (A2) =1°(A1) UP (A2) . 


By applying the distributive law (1.4.2) in Sub(F'), we obtain a decomposition 
of F as the union of the four subobjects k*(A;) N I*(A;) (i,j = 1,2). Our aim 
is to show that the equalizer of hk and hl is the whole of F, by showing that 
it contains each of these four subobjects; this will suffice to prove the result, 
since the pair {g, h) is monic and we must then have k = l. We shall consider 
the subobject k*( A1) M1*( Az) in detail; the other three are similar or easier. To 
simplify the notation, let us write G for k*(A,) NI*(Ag), and let v: G > Aj, 
v2: G — A» be the unique morphisms making the squares 


vı v 


p E AEE Gees A, 

5 ; |- 
k l 

E «e F — E 


commute. Now we have 
piwi = guv = gkw = glw = guzv2 = p202, 
so the pair (v1, v2) factors through the pullback (n1, n2) (say by z: G > B). But 
now hun, = fini = fana = hugne, so we have 
hkw = huv = huns = hugner = hugu = hlw, 
and so the equalizer of hk and hl contains w, as required. E 


Unions with the property that the square in the statement of 1.4.3 is a 
pushout have been called effective unions by Barr [78]. A particular case of 
1.4.3 is of interest: 


Corollary 1.4.4 Let A, and Áz be subobjects of an object A in a coherent 
category which are disjoint (i.e. such that A, N Az S 0). Then A, U Ag is a 
coproduct of A, and A2. 


Proof Pushouts under 0 are the same thing as coproducts. O 
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In a general category, a coproduct A+ B is called disjoint if the coprojections 
A— A+B, B- A+B are monic and their intersection in Sub(A + B) is an 
initial object of the latter. We call a coherent category positive if it has disjoint 
finite coproducts; by 1.4.4, this is equivalent to saying that any two objects 
of the category may be embedded as disjoint subobjects of some third object. 
Incidentally, when working in a category where coproducts are known to be 
disjoint, we shall usually denote them by the disjoint union sign II rather than 
the addition sign +. 

If F:C 4D is a coherent functor between coherent categories (i.e. a regu- 
lar functor which preserves finite unions) and C is positive, then F necessarily 
preserves finite coproducts as well. In general, however, even when a coherent 
category has finite coproducts they need not be disjoint, nor preserved by coher- 
ent functors. Any distributive lattice, considered as a preorder, is a coherent 
category (the distributivity is necessary, by 1.4.2), but it is not positive unless 
it is degenerate (i.e. satisfies 0 = 1). If C is a coherent category which is not 
a preorder (e.g. C = Set), then the inclusion Sub(1) — C/1 = C is a coherent 
functor which does not preserve coproducts. 

The category Set is coherent and positive. If C is coherent, so are any functor 
category [D,C] (unions of subobjects in [D,C] are constructed pointwise; they 
are preserved by pullback because the latter is also computed pointwise) and 
any slice category C/B (we have an isomorphism Subc;p(f) = Sube(dom f), 
which commutes with pullback functors); and positivity is inherited by both 
these constructions. LH (like Sp) has disjoint coproducts which are preserved 
by pullback along an arbitrary morphism; from this, and the remarks at the 
beginning of this section, it follows that LH/X is a positive coherent category 
for any X. However, categories which are monadic over Set, like Gp and Ab, 
tend not to be coherent; the two examples just cited have neither strict initial 
objects nor distributive subobject lattices. (Ab does have effective unions — as 
does any abelian category, for reasons entirely different from those adduced in 
the proof of 1.4.3 - but Gp does not: consider any two nontrivial subgroups of 
the nonabelian group of order 6.) 

If C is a positive coherent category and f: A — |]; B: is a morphism of C 
whose codomain is an m-fold coproduct, then the subobjects A; = f*(Bi) (1 < 
i < m) of A form an m-fold decomposition of A; that is, they are pairwise dis- 
joint, and their union is the whole of A. (We allow the possibility that some of the 
A; may be 0.) Using this idea, we may describe a (non-elementary) construction 
for ‘positivizing’ a given coherent category by freely adjoining disjoint coprod- 
ucts, as follows. Given C, we define Pos(C) to be a category whose objects are 
nonempty finite sequences (Aj, A2,...,An) of objects of C (we could allow the 
empty sequence as well, but since C already has a strict initial object we don’t 
need to). Here (Ai,...,Ay,) is to be thought of as the coproduct of the A; in 
Pos(C), so we need only describe those morphisms of Pos(C) whose domains are 
sequences of length 1, say f: (A) > (Bi,..., Bm); such an f is specified by an 
m-fold decomposition (A,,..., Am) of A and a family of morphisms f;: A; > B; 
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(1 <i < m) in C. Composition of morphisms in Pos(C) is defined in the obvious 
(but messy) way; we may regard C, up to isomorphism, as a full subcategory of 
Pos(C), whose objects are the sequences of length 1. (Note also that if a finite 
family of objects (Ai,...,Am) happens to have a disjoint coproduct in C, then 
this sequence is (canonically) isomorphic in Pos(C) to the singleton sequence 


(Lis 4:).) 


Proposition 1.4.5 Pos(C) is a positive coherent category. Moreover, the 
embedding C — Pos(C) is a coherent functor, and it is universal (in the ‘up-to- 
unique-isomorphism’ sense) among coherent functors from C to positive coherent 
categories. 


Proof This is all straightforward but tedious verification, and we omit most 
of it. The product of two objects (Ai,..., An) and (Bi,...,Bm) is the sequence 
of length mn whose entries are the products A; x B}; their coproduct is 
the sequence of length m + n obtained by concatenating them. The image 
of a morphism f: (Aj,...,An) — (Bi,...,Bm) may be written in the form 
(Ci,...,Cm) — (Bi,...,Bm), where C; € Sub(B;) is the union, for 1 <i < n, 
of the images of the morphisms from appropriate subobjects of A, to B}; in 
particular, we note that any subobject of (Bi,...,Bm) is isomorphic to one of 
the form (C1,...,Cm) where each C; € Sub(B;). (We also obtain a criterion for 
a morphism to be a cover, which we may use to verify that covers are stable 
under pullback in Pos(C).) If F: C — D is a coherent functor and PD is positive, 
we define an extension F: Pos(C) > D of F by setting 


n 


F(Ay,...,An) = [[ F4) 


i=l 


(and verify that this is a functor); but since any coherent functor Pos(C) — D 
extending F must preserve finite coproducts, it is canonically naturally isomor- 
phic to F. E 


Example 1.4.6 Let L be a distributive lattice, regarded as a coherent cat- 
egory. We may describe Pos(Z) explicitly as follows: its objects are finite 
sequences (li,...,in) of elements of L, and a morphism (ki,...,km) — 
(li;..- ln) may be represented as an n x m matrix (f:j) of elements of L, with 
the property that the jth column is an n-fold decomposition of k; for each j, and 
the entries in the ith row are all below l;, for each 7. Composition of morphisms 
in this case is simply ‘matrix multiplication’, with addition and multiplication 
of matrix entries interpreted as join and meet of elements of L. We shall meet 
(close relatives of) this category again in 2.6.4(e) and 3.2.2(b); for the present, let 
us note that in the particular case when L is the two-element lattice 2 = {0,1}, 
we obtain a category equivalent to the category Set, of finite sets and func- 
tions between them (since a matrix satisfying the conditions above defines a 
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function from {j | k; = 1} to {i | l; = 1}, sending j to the unique i such that 
fiz = 1). 


In, the particular case just described, Pos(Z) is effective as a regular cate- 
gory as well as positive as a coherent category; but in general the positivization 
process can destroy effectiveness. Any lattice L is effective as a regular category 
(vacuously so, since reflexivity forces each equivalence relation in L to be a pair 
of identity maps), but Pos(Z) is not effective unless Z is a Boolean algebra. If l is 
any non-complemented element of L, then the two morphisms (1,/,/,1) — (1,1) 
in Pos(L) defined by the matrices 


1 2 0 0 
0021 
102 0 
0? 01 


may be shown to define an equivalence relation which is not effective. Fortunately, 
however, the effectivization process which we shall describe in 3.3.10 does not 
destroy coherence or positivity. 

From the fact that Pos(2) ~ Sets, we may deduce a result which will be 
useful later. 


and 


Corollary 1.4.7 For any positive coherent category C, there is (up to isomor- 
phism) a unique coherent functor A: Sets — C. 


Proof There is clearly a unique coherent functor 2 — C, since such functors 
must preserve 0 and 1; so this is immediate from 1.4.5. oO 


The functor A is easy to describe explicitly: it sends an n-element set to 
the n-fold copower of 1 in C. It is worth noting that it is not merely coherent 
but cocartesian (that is, it preserves arbitrary coequalizers and not just those 
of equivalence relations); this is because the equivalence relation generated by a 
parallel pair in Set; may be constructed from the latter using finite limits, finite 
coproducts and image factorizations, all of which are preserved by A. 

A coherent category which is both positive and effective is called a pretopos. 
We note that a pretopos is ‘almost cocartesian’: it has finite coproducts and 
coequalizers of equivalence relations (and any coherent functor defined on it 
preserves these), but it may fail to have arbitrary coequalizers (and even if it 
does, a coherent functor defined on it may fail to preserve them — see D3.3.9 for 
a counterexample, but cf. also 1.4.19 below). A pretopos also has an important 
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class of pushouts: 


Lemma 1.4.8 In a pretopos, any pair of monomorphisms with common 
domain has a pushout. Moreover, if 


Me ae 
| 
k 
C ———— P 


is a pushout square with f and g monic, then h and k are monic, and the square 
is also a pullback. 


Proof Given f and g, it suffices to find an object D containing B and C 
as subobjects in such a way that their intersection is A; for then we can define 
P = BUC and appeal to Proposition 1.4.3. Let Do = BUC, and let u: B —> Do, 
v: C — Do be the coprojections; define 


(where A, and A» are both copies of A, which we have tagged for future 
reference), and let a,b: R — Do be the morphisms whose components are respec- 
tively (u,uf,vg,v) and (u,vg,uf,v). It is straightforward to verify that (a,b) is 


an equivalence relation; let q: Do — D be its coequalizer. Then (a,b) is the 
kernel-pair of g, and hence we may construct a diagram 


(1, f) 


As É 


A; WC > R — Dp 


(9,1) b q 
E EEE 5 ED y. 


in which all squares are pullbacks (and the unnamed maps are coprojections). 
So we need only verify that qu and qv are monic; but we have a similar diagram 
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of pullbacks 
B >——> BU aay B 
Bil A >——> R oe Do 
(1, f) b 
B >———> Do rS D 
which establishes that qu is monic, and similarly for qv. O 


Note that the object D constructed in the proof of 1.4.8, being covered by 
BIC, is actually the union of its subobjects gu: B — D and qu: C — D; thus 
it is itself the pushout P which we required. 


Corollary 1.4.9 In a pretopos, every monomorphism is regular (i.e. occurs as 
an equalizer) and every epimorphism is regular. In particular, a pretopos is a 
balanced category. 


Proof If f: A— B is monic, then by 1.4.8 we have a pullback square of the 


form 
f 
A>——> B 
B eee C 
from which it follows that f is an equalizer of g and h. The third assertion 
follows because an epic equalizer is necessarily an isomorphism; now if f is any 


epimorphism in C, then im f is both epic and monic, so it is an isomorphism 
and f is a cover, hence regular epic by 1.3.4. O 


A subobject A’ — A in a coherent category is said to be complemented if 
there exists A” »+ A such that A’M A” = 0 and A’U A” & A. By 1.4.2, the 
complement A” is unique up to isomorphism if it exists. We say a coherent cat- 
egory is Boolean if every subobject in it has a complement; more precisely, if 
we are given an operation ~ assigning to each monomorphism A’ — A a com- 
plement =A’ — A for it in Sub(A). Note that ~ is automatically a functor 
Sub(A)°P — Sub(A) (i.e. if A’ < A” then =A” < nA’), and its square ~~ is iso- 
morphic to the identity. It is clear that pullback functors f*: Sub(B) — Sub(A) 
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preserve complements (i.e. they commute up to isomorphism with ~), from which 
we deduce 


Lemma 1.4.10 Let f: A— B be a morphism in a Boolean coherent category. 
Then the functor f*: Sub(B) — Sub(A) has a right adjoint V;, as well as its left 
adjoint As. 

Proof Define Vy = 74,-: by the remarks above, this is right adjoint to ~f*-, 
which is isomorphic to f*. 0 


A coherent category C satisfying the conclusion of Lemma 1.4.10 (for every 
morphism f of C) is called a Heyting category (the Freyd-Scedrov name is ‘logos’; 
we have chosen a name which emphasizes the fact that these categories are gen- 
eralizations of Heyting algebras — see 1.5.11 below). A Heyting functor between 
Heyting categories is a coherent functor which commutes up to isomorphism with 
the right adjoints Yp. It follows easily from the proof of 1.4.10 that a coherent 
functor whose domain is Boolean must be a Heyting functor (even if its range 
is not Boolean). Further examples of Heyting functors are the pullback functors 
f*:C/B —> C/A, where f: A — B is a morphism in a Heyting category C; that is, 


Lemma 1.4.11 Let 
E EEN A 


P 
kf 
B eee C 
be a pullback square in a Heyting category C. Then the square 
Sub( A) Ms Sub(P) 
Ve Vee 


* 


Sub(C) -> Sub(B) 
commutes up to isomorphism. 


Proof The corresponding diagram of left adjoints 
= 
Sub(A) <— Sub(P) 
f* k* 


dg 
Sub(C) <— Sub(B) 
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commutes, by the definition of the functors 4(_) and of a regular category. So 
the result follows from uniqueness of adjoints. o 


The commutativity of the square in the statement of 1.4.11 (and of the 
corresponding square for existential quantification, appearing in the proof) is 
sometimes referred to as the Beck-Chevalley condition. We shall meet this 
condition again, in different contexts, in 2.2.4 and 4.1.16 (and yet again in 
B1.4.2). 

The category Set is Boolean (and ‘complement’ means what it usually does). 
Booleanness is stable under slicing (and under the positivization process of 1.4.5), 
but not in general under passage to functor categories; in fact we have 


Lemma 1.4.12 For a locally small category C, the functor category [C, Set] is 
Boolean iff C is a groupoid. 


Proof Suppose C is a groupoid; let F': C — Set be a functor, and F’ — 
F a subfunctor. We may suppose without loss of generality that the values 
F'(A), A € ob C, of F’ are actual subsets of the corresponding F(A). Define 


F"(A) = {z € F(A) | z g F'(A)}; 


then F” is also a subfunctor of F', since if f: A— B in C and z € F”(A) then 
we cannot have F(f)(z) € F'(B), for else r = F(f~')F(f)(x) would be in 
F'(A). And F” is clearly a complement for F”, since unions and intersections of 
subobjects in [C, Set] are computed pointwise. 

Conversely, suppose [C,Set] is Boolean. The functor F = C (A, —) has no 
complemented subobjects other than itself and 0, for if F = F’U F” then one of 
F'(A) and F”(A) must contain 14, which forces the corresponding one of F” and 
F” to be the whole of F. Thus if f: A — B is any morphism of C, the subfunctor 
Fy of F defined by 


F,(C) = {g: A — C | g factors through f} 


must-be the whole of F (since it is clearly nonzero); so 14 € F(A), ie. f is a 
split monomorphism. But a category in which every morphism is split monic is 
a groupoid. E 


Subobjects of X in LH correspond (up to isomorphism) to open subsets of X; 
so if LH/X is Boolean then all its open subsets must be closed. If X satisfies the 
To separation axiom, this forces it to be discrete; but for a discrete space X, we 
have an isomorphism LH/X = Set/|X| (where |X] is, as usual, the underlying 
set of X), and so LH/X is Boolean in this case. 

Despite the foregoing results, [C, Set] is a Heyting category for any small 
category C, and LH/X is a Heyting category for any space X. Both assertions 
will follow from 1.4.18 below (and the former will also follow from two results to 
be proved in the next section, namely 1.5.5 and 1.5.13). 
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In a coherent category, we define the negation (or pseudo-complement) of a 
subobject A’ — A to be the largest subobject of A disjoint from A’, if this exists. 
We shall denote it by ~A’ — A; there is no harm in re-using our notation for com- 
plements, since it is clear that a complement (if it exists) is a pseudo-complement, 
and conversely a pseudo-complemented subobject A’ + A is complemented iff 
A UnA S A. In a Heyting category, negations always exist: indeed, we have a 
more general construction. 


Lemma 1.4.13 Let A; — A and Az — A be subobjects in a Heyting category. 
Then there exists a largest subobject Az — A such that A3N A, < Az. Moreover, 
if we denote this subobject by (Ai = A2) — A, then the binary operation on 
subobjects thus defined is stable under pullback. 


Proof We define (A; = A2) — A to be Vm(A1 N Az — Aj), where m is 
the name of the monomorphism A, — A. It is an immediate consequence of 
the adjunction (m* 4 Ym) that this subobject has the required property; the 
stability under pullback follows from 1.4.11. O 


The stability of > (when it exists) under pullback is no surprise, since this 
assertion is equivalent to the Frobenius reciprocity condition established for reg- 
ular categories in 1.3.3 — see 1.5.8 below. In the terminology to be introduced 
in the next section, 1.4.13 says that subobject lattices in Heyting categories are 
Heyting algebras. Of course, we obtain the negation ~A’ as (A’=> 0). For future 
reference, we note 


Lemma 1.4.14 A Heyting category C is Boolean iff ~=: Sub( A) — Sub(A) is 
(isomorphic to) the identity, for every object A of C. 


Proof The necessity of the condition has already been mentioned. For the 
sufficiency, observe that we always have 


~(A' U 7A’) = 0, 


because a subobject A” disjoint from A’ U ~A’ is in particular disjoint from A’, 
and hence contained in ~A’. So 


a7(A’ UnA’) S A, 
from which the result follows. 0 


An object A of a coherent category is said to be decidable if the diagonal map 
Aa: A> Ax Ais a complemented subobject. Of course, in a Boolean coherent 
category every object is decidable; but the converse is false. In a distributive 
lattice considered as a coherent category, every object is decidable since diago- 
nal maps are isomorphisms; but we can also give positive counterexamples using 
1.4.15 and 1.4.16 below. (However, in a topos (and, more generally, in a coher- 
ent category with a subobject classifier, as defined in Section A1.6 below), the 
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decidability of every object is equivalent to Booleanness. This is because, if A is 
decidable, then every morphism 1 — A is a complemented subobject (since it is 
split monic, and so can be expressed as an equalizer of 14 and A — 1 — A, or 
equivalently as a pullback of A,), and so decidability of the subobject classifier 
Q forces the generic subobject T: 1 — Q to be complemented.) 

We note that the objects 0 and 1 are always decidable (their diagonal maps 
are isomorphisms); any subobject of a decidable object is decidable (since, if 
f: A~ B is monic, then A, is the pullback of Ag along f x f); a disjoint 
coproduct A II B of decidable objects is decidable (since we have 


(AU B) x (AIL B) & (Ax A) (A x B) (B x A)U(B x B) 


by stability of unions under pullback, and the complement of the diagonal A Aus 
is then the union of the second and third summands above with the complements 
of A4 and Ag); and a product A x B of decidable objects is decidable (since its 
diagonal is the intersection of the pullbacks of A4 and Ag along the appropriate 
product projections, and a finite intersection of complemented subobjects is 
complemented). Thus we have proved 


Proposition 1.4.15 IfC is a coherent category, then the full subcategory Ca 
of decidable objects of C is also coherent, and positive if C is; and the inclusion 
Ca > C is a coherent functor. QO 


Examples 1.4.16 (a) In a functor category [C,Set], an object F is decidable 
iff the assignment 
Ar {(t,y)€ FAx FA| 2 #y} 


defines a subfunctor of F x F, iff F(f) is injective for every morphism f of C. 
Using this, it is easy to give examples of categories C for which [C, Set]4 is not 
Boolean. 


(b) In the category LH/X, an object p: E — X is decidable iff the diagonal 
embedding E — E xy E is closed as well as open, since subobjects in LH 
correspond to open subspaces. If X is a Hausdorff space (ie. if the diagonal 
X = X x X is closed), this condition is equivalent to saying that E is also 
Hausdorff, since the canonical inclusion E xx E — E x E is a pullback of 
X = X x X and hence closed. 


We remark that, although decidability is preserved by coherent functors, it is 
not reflected by conservative coherent functors - as may be seen by considering 
the forgetful functor |C, Set] + Set?” © for suitable C. 

Although, in this chapter, our main concern is with the elementary (and 
hence finitary) structure of categories, it is worth digressing briefly, before we 
end this section, to consider infinite unions and intersections. 

In the context of a particular model of set theory, we shall say that a 
category C is well-powered if each of the preorders Sub(A), A € ob C, is equiv- 
alent to a small category. In passing, we remark that category-theorists often 
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refer to the dual condition (i.e. the well-poweredness of C°P) by saying ‘C is 
cowell-powered’; the rules of syntax clearly demand that the term should be 
well-copowered, and that is what we shall use in this book. 


Remark 1.4.17 The property of being well-powered is closely related to local 
smallness. First, a well-powered category with finite products is locally small, 
since if two morphisms A = B are not equal then their graphs are non-isomorphic 
subobjects of A x B (cf. 2.2.3 below). Conversely, a locally small category with 
finite intersections of subobjects and a generating set of objects is well-powered, 
since a subobject of a given object A is determined (up to isomorphism) by 
the set of morphisms from members of the generating set to A which factor 
through it. And any locally small category with a subobject classifier as defined 
in Section A1.6 below (in particular, any locally small topos) is well-powered, 
even if it does not have a generating set. 


We note that the particular category Reg(C) which we considered in 1.3.10(d) 
is locally small but not well-powered; hence it does not have a generating set. On 
the other hand, a ‘large group’ (that is, a groupoid with one object but a proper 
class of morphisms) is well-powered (since any two subobjects are isomorphic) 
but not locally small. 

When talking about well-powered categories, we generally interpret ‘subob- 
ject’ as meaning ‘isomorphism class of monomorphisms’, so that Sub(A) becomes 
a (small) partial order rather than a preorder. Of course, such an ordered set 
has arbitrary (set-indexed) unions iff it has arbitrary intersections; the latter are 
automatically preserved by pullback functors (if C has pullbacks) since they have 
left adjoints, but the former may not be. In fact we have 


Lemma 1.4.18 Let C be a well-powered regular category whose subobject lat- 
tices have arbitrary (unions and) intersections. Then C is a Heyting category iff 
arbitrary unions of subobjects are stable under pullback in C. 


Proof  f*: Sub(B) — Sub(A) preserves arbitrary unions iff it has a right 
adjoint, by the poset version of the adjoint functor theorem. O 


A category satisfying the conditions of Lemma 1.4.18 will be called a 
geometric category. As hinted above, both [C, Set] (for any small C) and LH/X 
(for any space X) are geometric categories; in the first case this follows from 
the fact that both unions and pullbacks are computed pointwise (and Set is a 
geometric category), and in the second from the fact that subobjects correspond 
to open subsets. 

We shall say that a geometric category C is oo-posttive if every set-indexed 
family of objects of C has a disjoint coproduct (equivalently, can be embedded as 
a pairwise-disjoint family of subobjects of a single object). The category Sets is 
an example of a geometric category (since its subobject lattices are finite) which 
is positive but not oo-positive. Of course, there is an infinitary analogue of the 
Positivization process of 1.4.5, in which we take the objects of Pos,.(C) to be 
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arbitrary set-indexed families of objects of C, rather than finite families. And we 
say C is an oo-pretopos if it is an oo-positive geometric category which is effective 
as a regular category. For future reference, we note 


Lemma 1.4.19 An co-pretopos is cocomplete. 


Proof Since an oo-pretopos C has set-indexed coproducts by definition, it 
suffices to show that it has coequalizers. But by 1.3.4 we know that equivalence 
relations in C have coequalizers; so it suffices to show that, for any parallel 
pair f,g: B = A in C, there is an equivalence relation a,b: R = A such that a 
morphism k: A — C satisfies hf = hg iff ha = hb. 

First we form the image factorization 


q (c, d) 


B ———— I >——> AxA 


of (f, g); since q is epic, we have hf = hg iff hc = hd. Next we replace I by its 
reflexive and symmetric closure J = I U AU T° in Sub(A x A); using the fact 
that unions are pushouts (1.4.3), it is clear that any morphism which coequalizes 
c and d will also coequalize the morphisms 7, d: J = A. Then we construct the 
transitive closure R of J, as the union of its finite powers (each of which is 
constructed as the image of a map to A x A from an appropriate pullback); yet 
again, a morphism which coequalizes Z and d will coequalize the morphisms from 
each of these finite powers to A, and hence (by the infinitary analogue of 1.4.3, 
which is proved in the same way as the original) it will coequalize the projections 
a,b: R= A. But R is an equivalence relation; so this is sufficient. 


We shall also occasionally wish to consider regular categories which have 
pullback-stable (finite or) countable unions of subobjects, but not necessar- 
ily unions of larger cardinality. We shall call these o-coherent categories, or 
o-pretoposes if they are effective and o-positive (i.e. have countable disjoint 
coproducts). We note that the proof of 1.4.19 uses only countable unions, and 
so shows that a o-pretopos has all countable colimits. 


Suggestions for further reading: Barr [78], Börger [169], Freyd & 
Scedrov [381], Ghilardi & Zawadowski [403], Makkai & Reyes [790]. 


A1.5 Cartesian closed categories 


The next level of ‘set-like’ categorical structure which we study is something of 
a new departure: it is concerned not with the preorders Sub(A), A € ob C, but 
with the whole category C. To put the same thing another way: the classes of 
categories studied in the last three sections correspond (in a sense made precise 
in Chapter D1) to certain fragments of first-order logic, but cartesian closed 
categories correspond to the typed lambda-calculus (see Section D4.2), which 
belongs to the realm of higher-order logic. 
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Let C be a category with finite products. For a fixed object A of C, we can 
regard the assignment B ++ Bx A as the object-map of a functor (—) x A: C => C. 
We say that A is ezponentiable if this functor has a right adjoint (generally 
denoted (~)4, though we may use other notations such as [A,—] or (A => —) 
from time to time). In elementary terms, this adjunction is best described in 
terms of its counit: A is exponentiable if we are given an operation assigning 
to each object B of C an object B4 and a morphism ev: BÂ x A —> B (called 
‘evaluation’) such that for any h: C x A —> B there is a unique h: C —> B4 for 
which 


hx 
CGH! BAA 


B 


commutes. We call h the exponential transpose of h. 
Let Cex, denote the full subcategory of exponentiable objects of C. 


Lemma 1.5.1 
(i) Cexp has finite products, and the inclusion Cexp > C preserves them. 


(ii) The assignment (A,B) + BA defines a bifunctor (Cexp)°? x C > C. 
(iii) If C is Cauchy-complete, then so is Cexp- 


Proof (i) The terminal object 1 is exponentiable, because (—) x 1 is isomorphic 
to the identity functor. If A and B are exponentiable, then the object (C?)A, 
together with the composite 


1 
(C8)AxAxBO*s 


OP xB = SC. 

has the universal property of an exponential C(4*); so A x B is exponentiable. 
(ii) Given f: A’ + A and g: B — B’, we define gf: B4 — B’ to be the 

exponential transpose of 


A noe Sey 1A ev t 
B* x A ———> B” x A ——~—> B'; 


it is straightforward to verify that this works. 

(iii) Suppose A is exponentiable, and let e: A — A be an idempotent endo- 
morphism. Then we know e splits in A as A + B — A; we have to show that 
B is also exponentiable. But, for any C, we have an idempotent C°: C4 — CA 
by part (ii); and it is straightforward to verify that the image of this idempotent 
has the universal property of an exponential C®?, o 
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In Gp and Ab, the only exponentiable object is the terminal object. In Sp, 
any locally compact space is exponentiable (and the converse is true, modulo the 
need to take some care in defining ‘locally compact’), the exponential Y* being 
the set of continuous maps X — Y equipped with the compact-open topology. In 
Set, and in the category Cat of small categories, every object is exponentiable: 
in Set we take the exponential B4 to be the set of functions A — B, and in 
Cat we take D? to be the functor category |C, D]. It is the latter situation we 
wish to study in this section. 

We say a category C is cartesian closed if it has finite products and all its 
objects are exponentiable. This terminology is very well established, and we do 
not feel able to depart from it; but the reader must be warned that it produces a 
slight awkwardness, in that a cartesian closed category need not be cartesian in 
the sense of having all finite limits. (The primary justification for this is provided 
by the connection with the typed lambda-calculus, to which we have already 
alluded; the cartesian closed categories arising from models of the latter do not 
in general have limits other than finite products. However, another justification 
may be found in the fact that, even when C is cartesian, the subcategory Cexp 
is not in general closed under equalizers, in contrast to 1.5.1(i) and (iii) - Sp 
provides a counterexample. See also 1.5.4 below.) Nevertheless, most of the 
cartesian closed categories we shall meet will have equalizers as well as finite 
products; we shall indicate this (with a note of mild reproof in our voice) by 
calling them properly cartesian closed. 

If F:C > D is a finite-product-preserving functor between cartesian closed 
categories, then we have morphisms 64,2: F(B4) > F(B)*™) for objects A, B 
of C, which form a natural transformation between bifunctors C°? x C —> D 
(ef. 1.5.1(ii)): explicitly, 64,5 is the transpose of 


Fi(ev) 
F(B“) x F(A) = F(B4 x A) ——> F(B). 

We say F is a cartesian closed functor if 04, g is an isomorphism for all A, B € 
ob C (with the same convention as before about ‘properly cartesian closed’). 

Recall that if B is an object of a cartesian category C, we use B also as the 
name of the unique morphism B — 1, so that B*:C — C/B denotes the right 
adjoint of the forgetful functor £g:C/B— C. (Explicitly, B*(A) is the product 
projection A x B —> B.) 


Lemma 1.5.2 Let C be a cartesian category. 


(i) An object B of C is exponentiable iff B* has a right adjoint IIg: C/B —> C. 


(ii) If A is erponentiable in C, then B*(A) is exponentiable in C/B for any B. 
Moreover, B* preserves any exponentials which erist in C. 


Proof (i) (—) x B is the composite 


B* “Bp 
pe ME Fs I 
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so if II, exists we may define (—)® as the composite TI go B*. Conversely, suppose 
B is exponentiable. Given f: A — B, form the pullback 


Tle (f) ——> A? 


fP 


I —_-» BB 

where q is the (isomorphic) projection 1 x B — B. Now, for any object C 
of C, morphisms C — IIg(f) correspond to morphisms k: C — A® such that 
f®h = QC, and hence to morphisms h: C x B > A such that fh is the product 
projection C x B — B, i.e. to morphisms B* (C) — f in C/B. It is straightforward 
to verify that this correspondence is natural in C and f. 

(ii) Given an object f: C — B of C/B, define f®°) to be the left vertical 
morphism in the pullback square 


P ——> C^ 


fA 
p A 
B ———> B 
where p is the projection B x A — B. If g: D > B is another object of C/B, 
then the product g x B*(A) in C/B is the diagonal of the pullback square 


gxl 
D x A ——> BxA , 


p 


D ———> B 


from which it follows easily that morphisms h: g x B*(A) > f in C/B corre- 
spond to morphisms h: D > C4 in C such that f4h = pg, i.e. to morphisms 
g — fP") in C/B. Once again, this correspondence is easily seen to be natural. 
Finally, if f is itself of the form B*(E), we see that the pullback P above may 
be rewritten in the form 


1x Dp 
E^ x BS E^ x BA 


i 


B—~———> B^ 
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(where the vertical arrows are product projections), because (—)“, being a right 
adjoint, preserves products. 


Corollary 1.5.3 A cartesian category C is locally cartesian closed (i.e. each 
slice category C/B is cartesian closed) iff f*: C/B — C/A has a right adjoint I, 
for each morphism f: A— B of C. Moreover, if C is locally cartesian closed, 
then the functors f* are cartesian closed. 


Proof Recall the isomorphism (C/B)/f =C/A. o 


Unlike the other conditions we have considered so far (and despite the result 
of 1.5.2(ii)), cartesian closedness is not stable under slicing, so ‘locally cartesian 
closed’ is a stronger condition than ‘cartesian closed’. (By convention, a locally 
cartesian closed category is assumed to have a terminal object, so that it is in 
particular cartesian closed — and indeed properly cartesian closed, since all its 
slice categories are required to have finite products, i.e. the category itself is 
required to have pullbacks.) 

For example, Cat is not locally cartesian closed; this may be seen, given 
the result of 1.5.3, by observing that pullback functors in Cat do not in general 
preserve coequalizers. (Indeed, Cat is not even regular: there is a regular epi- 
morphism 2 + 2 — 3 in Cat, where n denotes an n-element totally ordered set, 
whose pullback along a suitable morphism 2 — 3 is not epic.) J. Giraud [406], 
and independently F. Conduché [247], determined the exponentiable objects in 
Cat/C for an arbitrary C; they are functors F:D—C such that, given a fac- 
torization F(f) = gh of a morphism in the image of F, it can be lifted to a 
factorization f = g'h' with F(g’) = g and F(h’) = h, ‘uniquely up to morphisms 
which are sent by F to identities’. 


Remark 1.5.4 Despite the foregoing, it follows easily from 1.5.2(ii) that, if 
C is cartesian closed and B is any object of C, then the full subcategory Cg of 
C/B whose objects are those of the form B* A, A € ob C, is cartesian closed, 
though not in general properly so (it is closed under products in C/B because 
B* preserves them, but not in general under equalizers). Equivalently, Cg may 
be regarded as the Kleisli category of the comonad on C whose functor part is 
(—)x B, with counit and comultiplication given by the projection nı: A x B> A 
and 1x A: A x B — Ax Bx B respectively; of course, the Kilenberg-Moore 
category of this comonad (i.e., the category of coalgebras) is (isomorphic to) 
C/B itself. In 4.2.1 below we shall investigate when the category of coalgebras 
for a comonad on C inherits (local) cartesian closedness from C; and we shall 
meet the Kleisli category Cg again in D4.2.10. 


The category Set is locally cartesian closed; this may be seen using the 
equivalence Set/B ~ Set” of 1.1.6, plus the (obvious) fact that a product of 
cartesian closed categories is cartesian closed. More generally, 


Proposition 1.5.5 For any small category C, the functor category [C, Set] is 
locally cartesian closed. 
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Proof Since, by 1.1.7, any slice of [C, Set] is equivalent to another cate- 
gory of the same type, it suffices to prove that [C, Set] is cartesian closed. 
In fact the definition of exponentials in this category is forced upon us by 
the Yoneda lemma: if the exponential G? exists, then elements of G’(A) 
must correspond bijectively to morphisms C (A, —) — G7, and hence to mor- 
phisms C (A, —) x F — G in [C, Set]. But [C, Set] is locally small, so we may 
take the set of all such morphisms as the definition of G*(A) (and make it 
into a functor C — Set: if f: A— B in C, GF (f) is the operation ‘compose 
with C (f,—) x 1:C(B,—) x F ~C(A,—) x F”). We define the evaluation map 
ev: GF x F > G by eva(¢,z) = ġa(la, x£), where ġ:C(A,—) x F +G and 
x € F(A). And, given a morphism 9: H x F — G, we define its exponential 
transpose 6: H > GF by 


(84(z))a(f,2) = On(H(Ff)(z), 2) 


for z € H(A), f: A — Band z € F(B),. The rest of the proof is straightforward 
verification. O 


An alternative, less explicit, proof of 1.5.5 may be given by observing that 
IC, Set] (and its slice categories) have all small colimits, and that pullback func- 
tors preserve them (because they do so in Set, and both pullbacks and colimits 
in [C, Set] are constructed pointwise), and then applying the Adjoint Functor 
Theorem to obtain the right adjoints needed for 1.5.3. However, the explicit 
description of exponentials in (C, Set] has many uses: for example, 


Scholium 1.5.6 
(i) Let C be a small category with finite products. Then the Yoneda embedding 
C — {C°?, Set] preserves any exponentials which exist in C. In particular, 
if C is cartesian closed, then the Yoneda embedding 1s a cartesian closed 
functor. 


(ii) For any small category C, the functor A: Set — [C, Set] which sends a set 
A to the constant functor with value A is cartesian closed. 


Proof (i) Let us write Y for the Yoneda embedding; recall that it sends an 
object A to the functor C (—, A). Now if A,B,C are three objects of C, then 
elements of Y(B)* (4)(C) are by definition morphisms Y (C) x Y(A) — Y (B) in 
(CoP, Set]; but these correspond bijectively to morphisms C x A > B in C (since 
Y is full and faithful, and preserves products), and hence (if the exponential B4 
exists in C) to morphisms C — B^ - that is, to elements of Y(B4)(C). It is easily 
verified that these bijections define a natural isomorphism Y (B4) = Y(B)¥ A), 
and that this isomorphism is the canonical morphism ĝ4.g defined earlier. 

(ii) Let S and T be two sets, and A an object of C. Then 1.5.5 tells us 
that ATAS(A) is the set of natural transformations a: C (A, —) x AS — AT. 
Given such a transformation, we get a mapping ag:.5—T defined by 
afs) = a alla, s). But then, by naturality, for any f: A— B we have 
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ap(f,s) = AT(f)(aa(1a,5)) = @o(s); so the entire natural transformation is 
determined by ag. Thus we have a bijection from the exponential T° in Set to 
AT‘5(A), for each A; it is straightforward to check that these bijections them- 
selves form a natural isomorphism A(7'°) > ATS, and that this isomorphism 
is the canonical map 05,7. J 


In particular, 1.5.6(i) shows that any small cartesian closed category admits a 
full and faithful cartesian closed functor to a properly cartesian closed category. 

The category LH/X is also cartesian closed for any space X (and hence 
locally cartesian closed); the proof of this fact is deferred until 4.2.4(e). 

Exponentiability is closely related to the notion of partial product, which was 
originally introduced in the category Sp by B.A. Pasynkov [938] and extended 
to more general categories by R. Dyckhoff and W. Tholen [321]. In a carte- 
sian category C, a partial product of an object A and a morphism f: E — B 
is defined to be an object P(A, f) equipped with morphisms p: P(A, f) > B 
and à: P(A, f) xg E — A, which is universal among such; i.e., given any object 
C equipped with morphisms C — B and C xg E — A, there exists a unique 
morphism C — P(A, f) commuting (in the obvious sense) with the structure 
morphisms. We observe that the notion of partial product includes as special 
cases that of an ordinary categorical product A x B (obtained by taking f = 1g) 
and that of an exponential A” (obtained by taking B = 1). 

The fundamental existence theorem for partial products, due to Dyckhoff 
and Tholen, is as follows. 


Proposition 1.5.7 Let f: E— B be a morphism in a cartesian category C. 
Then the partial product P(A, f) exists for all objects A of C iff f is exponentiable 
as an object of C/B. 


Proof First suppose f is exponentiable. Then we define p: P(A, f) — B to 
be the exponential (B*A)/ in C/B, and à: P(a, f) xg E = Eg(p x f) 3 A to 
be the transpose of the evaluation map p x f — B*A. It is easy to verify that 
this has the right universal property: for, given an object C equipped with 
maps r: C + B and l: C xg E — A, we can regard the latter as a morphism 
La(r x f) — A, and transpose it across the adjunctions to obtain the required 
morphism r — p. 

Conversely, if P(A, f) exists for all A, then it is easily seen that P(—, f) 
is (covariantly) functorial: given h: A — A’, we get a morphism P(A, f) > 
P(A’, f) induced by p: P(A, f)— B and hà: P(A, f) xp E — A — A’. So, 
given any g: A + B, we may define gf to be the left vertical morphism in the 
pullback square 

C ——> P(A, f) 


P(g, f) 


B ais P(B, f) 
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where f corresponds to the pair (lg: B + B,f: B xg E = E — B). Then it is 
easily verified that, for any h: D — B, morphisms D — C over B correspond to 
morphisms k: D — P(A, f) satisfying P(g, f)k = fh, and these in turn corre- 
spond to morphisms D xp E —> A over B. O 


We shall see an application of partial products in 2.5.17 below. And in 
Section B4.4 we shall investigate a 2-categorical generalization of the notion 
of partial product, and show that significant examples of such partial products 
exist in the 2-category of toposes and geometric morphisms. 

The following result concerning cartesian closed functors will be needed at 
numerous points later on. 


Lemma 1.5.8 Let F:C — D be a functor between cartesian closed categories, 
and suppose F has a left adjoint L. Then F is cartesian closed iff the canonical 
morphism 


(L71,€aLn2): L(B x FA) —>LBXxA 


(where € is the counit of (L 4 F)) is an isomorphism for all objects A, B of C 
and D respectively. 


The isomorphism £(B x FA) & LB x A is commonly called the Frobenius 
reciprocity law for the adjunction (Z 4 F'). Note that Lemmas 1.3.3 and 1.4.13 
provide an example of an adjunction (between cartesian closed posets) satisfying 
the above conditions. 


Proof First suppose F is cartesian closed. We construct an inverse for 
(£7, ca Larz) by forming the composite 


L(nF A) x1 
ra. L((B x FA)P4)x A abe L(FL(B x FA)PA) xA 


L0- x1 
1 
L(B x FA) <—_ L(Bx FA)A x A <«— “~ LF(L(B x FA)4) x A 


where À is the unit of the exponential adjunction, 7 is the unit of (L 4 F), and 8 
is the (invertible) exponential comparison map for F'. The verification that this 
is indeed inverse to (Lr, €a Larz) is tedious but straightforward. 

Conversely, suppose the Frobenius reciprocity law is satisfied; let 64,2 denote 
the inverse of (Z71,€4L72). Since F has a left adjoint, we know that it pre- 
serves finite products, so we need only construct an inverse for the exponential 
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comparison map @: F(A?) — FA?C., This time we form the composite 


FAFO y PLFA) FA > F(L(FAFC) x €)°) 


F(5°) 


F(eC 


e C 
F(A°C) <——— F(LFAF) een): Ww 


F((L(FA® x FC))°) . 
Again, it is straightforward to verify that this is inverse to 0. a 


Two particular cases of 1.5.8 are worth noting: 
Corollary 1.5.9 Let F and L be as in 1.5.8. 


(i) If F is cartesian closed and L preserves 1, then F is full and faithful. 


(ii) If F is full and faithful and L preserves binary products, then F is cartesian 
closed. 


Proof (i) Putting B = 1 in the Frobenius reciprocity law, we deduce that 
€: LF — 1¢ is an isomorphism; so F is full and faithful. 

(ii) Conversely, if F is full and faithful, then € is an isomorphism, and so 
the Frobenius reciprocity law follows from the assertion that L preserves binary 
products. O 


In a cartesian closed category C, a class Z of objects (or the full subcategory 
which it determines) is called an exponential ideal if B € T implies B4 € T for 
every object A of C. 


Lemma 1.5.10 In any cartesian closed category, the class of subterminal 
objects (i.e. those B such that B — 1 is monic) is an exponential ideal. 


Proof If B is subterminal, then there is at most one morphism C x A > B 
for any A and C, and hence at most one morphism C > B4. o 


In particular, it follows from 1.5.10 that Sub(1) is a cartesian closed preorder 
in any cartesian closed category C (and if C is locally cartesian closed, then 
Sub(A) is cartesian closed for all A). In a cartesian closed preorder the expo- 
nentiation is normally denoted (A = B) instead of Bô (and called implication 
rather than exponentiation); its defining property is 


C<(A=>B) iff (CNA)<B. 


A cartesian closed preorder is necessarily locally cartesian closed; this may be 
seen using 1.5.4, since if C is a preorder then every object of C/B is isomorphic 
to one in the image of B* (if A < B, we have A = AN B). Moreover, if C isa 
preorder then we have Sube(A) = C/A for every A, since every morphism of C is 
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monic, and so we see that (provided C is also cocartesian) it is (locally) cartesian 
closed iff it is a Heyting category as defined after 1.4.10. 

A poset (that is, a small preorder in which the only isomorphisms are iden- 
tities) which is cartesian closed is called a Heyting semilattice (sometimes a 
Brouwerian semilattice); if it is also cocartesian, then it is called a Heyting alge- 
bra. For future reference, we need to note that the theory of Heyting semilattices 
is algebraic: 


Lemma 1.5.11 The category HSLat of Heyting semilattices (and cartesian 
closed functors) is isomorphic to the category of models of the algebraic theory 
defined by a constant 1 and two binary operations A and =, satisfying the 
equations 


lAgr=2, ©£ALT=2, TAY=YANG, 

gA(yAz)=(£ZAY)Az, (ez) =1, 

aA(z=y)=zAY, yA (=y) =y, 
r=>(y A2) = (1>) ^A (1=>2). 


Proof Let A be a model of the above theory. It is well known that the first 
four equations are equivalent to saying that A is a meet-semilattice (that is, a 
cartesian poset) with respect to the partial order defined by a < b if a = a Ab. 
So we have to show that the last four equations imply that a> (—): A = A is 
right adjoint to (—) Aa. 

First we note that the last equation implies that a= (—) is order-preserving. 
Now if c Aa < b, we have 


c=cA (asc) < (asc) = (asc) A (aa) = (a= (cAa)) < (a=b), 
and conversely if c < (a=b), then 
cNa<(as>b)ANa=aNb<b. 


So A is a Heyting semilattice. Conversely, it is an easy exercise to verify that 
the last four equations hold in any Heyting semilattice: for example, the last one 
holds because a=>(—), being a right adjoint, preserves limits. O 


We may, of course, give a similar algebraic presentation of the theory of 
Heyting algebras, by adding a further constant 0 and a binary operation V, plus 
appropriate equations relating V to A. Complete Heyting algebras (i.e. ones in 
which every subset has a least upper bound) are also called frames or locales; 
they will be studied extensively in Chapter C1. 

We have not yet discussed the relationship between cartesian closedness and 
the categorical properties studied in Sections A1.3 and A1.4. ‘Mere’ cartesian 
closedness does not imply very much: the example of Cat shows that it need 
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not imply regularity. The only positive result we have for merely cartesian closed 
categories is 


Lemma 1.5.12 Ifa cartesian closed category C has an initial object, then it is 
strict (cf. 1.4.1). 


Proof Write 0 for the initial object, and suppose there exists a morphism 
f: A—0. Then the projection A x 0 — A is split epic (the splitting being 
induced by (14, f)); but A x 0 = 0 since A x (—) preserves colimits. So A is a 
retract of the initial object, and hence must itself be initial. 0 


On the other hand, locally cartesian closed categories tend to have most of 
the good properties of the categories we have studied in previous sections. 


Lemma 1.5.13 Let C be a locally cartesian closed category. 


(i) If C has coequalizers (of effective equivalence relations), then it is regular. 
(ii) If C is cocartesian, then it is a Heyting category. 


Proof (i) Since pullback functors in C have right adjoints, they preserve any 
coequalizers which exist; so this follows from 1.3.5. 

(ii) Let f: A— B be a morphism of C. Since Ilp: C/A — C/B is a right 
adjoint, it preserves monomorphisms, and hence restricts to a functor Sub(A) > 
Sub(B), which is clearly right adjoint to f*: Sub(B) — Sub(A). Moreover, f* 
preserves finite unions of subobjects because it has a right adjoint (and the 
unions exist because C has finite coproducts; cf. the remarks at the beginning of 
Section A1.4). Oo 


We cannot hope to have any sort of elementary converse to 1.5.13. Let Set, 
be the full subcategory of Set whose objects are all finite or countable sets; then 
Set, is a Boolean pretopos (being closed under the appropriate operations on 
Set), but fails to be cartesian closed for obvious cardinality reasons: there are 
uncountably many morphisms 1 x N — 2, but only countably many morphisms 
1 — A for any A. Nevertheless, there is a non-elementary converse of sorts: if C 
is an oo-pretopos with a separating set of objects, then it is (a topos, and hence) 
locally cartesian closed. We shall prove this result in C2.2.8; cf. also 3.4.11. 

A category satisfying the hypotheses of 1.5.13 need not be effective as a 
regular category: for a striking counterexample, see C4.2.4. However, it comes 
surprisingly close to being positive as a coherent category. We shall say that 
an object Q of an arbitrary category C is quasi-initial if it admits at most one 
morphism to any other object of C (equivalently, if C has an initial object 0, 
the unique morphism 0 — Q is epic). And we say that a coproduct A + B is 
quasi-disjoint if the coprojections A ~ A+ B and B — A+B are monic, and 
their intersection in Sub(A + B) has a quasi-initial object for its domain. 


Lemma 1.5.14 Jn a locally cartesian closed, cocartesian category C, coprod- 
ucts are quasi-disjoint. In particular, if 0 has no proper epimorphic images in C 
(for example if C is balanced), then C is a positive coherent category. 
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Proof Given a coproduct A+ B, form the pullbacks 
Ress Ae ; 


V 


nE E TA; 

then (r,s) is also a coproduct diagram. But r is split epic, being split by the 
diagonal A: A — R; and it is split monic, because the pair of maps (1r, As) 
must factor through (r,s). So it is an isomorphism, and hence vı is monic. 
Similarly, v2 is monic. Now suppose fı, f2: S = C are any two morphisms. Then 
the pairs (yr: R —> A+C,vef;:S — A+C) (i = 1,2) have only one possible 
factorization through (r, s), namely vı, since r is epic. So v2 fı = nis = v2 f2; but 
we have already seen that vz must be monic, so fı = f2. Hence S is quasi-initial. 

This proves the first assertion of the lemma; the second follows immediately 
from it and 1.5.13(ii). The fact that a balanced category C is a special case of 
the second assertion follows from the fact that the initial object of C is strict by 
1.5.12, and so any morphism with domain 0 is monic. o 


The example of a Heyting algebra, considered as a category, shows that we 
cannot hope to improve on the result of 1.5.14. 

Finally in this section, we return to the regularization construction considered 
at the end of Section A1.3. 


Proposition 1.5.15 Let C be a cartesian category. If C is cartesian closed 
(resp. locally cartesian closed), then so is Reg(C). 


Proof First consider the case of ‘mere’ cartesian closedness. Given two objects 
(fı: Aı > By) and (f2: Az > Bz) of Reg(C), we define the exponential fof to 
be (g: E + B41), where E — Ao“! is the equalizer of 


f2° 
Ais By 
fr’ 
(and a,b: K, =3 A, is the kernel-pair of fı), and g is the composite 


fo” 
E >——> A^ ——> B“. 


The evaluation map [e]: fof x fı > fz is represented by the composite 


Ex A, >> A” x Ay — > Azi 


we must check that this does indeed represent a morphism of Reg(C), i.e. that 
its composite with fs coequalizes (c x a) and (d x b) where c,d: Kz =3 E is the 
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kernel-pair of g. But this composite can also be written as 


x1 
PA SPA Se, 


so it coequalizes (c x 1) and (d x 1); and it coequalizes (1 x a) and (1 x b) by the 
definition of £. 

Now, given [h]: fs x fi > fe in Reg(C), let h: Az — E be the factorization 
of the transpose h: A3 — Aj“: through the equalizer (which exists because foh 
has equal composites with (1 x a) and (1 x b), by assumption). Then h defines 
a morphism f3 > g in Reg(C), since (f24!)h coequalizes the kernel-pair of f3. 
We need to check that the equivalence class {h] is independent of the choice of 
h; but if [hi] = [h2] (that is, fohy = fzh2), then clearly fe“! coequalizes hı and 
hg, and so g coequalizes hı and hg. Also, we have {e]({h] x 1) = fev(h x 1)] = [h]; 
and if k: A3 > E is any morphism such that {e]([k] x 1) = [h], then ev(k x 1) 
and h represent the same morphism f3 x fı — fe, whence k and h have equal 
composites with g. So we have verified that fo has the universal property of 
an exponential in Reg(C). 

The locally cartesian closed case is similar but more complicated. We shall 
content ourselves with giving the construction of the exponential [g2|!] (where 
lox): fı — fs and [g2]: fo — fs) in Reg(C)/ fs, leaving the reader to verify that 
it has the appropriate universal property. Let a,b: Kı <3 A; be the kernel-pair 
of fı: A; — Bı, and let k denote the composite fsgıa = f3gib. Now form the 
equalizer 

2 a 
E >——> (fagz) ——S (B5(Ba))* 
(Fa) 


in C/B3, where Ta: fsg2 > B3(B2) is the transpose of fo: Ag = Eg, (f392) > Ba; 
and let g4: Aq —> A3 denote the pullback f3(£) in C/A3. Define By to be 
¥ B,{(Bs(Be))*™), and let f4: A4 — By be the map obtained by applying 
Xp, to the composite 


(fo Pw 
Ep E) — E > (fg) > (B}(B2)) (F399) 


in C/B3, where the first factor is the counit of (Es, 4 f3). Then we claim that 
ga: Ay — Az defines a morphism [g4]: f4 —> fa in Reg(C), and that [g4] has the 
universal property of an exponential (g/t! in Reg(C)/ fz. The remaining details 
are tedious but routine verification. m 


Suggestions for further reading: Carboni & Rosolini [230], Johnstone [531], 
Lambek [668], Lambek & Scott [682], Rosický [1060]. 


A1.6 SUBOBJECT CLASSIFIERS 57 


A1.6 Subobject classifiers 


In the earlier sections of this chapter, we have been much concerned with prop- 
erties of subobjects in various classes of categories. Many problems concerning 
subobjects are simplified if we assume that they are all obtained by pullback 
from one particular one. In this section we investigate some of the consequences 
of that assumption. 

Let C be a category with pullbacks. A generic subobject in C is a monomor- 
phism T: B’ > B such that, given any monomorphism m: A’ — A in C, there 
is a unique f: A — B for which we have a pullback square 


A ———> B’ 


m T 
f 


A —— B 

(i.e. such that f*(T) = m in Sub(A)). The symbol T is pronounced ‘true’ or 
‘top’, according to taste; the reason for this nomenclature will become clear later. 
f is called the classifying map or characteristic morphism of the subobject. m; 
it is clear that two subobjects of A have the same classifying map iff they are 
isomorphic in Sub( A), and hence (recalling our convention that a category with 
pullbacks comes equipped with a choice of ‘canonical pullbacks’) a category with 
a generic subobject comes equipped with a choice of canonical representatives 
for the isomorphism classes of objects of Sub(A), for any A. (Thus, in such a 
category, it does even less harm than usual to confuse individual monomorphisms 
with isomorphism classes thereof.) 


Lemma 1.6.1 The domain of a generic subobject is a terminal object of C. 


Proof For any A, a morphism f: A — B factors through T: B’ — B iff f*(T) 
is an isomorphism; so there is exactly one morphism which does so. Q 


In particular, a category with a generic subobject necessarily has a terminal 
object; so there was spurious generality in our initial assumption that C had 
pullbacks rather than all finite limits. (It is possible to gain still further spu- 
rious generality by requiring C merely to have binary products, plus arbitrary 
pullbacks of the particular monomorphism T: for the equalizer of a parallel 
pair f,g: A 3 A’ may be obtained as the pullback of T along 6(f,g), where 
ð: A’ x A’ — B is the classifying map of the diagonal subobject A’ — A’ x A’.) 
The codomain of a generic subobject is traditionally denoted 2, and called a sub- 
object classifier for C; from now on, therefore, we shall write the generic subobject 
as T: 1 -+. One consequence of 1.6.1 is worth remarking immediately: 


Corollary 1.6.2 In a category with a subobject classifier, every monomor- 
Phism is regular. In particular, such a category is balanced. 
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Proof The generic subobject is regular monic, because it is split by the unique 
morphism Q — 1. But in any category, a pullback of a regular monomorphism 
is regular monic: if m is an equalizer of g and h, then f*(m) is an equalizer of 
gf and hf. The second assertion follows from the first as in the proof of 1.4.9. O 


Lemma 1.6.3 Let C be a cartesian category with a subobject classifier Q. 


(i) For every object A of C, the preorder Sub(A) ts cartesian closed; and in 
fact 2 has the structure of an internal Heyting semilattice in C (cf. 1.5.11). 


(ii) If C has finite unions of subobjects which are stable under pullback (for 
example, if C is a coherent category), then Q is an internal Heyting algebra 
in C. 


Proof (i) Let A:Q xQ —NQ be the classifying map of the subobject 
(T, T): 1— Q x Q. We note that this subobject is the intersection of n*(T) and 
m5(T), where 7, and 72 are the product projections 2 x Q = Q; and since inter- 
sections of subobjects are stable under pullback, we see that if f, f’: A 3 Q are 
the classifying maps of subobjects A’ — A and A” >> A respectively, then the 
composite A(f, f’) classifies A'N A” — A. Since the composite TA: A — 1-2 
classifies the terminal object of Sub(A), it is now easy to verify that the constant 
T and the binary operation A give Q the structure of an internal meet-semilattice 
(i.e. they satisfy the diagrammatic forms of the first four equations in the state- 
ment of 1.5.11). Now let 2; — 2 x Q be the equalizer of A and 7, and let 
= Q x Q — Q be the classifying map of this subobject. Then we see that if f 
and f’ are the classifying maps of subobjects A’ — A and A” — A, the mor- 
phism (f, f’): A 2 x Q factors through Qı — 2 x Q iff A N A” & A’, ie. iff 
A’ < A” in Sub(A). If now m: B — A is a third subobject of A, classified by 
g: A > Q, and (A’=> A”) > A denotes the subobject classified by the composite 
=> (f, f’), we see that 


‘B<(A'=>A") in Sub(A) iff >(f,f)m=TB 
iff (f, f’)m factors through 2) = 2 x 2 
if m*(A’) < m*(A”) in Sub(B) 
iff A’N B= JAnm*(A’) < A” in Sub(A) . 


So we have verified that Sub(A) is cartesian closed for any A; and since its 
implication operation is induced by the binary operation => on 2, it is again 
straightforward to verify that this operation, with T and A, satisfies the last 
four equations of 1.5.11. 

(ii) Let L: 1 — Q (pronounced ‘false’ or ‘bottom’) classify the least subobject 
0 — 1 of 1, and let V: Q x 2 — Q classify the union of the subobjects mj (T) and 
13(T). Since unions in C are stable under pullback, they are induced by these 
operations on {2 in the same sense that intersections are induced by T and A; so 
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it is again easy to verify that Q is an internal lattice in C, and hence an internal 
Heyting algebra. i 


In the presence of a subobject classifier, the verification that a category is a 
Heyting category becomes appreciably simplified: 


Proposition 1.6.4 LetC be a regular category having a subobject classifier Q. 
Then C is a Heyting category iff it ‘admits universal quantification over Q’ in the 
sense that, for every object A, the pullback functor aj: Sub( A) > Sub(A x 2) 
has a right adjoint V,,. In particular, if Q is exponentiable in C, then C is a 
Heyting category. 


Proof ‘Only if’ is obvious, so we need to prove ‘if’. First we define 0 — 1 
to be Yo(T: 1 — Q) (taking A = 1 in the condition above). Then the counit 
Q*(0) = N*Ve(T) < T ensures that, for any subobject m: A’ — A in C, we have 
A*(0) = x," (0) < x%,(7) = m in Sub(A), where xm is the classifying map of 
m. So the posets Sub(A) have least elements, which are stable under pullback. 

To obtain binary unions, we first form the subobject of Q x Q x Q classified 
by the map v = ((m => 73) > ((m2 > 13) > 13)), where = is the Heyting 
implication constructed in 1.6.3(i), and then apply universal quantification along 
mi2: Nx NQ xN —RN xN. Let v:Q x 1 — Q classify the resulting subobject. An 
easy calculation shows that we have mı < v and 72 < v in the poset C (0°,2) = 
Sub(3), and hence m < V and mz < V in C(M?,Q). Thus, given any two 
subobjects m;: A; — A (i = 1,2) in C, the subobject classified by the composite 
V(Xm1i:Xm;) is an upper bound for mı and mz in Sub(A). But if m3: A3 — A 
is any such upper bound, then the composite 


q mXmarXma) ooo >g 


reduces to Xma; that is, (Xm: , Xm2, Xms ) factors through the equalizer of v and 73. 
Hence the inequality V12 < v, which is the counit of the adjunction (nfz 4 Yarz) 
translated into terms of classifying maps, yields V(xm1,Xm2) < Xm; ìn C (A,Q). 
So the posets Sub(A) have binary joins, which are stable under pullback; thus C 
is a coherent category. 

It remains to construct universal quantification along an arbitrary mor- 
phism f: A— B of C. Given a subobject m: A’ = A, we first form a sub- 
object U — Ax Bx Q by pulling back the graphs (1, f): A Ax B and 
(L, Xm): A> Ax along the appropriate product projections and forming 
their intersection. Then we apply existential quantification along the projec- 
tion m23: Ax Bx Q > B x 2 to obtain V + B xQ, and let V’ > B x Q be the 
Subobject classified by (v=>72), where v is the classifying map of V. Finally, we 
apply universal quantification along 7): B x Q — B to V’, to obtain a subobject 
W —> B; we claim that this subobject is the universal quantification V,(4’). 
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First we show that f*(W) < A’ in Sub(A). For this, consider the diagram 


Sub(A x B x Q) aa Sub(B x 2) MOTE ii ub(B x 9) Bite Sub(B) 


(1 x f x 1)* (f x 1)" (f x 1)" f* 


Sub(A x A x ) Baie Sub(A x N) ee Sub(A x N) Bua Sub(A) 


where the outer squares commute by the Beck—Chevalley conditions (note that 
the Beck—Chevalley condition for Vz, follows from that for 4+, as in 1.4.11), and 
the central square commutes by the stability of implication under pullback (here 
A and B denote the subobjects (1, TA): A A x Q and (1,TB): Bo Bx 
classified by the second projections). If we chase the subobject U around 
the top and right-hand edges of the diagram, we obtain f*(W); but if we 
chase it down the left-hand edge, we obtain a subobject which contains the 
subobject (1,1, Xm): A> Ax AxQ, and whose image under 723 therefore 
contains (1, Xm): A+ Ax Q. Since implication is order-reversing in its first 
variable, the rest of the bottom edge sends this to a subobject contained 
in Vx, ((l, Xm) = (1,TA)), that is the largest subobject A” — A such that 
(A” x Q)N(1, Xm) < (1, TA). But the latter is easily seen to be the equalizer of 
Xm and TA — that is, A’ > A. 

Now suppose we have a subobject B’ — B such that f*(B’) < A’ in 
Sub(A). To show B’ < W in Sub(B), we need to show that B’ x 2 < V’ in 
Sub(B x 22), or equivalently that (B’ x Q)NV < (1, TB) in Sub(B x 2). By the 
Frobenius reciprocity condition 1.3.3, this is in turn equivalent to showing that 
(Ax BxOX)NU < ((1,T(Ax B)): Ax BoAxXx Bx) in Sub(A x Bx 2). 
But the condition f*(B’) < A’ implies that the intersection of A x B’ with 
the graph of f in Sub(A x B) is contained in A’ x B’; and the intersecton of 
A’ x Q with the graph of xm in Sub(A x Q) is contained in (1, TA): Am A xQ. 
So the inequality we require follows from the definition of U (plus stability of 
intersections under pullback). 

Finally, suppose 2 is exponentiable in C. Then by 1.5.2(ii) A*(Q) is 
exponentiable in C/A for all A, whence by 1.5.2(i) we obtain functors 
Hr, : C/(A x Q) > C/A right adjoint to the pullback functors. Restricting these 
to subobjects, as in 1.5.13(ii), yields the required functors Y+. Oo 


We have just seen that the existence of the exponentials A® for all A, in a 
regular category with subobject classifier 2, has important consequences. In the 
next chapter, we shall see that the existence of 04 for all A has even more far- 
reaching consequences (cf. 2.2.7 and 2.3.4). Note also that the first part of the 
proof of 1.6.4 does not make any use of regularity; thus a cartesian category with 
subobject classifier, which admits universal quantification over Q, automatically 
has pullback-stable finite unions of subobjects. 
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So far we have not seen any examples of subobject classifiers. However, we 
have 


Lemma 1.6.5 A Boolean coherent category in which the coproduct 1+1 exists 
and is disjoint (in particular, a Boolean pretopos) has a subobject classifier. 


Proof Define Q = 1+1 and let T: 1 — Q be the first coprojection (and write 
L for the other coprojection). Given a subobject A’ — A, let A” — A be its 
complement; then A is the coproduct of A’ and A” by 1.4.4, so there is a unique 
f: A—Q such that the diagram 


A! > A e A” 


f 


1 >—— > 2) «e 1 


commutes. Now f*(T) is a subobject of A containing A’ and disjoint from A”, 
so it must be (isomorphic to) A’; but conversely any morphism f: A — Q which 
makes the left-hand square above a pullback must make the right-hand square 
commute, since 1¢(A”) must be contained in the complement of T. So there is 
a unique such f. o 


In particular, Set has a subobject classifier, as does |C, Set] for any groupoid 
C (not necessarily locally small; note that we did not make use of local smallness 
of C in the first half of the proof of 1.4.12). But we also have 


Lemma 1.6.6 For any small category C, the functor category [C, Set] has a 
subobject classifier. 


Proof As in the proof of 1.5.5, the Yoneda Lemma determines what Q must 
be: elements of Q2(A) correspond to morphisms C (A,—) > Q in |C, Set], and 
hence to (isomorphism classes of) subobjects of C (A, —). Now a subfunctor F of 
C (A, —) is determined by the set 


R = {f € mor C | dom f = A and f € F(cod f)}; 
R is a cosieve on A, i.e. a set of morphisms with common domain A such that 
f € R implies gf € R for any g composable with f, and any cosieve on A may 
arise in this way. So we define 2(A) to be the set of cosieves on A; given a 
morphism f: A —> B and R € 2(A), we define 

Q(f)(R) = {g € mor C | dom g = B and gf € R}. 


It is clear that this makes 2 into a functor C —> Set. For any A, we define 
Ta to be the cosieve of all morphisms with domain A; then T is a natural 
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transformation from 1 (i.e. the constant functor with value 1) to 2. Now, given 
a subobject F’ — F in [C, Set], we define ¢: F — 2 by 


ga(z) = {f | dom f = A and F(f)(x) € F'(cod f)} 


for x € F(A); it is clear that ¢ is a natural transformation F — Q, and that 
palz) = Ta iff x € F'(A), ie. that 


is a pullback. But ø is the unique natural transformation with this property; 
for, given a cosieve R, we have f € R iff Q(f)(R) = Tooa f, and hence if y is a 
natural transformation making the square a pullback we have f € paom s(x) iff 
Wood (F(f)(£)) = Teoa f, iff F(f)(x) € F’(cod f). So Q is a subobject classifier 
for [C, Set}. Oo 


The category LH/X also has a subobject classifier for any X; once again, we 
defer the proof of this until 4.2.4(e). Possession of a subobject classifier is stable 
under slicing, in an obvious sense: 


Lemma 1.6.7 If 2 is a subobject classifier in C, then B*(Q) is a subobject 
classifier in C/B, for any object B of C. 


Proof Let f: A— B be an object of C/B, and 
A> A 

F j 
B 


a subobject of f (note that the forgetful functor C/B — C preserves and reflects 
monomorphisms). Let g: A > Q be the classifying map of m: A’ — A in C; then 
(g, f): A— N x B defines a morphism f — B*(Q) in C/B, which is easily seen 
(since C/B — C preserves and reflects pullbacks) to be the unique morphism 
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making 
[=> BM) 
m B*(T) 
f ——> B*(0) 
a pullback. O 


Rather remarkably, it is also stable under co-slicing (unlike many of the prop- 
erties we have considered up to now}; we shall not need this result subsequently, 
but we include it for completeness. 


Lemma 1.6.8 IfQ is a subobject classifier in C, then TB: B—>1—Q is a 
subobject classifier in the co-slice category B\C. 


Proof Let 
B 


f f 


Aes a 
be a monomorphism in B\C, and let g: A—Q be the classifying map of 
m: A’ > Ain C. Since f factors through m, the composite gf is equal to TB, 
i.e. g is a morphism f — TB in B\C. The rest is straightforward. g 


The following result, though seemingly rather special, will be of use in various 
contexts later on. 


Lemma 1.6.9 LetC and D be cartesian categories such that C has a subobject 
classifier; let S,T: C 3 D be a pair of cartesian functors, and 0: S + T a natural 
transformation. Then the following are equivalent: 


(i) 8 is pointwise monic, i.e. 04 is monic for each A € ob C. 
(ii) For each monomorphism m: A’ — A in C, the naturality square 


Bar 
SA’ Ss TA’ 


is a pullback. 


64 REGULAR AND CARTESIAN CLOSED CATEGORIES A1.6 


Moreover, if these conditions hold, then 04 is an isomorphism for each 
subterminal object A of C. 


Proof (i) = (ii): Consider first the generic monomorphism T: 1 — Q. In the 
naturality square 


6 
$1 ———> T1 


ST TT 


On 
SQ ———> TQ 


the top edge is an isomorphism since § and T both preserve 1, and the bottom 
edge is monic by assumption; it is easy to see that these conditions force it to be 
a pullback. Now every m: A’ — A is a pullback of T, and S and T both preserve 
pullbacks; so a straightforward diagram-chase shows that the naturality square 
for m is also a pullback. 

(ii) = (i): Given an object A of C, apply (ii) to the diagonal morphism 
A:A= AxA. Since both S and T preserve binary products, this yields a 
pullback square 


6 
SA A 
A A 
GA x O4 


SA x SA ———_>TAxTA 


but this says precisely that 04 is monic. 
The final assertion is immediate from (ii), since we have already noted that 
6, must be an isomorphism, and @, is a pullback of 6, if A is subterminal. O 


In contrast to 1.5.15, the regularization Reg(C) of a cartesian category C 
does not in general inherit a subobject classifier from C. The reason may be seen 
from 1.3.10(d): if we take C to be the category of directed graphs, then C has 
a subobject classifier by 1.6.6, but Reg(C) does not because, as we saw after 
1.4.17, it is locally small but not well-powered. 

The example of a functor category [C, Set] shows that a subobject classifier 
need not be finite in any very strong sense: for example if C is the ordered set N 
of natural numbers, then there are infinitely many cosieves on each number n, 
namely the empty cosieve and the cosieve of all numbers > m, for each m > n. 
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We can picture the transition maps Q(n) — Q(n + 1) as follows: 


e ——— m 6 


JS / 
SSS. 


è ———_—_—_>- 6 


Here the generic subobject is the morphism 1 — Q corresponding to the top row 
of dots; the bottom row corresponds to the empty cosieve. 

However, Q does satisfy Dedekind’s definition of finiteness: all its monic 
endomorphisms are isomorphisms (although it may have epic endomorphisms 
which are not invertible, as can be seen from the picture above). In fact we can 
be more specific: 


Lemma 1.6.10 Let f: Q — Q be a monomorphism. Then ff = in. 
Proof First form the pullbacks 
U ———_> 1 and 


V 1 
f 


Q ——_—> 2 Veste 


EE 


Since f is monic, U is a subobject of 1, and V is a subobject of 1 contained in U. 


Now in the diagram 

V — V U ———— 1 
| |. f | 
U 1 l ———> 20 


> 


a a, (ents Geert 
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all three squares are pullbacks, so the bottom composite must equal g. Thus 
ffg=fTU = 9, ie. the square 


commutes; but since f f is monic it must be a pullback, and hence fff = f since 
both classify the subobject g of Q. But f is monic, so ff = Ig. 


Note that the object U in the above proof is in fact isomorphic to 1. In the 
opposite direction, let V — 1 be a subterminal object, and let v: 1 — Q be its 
classifying map. v itself must be monic, and so has a classifying map f: Q — Q; 
when is f an isomorphism (equivalently, monic)? It is not hard to verify that f 
is the composite 


1 
O26) 2- S025 6 


where <> is the bi-implication of the Heyting semilattice structure on 2 (cf. 1.6.3), 
or equivalently the classifying map of the diagonal subobject Q — Q x Q. Now, 
in any Heyting semilattice H, we have 


((ueev) Sv) =(((usv) A (v>4u)) >) A (v>((usv) A (v>4))) 
ge ad ae cada eek Nia ET 
=((u>u)>((uSv)Sv)) ALA (v>4) 
=((u>v)Sv)A(v>1). 


It follows that we always have ((u<v)<@v) > u, and that we have equality 
here iff ((u =v) =v) < ((v=>u) =u). If this condition holds for all u, we say 
v is a widespread element of H; in the case when H is a Heyting algebra (i.e. 
has finite joins as well as meets), it is equivalent to saying that the principal 
filter fT (v) = {u € H |v < u} is a complemented lattice (and hence a Boolean 
algebra). From the above discussion, we may conclude 


Corollary 1.6.11 In a cartesian category with a subobject classifier Q, the 
automorphisms of Q correspond bijectively to (isomorphism classes of) subob- 
jects V of 1 which are ‘internally widespread’, in the sense that V x A >= A is 
widespread in Sub(A) for all A. 0O 


Further, the group operation on automorphisms corresponds to the 
bi-implication operation on subobjects of 1; it is again easy to verify that, in 
any Heyting semilattice, this operation is associative when restricted to the 
widespread elements, and it makes them into a group of exponent 2. 
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For example, let C be a small category. Subobjects of 1 in [C, Set} correspond 
to cosieves on C (that is, to subsets R C ob C such that the codomain of any 
morphism with domain in R is also in R), and it may be verified that a cosieve 
R is internally widespread iff every morphism with codomain not in R is an 
isomorphism; equivalently, iff the objects not in R are all strict in the sense 
defined after 1.4.1. Thus the group of automorphisms of Q may be identified 
with the set of isomorphism-closed subsets of the set of all strict objects of C 
(the group operation being symmetric difference). 

We shall meet the notion of widespread subobject again, in a different 
context, in Section D4.5. 


Suggestions for further reading: Borceux & Veit [168], Engenes [335], 
Johnstone [506, 531]. 


A2 
TOPOSES- BASIC THEORY 


A2.1 Definition and examples 


We are now ready to present the definition of an (elementary) topos, which is 
the central notion of this book: it is a category having all the good ‘set-like’ 
properties discussed in the previous chapter, except (possibly) for Booleanness. 
It is a remarkable fact that all these properties are implied by the conjunction 
of just two of them: 


Definition 2.1.1 A topos is a properly cartesian closed category with a 
subobject classifier. 


The proof that this implies all the other good properties will occupy much 
of Sections A2.2-A2.4; the present section is largely devoted to setting up some 
examples which we shall use later on in the book. Before that, however, we 
must mention that the above definition of a topos, though it is appreciably 
simplified from that originally adopted by F. W. Lawvere and M. Tierney when 
they began the subject in 1969, is not the simplest possible; one can combine the 
exponentials and the subobject classifier into a single structure, that of power 
objects. 

Let € be a topos (as defined above), and A an object of €. We write PA 
for the exponential Q4, and €4>—> PA x A for the subobject classified by the 
evaluation map ev: PA x A — 9. (The reason for this notation is that, in Set, 
PA may be identified with the power-set of A — recall that Q is just a two-element 
set in this case — and €, is then the subset 


{(A’,a) |a€ A'}, 


i.e. the membership relation between subsets and elements of A. Admittedly, 
the latter is ‘back to front’, which has led some authors to denote it by 34 
(or 4 3) rather than €4, but we prefer to use the more familiar symbol.) This 
subobject has the following universal property: given an object B and a subobject 
R > B x A (we think of R as a relation from B to A, a point of view which will 
be extensively explored in Chapter A3), there is a unique morphism r: B + PA 
(namely the exponential transpose of the classifying map B x A — Q of R) for 
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which there is a pullback square 


R ————> Ea 


rx] 
Bx A——>PAxA 


In a general cartesian category E, by a power object of an object A we mean 
an object PA equipped with a subobject €4~> PA x A satisfying the above 
universal property. (We call the morphism r above the name of the relation R; 
as with subobject classifiers, it is easy to see that the mapping from relations 
to their names is a bijection from isomorphism classes of objects of Sub(B x A) 
to morphisms B — PA.) We say £ has power objects if there is an operation 
assigning to each object of £ a power object for it. By the preceding discussion, 
a topos has power objects; conversely, it is clear that a cartesian category with 
power objects has a subobject classifier, since the latter is (up to isomorphism) 
the same thing as a power object for the terminal object 1. We shall see eventually 
that a cartesian category with power objects is also cartesian closed (in fact, 
locally cartesian closed) and hence a topos; but since we shall not arrive at this 
result until the middle of Section A2.3, we shall for the moment use the term weak 
topos for a cartesian category with power objects, in order to avoid confusion. 

If F: € — F is a cartesian functor between (weak) toposes, we have for each 
object A of £ a comparison morphism 


oa: F(PA) —> P(F(A)), 


which is the name of the relation F(€4) — F(PA x A) = F(PA) x FA (recall 
that a cartesian functor preserves monomorphisms as well as finite products). 
We say F is a logical functor if @, is an isomorphism for all A; equivalently, if 
F (PA), equipped with the relation F(€4), is a power object for F(A) in F, for 
all A. Once we have proved that a weak topos is a topos, it will follow that a 
logical functor is cartesian closed, as defined in Section A1.5; conversely, if F is 
cartesian closed and ¢, is an isomorphism then F is logical, since 64 may be 
factored as 


84,9 ra 
F (98) —> FO — a5 


(where Q¢ and Qr denote the subobject classifiers of E and F respectively). 
Thus a cartesian functor is logical iff it ‘preserves exponentials and the subobject 
classifier’. 

We now turn to our list of examples. 


Example 2.1.2 The category Set is of course a topos. So too is the full sub- 
category Set; C Set of finite sets (and the inclusion functor Set; — Set is 
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logical); more generally, the category Set, of sets of cardinality less than «x is a 
topos provided x is a limit power cardinal, i.e. a cardinal such that à < « implies 
Qn < K: 


Example 2.1.3 For any small category C, the functor category [C, Set] is a 
topos, as we saw in 1.5.5 and 1.6.6. Again, if C is a finite category, then |C, Set ;| 
is a topos; the proofs of the results just cited still work with ‘set’ replaced by 
‘finite set’ throughout. We shall see later (in Section B2.3) that if C is a finite 
category and £ is any topos whatever, then [C,€] is a topos. 


Example 2.1.4 Let G be a group, i.e. a small groupoid with one object. A 
functor G — Set is the same thing as a (left) G-set, ie. a set on which G acts 
(associatively) by permutations. So, by 2.1.3, the category [G, Set] of G-sets 
(and G-equivariant maps) is a topos. It is worth describing the topos structure 
explicitly in this case: Q is a two-element set with trivial G-action (cf. 1.4.12 
and 1.6.5), and the exponential B4 is, by the construction of 1.5.5, the set of 
G-equivariant maps G x A — B (where G acts on itself by left translations). But 
such a map f: G x A — B is entirely determined by its restriction to {e} x A, 
where e is the identity element of G, since f(g,a) = g - f(e,g~* - a); and this 
restriction may be any function A — B. So we may re-interpret BÅ as the set 
of all functions A — B; the G-action is then by conjugation (i.e. (g- f)(a) = 
g-(f(g~'-a)). In particular, we note that the forgetful functor [G, Set] — Set is 
logical. It is also easy to verify that the above description of Q and exponentials 
yields a topos structure on the category [G, Sets] of finite G-sets, even when G 
is an infinite group; similarly, if G is a ‘large group’ (i.e. a one-object groupoid 
which is not small) then [G, Set] is still a topos. 


Example 2.1.5 Generalizing the last observation, let C be a category such 
that each slice category C/B, B € ob C, is equivalent to a small category, 
even though C itself may not be small. (Any groupoid G satisfies this condition, 
because any slice G/B is a trivial connected groupoid, and hence equivalent 
to 1; other examples of such categories include the ordered class On of ordinals, 
and the category Setm of sets and injective functions.) Then it is not hard to 
see that (the duals of) the constructions of 1.5.5 and 1.6.6 ‘preserve smallness’ 
(that is, they yield set-valued rather than class-valued functors), and hence that 
(C°P, Set} is a topos. (Similar examples can be constructed with ‘finite’ replacing 
‘small’.) We note that the toposes which arise in this way, unlike those we have 
considered hitherto, need not be locally small; for example, in [On°?, Set], there 
is a proper class of distinct subfunctors of (the constant functor with value) i, 
and hence a proper class of morphisms 1 — Q. 


Example 2.1.6 Generalizing in a different direction, let G be a topological 
group. We say that a G-set A is continuous if the action map G x A — A is 
continuous for the discrete topology on A; equivalently, if for each a € A the 
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stabilizer subgroup (or isotropy subgroup) 
Ga = {9 €G|g-a=a} 


is open in G. We shall show that the full subcategory Cont(G) C [G, Set] of 
continuous G-sets is a topos. First we observe that the inclusion Cont(G) — 
[G, Set] has a right adjoint: given an arbitrary G-set A, the subset 


Ac = {a € A | Ga is open in G} 


is a union of G-orbits (because elements in the same G-orbit have conjugate 
stabilizers) and hence a sub-G-set of A; and the inclusion A, — A is univer- 
sal among morphisms from continuous G-sets to A (because, if f: B— Aisa 
G-equivariant map and b € B, the stabilizer G; contains G,, and hence is 
open if G is). Now Cont(G) is closed under finite limits in [G, Set] (and hence 
cartesian); it follows easily that we may construct exponentials in Cont(G) by 
constructing them in [G,Set] and then applying the functor A — Ae. Finally, 
the subobject classifier of [G, Set] lives in Cont(G); and since every sub-G-set 
of a continuous G-set is continuous, it is still a subobject classifier there. 


There is some spurious generality in allowing G to be an arbitrary topo- 
logical group in Example 2.1.6. Of course, if a connected topological group 
acts continuously on a discrete set, the action must be trivial; and so we may 
as well assume that G is totally disconnected, since for a general G we have 
Cont(G) = Cont(G/Gp), where Go is the connected component of the identity 
element of G. In fact we can go slightly further than this, and restrict ourselves 
to groups which are nearly discrete, i.e. such that the intersection of all open 
subgroups is the trivial subgroup {e} (again, if it isn’t, we can factor it out with- 
out affecting Cont(G)); the additive group of rationals is a totally disconnected 
group which is not nearly discrete. (This is a bad counterexample, since the 
uniform completion of the rationals is connected; there are complete counterex- 
amples, but they are much harder to construct — see {1121].) A further possible 
restriction is to the class of those (nearly discrete) topological groups in which 
the open subgroups form a base of neighbourhoods of the identity; indeed, from 
one point of view it would be more sensible to describe this example as a con- 
struction on groups equipped with a normal filter of subgroups (i.e. one closed 
under conjugation), rather than with a topology. (Note that every normal filter 
of subgroups F generates a group topology, in which we take the cosets gH, 
H € F, as a base of neighbourhoods of an arbitrary g € G. But a group topol- 
ogy is not uniquely determined by its filter of open subgroups, even if it is nearly 
discrete.) 

However, we cannot cut down to the class of pro-discrete groups, i.e. groups 
which are expressible as inverse limits of discrete groups (equivalently, groups in 
which the intersection of all open normal subgroups is trivial). Two important 
examples of nearly discrete groups, which we shall encounter in Sections D3.4 and 
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C5.4, are the group of all permutations of N and the group of all order-preserving 
permutations of Q (topologized in each case by saying that the pointwise sta- 
bilizers of finite subsets of N (resp. Q) form a base of neighbourhoods of the 
identity element), neither of which has any proper open normal subgroups. 

The foregoing remarks form the beginnings of a theory of ‘Morita equivalence’ 
for topological groups, where we call two such groups Morita-equivalent if their 
toposes of continuous actions are equivalent. In B3.4.14(d) we shall generalize the 
result of 2.1.6 from topological groups to topological groupoids (and subsequently 
to localic groupoids); and in Section C5.3 we shall investigate the general notion 
of Morita equivalence for such groupoids. 


Example 2.1.7 A seemingly minor, but significant, variation of Example 2.1.6: 
again, let G be a topological group, but this time consider uniformly continuous 
G-sets, i.e. those G-sets A for which the pointwise stabilizer of the whole of A 
(the subgroup of elements of G acting as the identity on A) is open in G. It is 
easy to see that if A and B are arbitrary G-sets, the pointwise stabilizer of A x B 
(resp. Bå) equals (resp. contains) the intersection of the pointwise stabilizers of 
A and B; hence the full subcategory Unif(G) C [G, Set} of uniformly continu- 
ous G-sets is closed under finite limits and exponentials. It is also closed under 
arbitrary subobjects, and contains the subobject classifier of [G, Set]; hence it is 
a topos, and the inclusion functor Unif(G) — [G, Set] is logical (as is the inclu- 
sion Unif(G) — Cont(G) - although Cont(G) — [G, Set] is not in general). In 
this case there is no loss of generality in restricting our attention to pro~discrete 
groups, since the pointwise stabilizer of a G-set is a normal subgroup of G; thus 
(for example) if G is either of the permutation groups mentioned in the last 
paragraph but one above, then Unif(G) is isomorphic to Set. However, if we 
take G to be the profinite completion of the additive group Z of integers (i.e. 
the inverse limit of its finite quotients), then continuous G-sets may be identified 
with Z-sets in which every orbit is finite, and such a set is uniformly continuous 
iff there is a finite bound for the sizes of its orbits; thus Unif(G) is not equivalent 
either to Set or to Cont(G). Note also that in this case Unif(G) does not have 
infinite coproducts, and hence cannot be coreflective in [G, Set]. 


Example 2.1.8 Let X be a topological space, and let O(X) denote the lattice 
of open sets of X, considered as a (small) preorder. A presheaf (of sets) on X 
is, by definition, a functor O(X)°? — Set; thus the category [O(X)°?, Set] of 
presheaves on X is a topos. If F is a presheaf on X, we call the elements of 
F(U) (for U an open set in X) sections of F over U, and we describe the map 
F(U) — F(V) induced by an inclusion V C U in O(X) as restriction of sections 
from U to V. The example we have in mind is that, for any object p: Y — X of 
Sp/X, we may define a presheaf T (p) by taking I'(p)(U) to be the set of sections 
of p over U (i.e. continuous maps s: U + Y such that ps is the inclusion map 
U — X); in fact, as is easily seen, T defines a functor Sp/X — [O(X)°?, Set]. 
Presheaves in the image of this functor have the property that we can ‘patch 
together’ sections over the members of an open covering of U to obtain sections 
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over U: abstracting from this example, we define a presheaf F to be a sheaf if, 
given any open covering (U; | i € I) of an open set U, and any family (s; | 7 € J) 
of elements of F(U;) which are compatible in the sense that, for each pair (i,j), 
the restrictions of s; and s; are equal in F(U;NU;), there exists a unique s € F(U) 
whose restriction to each U; equals s;. We write Sh(X) for the full subcategory 
of [(O(.X)°P, Set] whose objects are sheaves. 

We claim that Sh(X) is a topos. The full proof of this is deferred until 
Section A4.4, where we shall in fact prove a much more general result; but we 
sketch the argument here for completeness. First, since limits in [O(X)°?, Set] 
are computed pointwise, Sh( X) is closed under (finite) limits (in fact it is reflec- 
tive in [O(X)°?, Set]), and hence cartesian. Next, it is an exponential ideal in 
[O(.X)°P, Set]: if G is a sheaf and F is any presheaf, then the exponential GF, 
constructed as in 1.5.5, is a sheaf. Thus in particular Sh(X) is cartesian closed. 
Finally, we define a presheaf Q on X by taking Q(U) to be the set of open subsets 
of U, the restriction map Q(U) — Q(V) being (—) NV; this is easily verified 
to be a sheaf (patching is just forming unions in this case), and we have a map 
T:1—4Q defined by Ty = U € Q(U). Now if F’ — F is a monomorphism 
in Sh(X) (for convenience we assume that the values F’(U) of F’ are actual 
subsets of F(U)) and s € F(U), there is a unique largest V C U such that the 
restriction of s to V belongs to F’(V) (namely the union of all such V), because 
F’ is a sheaf; so defining hy(s) to be this V produces a morphism h: F — Q, 
which may be verified to be the classifying map of F’ — F. 


In Section C1.3 we shall show that the functor T: Sp/X — {O(X)°?, Set] 
defined in the preceding example restricts to (one half of) an equivalence 
LH/X ~ Sh(X), so that LH/X is also a topos. However, we shall provide 
an independent proof of the latter fact in 4.2.4(e) below. 

It is worth noting from Examples 2.1.6 and 2.1.8 that the properties of 
preserving Q and preserving exponentials are independent, even for cartesian 
functors which are full and faithful: the inclusion Cont(G) — [G, Set] preserves 
Q but not exponentials, and the inclusion Sh(X) — [O(X)°?, Set] preserves 
exponentials but not 2. 


Generalizing from the last example, we may define a notion of sheaf on any 
small category C, provided it is equipped with a suitable notion of ‘covering’, as 
follows: 


Definition 2.1.9 Let C be a small category. By a coverage on C, we mean 
a function assigning to each object A of C a collection T(A) of families 
(fi: Ai > A | t € I) of morphisms with common codomain A (called T- covering 
families), such that 


(C) If (fj: Ai > A | i € I) is a T-covering family and g: B — A is any morphism 
with codomain A, there exists a T-covering family (hj: B, > B | j € J) such 
that each gh; factors through some f;. 


A site is a small category equipped with a coverage. 
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The old name for coverage is ‘Grothendieck (pre-)topology’; but since this 
concept is a very considerable abstraction from the notion of a topological space 
(and, in particular, has nothing to do with the notion of a topology on a set), 
the latter name seems unhelpful. Also, the definition of a Grothendieck topology 
is normally taken to include additional ‘stability properties’ of covering families, 
besides (C) above, which are satisfied in many examples but are really irrelevant 
to the definition of a sheaf (we shall meet these conditions in Section C2.1, when 
we introduce the notion of Grothendieck coverage); and that of a Grothendieck 
pretopology normally includes the additional assumption that the underlying 
category C has pullbacks, and replaces (C) by the stronger condition 


(C’) H (fi: Ai — A | i € T) isa T-covering family and g: B — A is any morphism 
with codomain A, then the family (g*(f,) | ¢ € I) is T-covering. 


Again, (C’) is satisfied in most of the examples we shall meet, but it is not 
necessary for the purpose of defining sheaves, and the extra generality allowed 
by (C) will be useful in some cases. 

We say a functor F: C°P — Set satisfies the sheaf axiom for a family of 
morphisms (f;: A; ~ A | i €e J) if, whenever we are given a family of ele- 
ments s; E€ F(A;) which are compatible in the sense that, whenever g: B — A; 
and h: B - A; satisfy fig = fj;h (here i and j need not be distinct), we have 
F(g)(s;) = F(h)(s;), then there exists a unique s € F(A) such that F(f;)(s) = si 
for each 7 € I. (Once again, this definition may be simplified if C has pullbacks; it 
then suffices to check the compatibility of s; and s; on the pullback of f; against 
f;, rather than on arbitrary pairs (g,h) as above. In this case, the condition 
may conveniently be expressed diagrammatically: F satisfies the sheaf axiom for 
(A; ~ Alice JL) iff 


F(A) —> [[ #4) =< [[ eu: xa Åj) 


ier i,j 


is an equalizer diagram.) We say F is a T-sheaf if it satisfies the sheaf axiom 
for every T-covering family; and we write Sh(C,T) (or simply Sh(C), if T is 
obvious from the context) for the full subcategory of [C°P, Set] whose objects 
are T-sheaves. 


Lemma 2.1.10 If (C,T) is a site, then Sh(C,T) is a topos. 


Proof Again, we shall only sketch the details here; for a more detailed proof, 
we refer to 4.4.5 below, or to C2.2.6 (where we shall prove a more general result, 
under a weaker hypothesis on C than smallness). The fact that Sh(C, T) is closed 
under finite limits (indeed, under arbitrary limits) in [C°?,Set] is immediate 
from the form of the definition. It is also an exponential ideal, and hence carte- 
sian closed. We recall from 1.6.6 that the subobject classifier of [C°P, Set] was 
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defined by 
Q(A) = {sieves on A in C} . 


We define a sieve R on A to be T-closed if, given f: B — A in C, the existence 
of a T-covering family (g;: B; > B | j € J) such that each fg; belongs to R 
forces f € R; and we write Q7(A) G 2(A) for the set of T-closed sieves. It then 
follows easily from condition (C) that Qr is a subfunctor of Q; moreover, every 
sieve has a T-closure (the intersection of the T-closed sieves which contain it), 
and using this we may define a notion of patching for closed sieves and hence 
show that Qr is a sheaf. Finally, if F is a sheaf, it follows easily from the proof 
of 1.6.6 that the classifying map F — Q of a subfunctor F’ >= F factors through 
Nr — Q iff F’ is a sheaf; hence Qr is a subobject classifier for Sh(C, T). O 


It is clear that if C = O(X) for a space X and we take T(U) to consist of all 
jointly-surjective families of open inclusions (U; C U Į i € F), then the notion of 
T-sheaf coincides with that of a sheaf on X; so 2.1.10 includes 2.1.8 as a special 
case. But there are many other examples of toposes which can be constructed as 
Sh(C,T) for suitable sites (C, T), and which will play important rôles in various 
parts of this book. We list some of them here: however, we should remark that 
for a given topos € there may be many different sites of definition for E (that is, 
sites (C,T) such that € ~ Sh(C,T)), and the sites which we describe below are 
not necessarily the most convenient ones for defining these particular toposes. 
In C2.2.3 we shall prove a ‘comparison Jemma’ which enables us to determine 
when two sites give rise to equivalent toposes of sheaves. 


Examples 2.1.11 (a) Let C be a small regular category. The regular coverage 
on C has, as covering families on A, all singletons {f} where f is a cover (in 
the sense of Section A1.3) with codomain A. Condition (C”) is satisfied by the 
very definition of a regular category. We note that Proposition 1.3.4 has an 
interpretation in terms of this coverage: it says that every representable functor 
C(—, A): C°P — Set is a sheaf for it (since such functors take coequalizers in C 
to equalizers in Set). (A site is called standard if its underlying category C is 
cartesian and all the representable functors C°? — Set are sheaves; we shall see 
in C2.2.8 that every topos generated by a site can be generated by a standard 
one, and in fact most of the sites one encounters in practice are standard.) 
Thus the Yoneda embedding C — [C°P, Set] factors through Sh(C); the resulting 
functor Y: C — Sh(C) is still full, faithful and cartesian, but it also preserves 
covers (in general, if f: A — B is a morphism of C, the image in [C°P, Set] of 
Y(f): C(-, A) — C (-—, B) is (the subfunctor of C (—, B) corresponding to) the 
sieve on B of all morphisms factoring through f, and if f is a cover it is easy 
to see that this cannot be contained in any proper subsheaf of C (—, B)). Thus 
we conclude that any small regular category admits a full and faithful regular 
functor into a topos. It is worth noting, too, that if C happens to be cartesian 
closed as well as regular, then this structure is also preserved by the embedding 


76 TOPOSES-BASIC THEORY A2.1 


C — Sh(C,T), by 1.5.6(i) and the fact that Sh(C,T) is an exponential ideal in 
[C°P, Set]. 

(b) Let C be a small coherent category. The coherent coverage on C simi- 
larly has as covering families all finite families (fi: A; > A | 1 < ¿i < n) which 
are covering in the sense defined in Section A1.3 — equivalently, such that the 
union of the images of the f; is the whole of A. (In particular, the initial object 
0 of C is covered by the empty family.) Once again, this site is standard; the 
extra information which we need (in addition to 1.3.4) to show that the repre- 
sentable functors are sheaves is contained in 1.4.1 and 1.4.3. (The toposes which 
arise from coherent sites are called coherent toposes (unsurprisingly, but perhaps 
unfortunately — as we shall see, every topos is a coherent category); they will 
be investigated in Section D3.3.) And once again, it may be verified that the 
Yoneda embedding, as a functor C — Sh(C), preserves covers and finite unions, 
and hence any small coherent category admits a full and faithful coherent functor 
into a topos. 

(c) The next example is a synthesis of Examples 2.1.4 and 2.1.8. Let G be 
a (discrete) group, and X a space on which G acts by homeomorphisms. The 
topos Shg(X) of G-equivariant sheaves on X is defined to be Sh(Oc(X),T), 
where the category Og(X) may conveniently be defined by a protocategory, 
as in Section Al.1: its objects are the open subsets of X, its protomorphisms 
are the elements of G, and its source-target predicate says that g: V — U iff the 
homeomorphism g: X — X induced by g maps V into U. Composition is induced 
by composition in G (that is, O¢(X) is a category structured over G, as defined 
in 1.1.5). The coverage T is defined by saying that a family (g;: V; => U |i €T) 
is covering iff the union of the images of the gi: V; — U is the whole of U. It is 
not hard to see that Og(X) has pullbacks, and that T satisfies condition (C’). If 
Y is another space on which G acts, and p: Y — X is a G-equivariant continuous 
map, then the assignment U ++ I'(p)(U), defined in 2.1.8, extends to a functor 
Og(X)°? — Set: if s: U — Y is a section of p over U, and g: V — U in Og(X), 
then the composite 


7 =1 
oS ey E 

is a section of p over V, which we define to be I'(p)(g)(s). It is also easy to verify 
that T (p) is a T-sheaf, and that I defines a functor [G, Sp]/X — Sh¢(X), where 
[G, Sp] is the category of spaces equipped with a G-action and G-equivariant 
continuous maps. Once again, this functor becomes part of an equivalence if we 
restrict it to the full subcategory [G,LH]/X C [G,Sp]/X whose objects are 
G-spaces equipped with a G-equivariant local homeomorphism into X. 

(d) Let C be a small full subcategory of Sp which is closed under passage to 
open subspaces (at least in the sense that, if X is in C and U is an open subspace 
of X, then some space homeomorphic to U is in C). For example, we might take 
(a suitable skeleton of) the category of all topological spaces of cardinality less 
than x, for a suitable cardinal «x. On C we have a coverage defined by saying that 
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the covering families are open coverings in the usual sense, i.e. jointly-surjective 
families of open inclusions. (Condition (C’) is satisfied because the pullback of 
an open inclusion along an arbitrary continuous map is an open inclusion.) Once 
again, this site is standard (at least if C is closed under finite limits in Sp): the 
assertion that the representable functors are sheaves is essentially the statement 
that, if X and Y are spaces and f: X — Y is a function which is continuous 
when restricted to each set in some open covering of X, then f is continuous. 
This example can also be given in a G-equivariant version (where G is a fixed 
group), similar to (c) above. 

(e) Similar examples to (d) may be constructed from appropriate full sub- 
categories of the category Mf of (paracompact) smooth manifolds and smooth 
maps, again taking open coverings to define the coverage. A similar assertion to 
that at the end of the last paragraph holds with ‘smooth’ replacing ‘continuous’; 
so once again the representable functors on such a site are sheaves. However, 
the category Mf does not have all pullbacks (for example, if f: R—R is a 
smooth function such that f(x) 4 0 iff x > 0, then the functions «+ (z,0) and 
zr (zx, f(z)) from R to R? have no pullback); so this is one example where we 
obtain the benefit of not insisting, in our definition of a site, that the under- 
lying category should have pullbacks. (On the other hand, the pullback of an 
open inclusion along an arbitrary morphism does exist in Mf; and the strong 
condition (C”) is still satisfied in this case.) 

(f} There are also ‘algebraic’ versions of the last two examples, in which the 
category of spaces or manifolds is replaced by the category of affine schemes 
(the opposite of the category of commutative rings). For definiteness, let k be 
a field, and let C°P be the category of finitely-presented k-algebras (= quotients 
of polynomial rings in finitely many variables over k). The Zariski coverage Z 
on C consists of jointly-surjective families of Zariski-open inclusions; in algebraic 
terms, these are families of homomorphisms of the form 


(A —> Als] ] s € S) 


where A € ob C and S is a set of elements of A which is not contained in any 
proper ideal of A. For any homomorphism f: A — B of k-algebras, we have a 
pushout square 

A ————> A[s"] 


f 


B ——> B{(f(s))~"] 
in C°P, from which it is easy to verify that the Zariski coverage satisfies condition 
(C’). The topos Sh(C, Z) is called the Zariski topos over k. 
(g) Of course, the open-set lattice O(X) of a space X, with the usual notion 
of open covering, provides an example of a site whose underlying category is 
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a poset. But there are other such examples: we describe a particular one here, 
which we shall meet again in C3.3.11(c) and D4.1.9. Let A and B be two fixed 
infinite sets, and let P be the set of partial functions A — B with finite domain 
(that is, functions from finite subsets of A to B), ordered by the relation ‘is an 
extension of’ (i.e., f < g if dom g C dom f and g(a) = f(a) whenever g(a) is 
defined). We define a coverage T by saying that T(f) consists of all sets of the 
form 


{géP\g< f and be im g} 


where b ranges over the elements of B. (Note that here, as we usually do when 
dealing with coverages on posets, we are thinking of covers of f as sets of elements 
of P lying below f, rather than as families of morphisms with codomain f.) It is 
again an easy exercise to verify that the representable functors P°P — Set are 
all sheaves; but (P, T) is not a standard site, because P does not have pullbacks 
(and here the use of the weak form (C) of the stability condition is essential: 
it is also easy to verify, since if (g; | ¿ € T) is the cover of f associated with a 
particular element b of B, and h < f, then the cover of h associated with the 
same element b satisfies the condition). 

An important property of this particular site is that the disjoint union (i.e. 
the coproduct in |P°P, Set]} of any family of sheaves is again a sheaf; in general 
the category Sh(C,T) is not closed under coproducts (or any other colimits) in 
[C°P Set]. In particular, since the constant functor with value 1 is a sheaf, we 
see that all constant functors P°P — Set are sheaves. To verify this property, 
suppose we have a family of sheaves (F; | i € I), a particular cover (g; | j € J) € 
T(f), and a compatible family (z; | 7 € J) of elements of the disjoint unions 
Lic: Fi(93), 7 € J. We claim that the z; must in fact all belong to F;(g;) for 
the same index i, so that they can be uniquely ‘patched together’ to an element 
ze Fi(f) C Lie, Fil f). For if g; and g; satisfy 9; < gp < f, then it is clear 
that z; and z; must belong to the same summand of the coproduct; and for 
any two elements g; and go we can find a zigzag 


gı S93 >95 S94 = G2 


of elements of the covering family linking them. Specifically, suppose the cover is 
that associated with an element b of B; we assume b is not in the image of f (if 
it is, then f itself belongs to the cover and we have nothing to prove). Then we 
take g3 to be an extension of f defined at just one more element a, of A than f, 
chosen so that gi(a@1) = 6; and gq is similarly defined from go. If g3 and g4 are not 
identical (because the new elements a@,, a2 of their domains are not equal) then 
we take gs to be their unique common extension with domain dom f U {a;, a2}. 
(We shall return to this ‘zigzag condition’ on covers in C3.3.10.) 

(h) Given a category C, it is easy to see that the collection of all singleton 
families (f), f a morphism of C, satisfies the coverage axiom (C) iff, given any 
two morphisms f: A > C, g: B — C of C with common codomain, we can find 


A2.1 DEFINITION AND EXAMPLES 79 
morphisms h, k forming a commutative square: 


E dF 


D 
i 
B 9 


SAEN g 


ty 


For monoids, this condition is commonly known as the right Ore condition, and it 
seems sensible to use this terminology for categories as well. It is clearly implied 
by the condition that C has pullbacks (in which case the coverage satisfies the 
strong condition (C’)), but is considerably weaker. There are several interesting 
examples of sites where all singletons cover; one which will reappear at several 
points in this book is that obtained by taking C to be the dual of the category 
Set ;,, of finite sets and monomorphisms. The corresponding topos Sh(Set*, , T) 
is commonly known as the Schanuel topos, in honour of S. H. Schanuel who 
was the first to investigate its properties; it was also extensively exploited by 
A. Joyal [552]. 

For the present, we note an interesting characterization of the objects of 
the Schanuel topos: they are exactly the functors Set s,, — Set which preserve 
pullbacks. (Note that Set.,, has pullbacks — indeed, the inclusion Set s,, — Sets 
creates them — though it does not have a terminal object.) To see this, note first 
that a functor F: Set sm — Set satisfies the sheaf axiom for the singleton family 
(f: [m] — [n]) (where we use the notation [n] for an n-element finite set) iff 
F(f) is injective, and its image consists precisely of those x € F({n]) satisfying 
F(g)(x) = F(h)(x) whenever gf = hf. But the first condition is implied by the 
assertion that F preserves the pullback of f against itself; for the second, we can 
always find a pair (g, h) such that 


[m] >———> [n] 
Í g 


[n] >"> [p] 


is a pullback, so the condition that F preserves pullbacks implies that if 
F(g)(xz) = F(h)(x) then there exists y € F([m]) with F(f)(y) = x. Conversely, 
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if F is assumed to be a sheaf, and we are given an arbitrary pullback square 


ee a 
g h 


Boetes i 


then given elements z € F([n}),y € F([p]) with F(h)(£) = F(k)(y), we claim 
that for any pair of morphisms (a,b) with af = bf we can extend the above 
diagram to one of the form 


[m] cau [n] = fr] 
g h e 


C 
p) >——> [d Sis 


with ea = ch, eb = dh and ck = dk (just take s = r + q — n), and so we 
necessarily have F(e)F(a)(x) = F(c)F(k)(y) = F(d)F(k)(y) = Fle) F()(z), 
whence F'(a)(z) = F(b)(z) since F(e) is injective. Thus there exists a unique 
z € F({m]) with F(f)(z) = z; and we also have F(g)(z) = y since F(k) is 
injective. So F preserves the above pullback. 

(i) It is easily verified that if a covering family contains a split epimorphism, 
then every functor C°P — Set satisfies the sheaf axiom for it. Hence a group does 
not admit any nontrivial coverage: if T is a coverage on a group G, then Sh(G,T) 
is either the whole of [G°?, Set] (if every T-covering family is nonempty) or 
equivalent to 1 (if the empty family covers the unique object of G). However, 
monoids can admit nontrivial coverages. For example, let M be the free monoid 
on two generators z and y, and let T be the coverage whose only covering family 
(on the unique object x of M) is (x: * > *,y: x > *). The fact that T satisfies 
condition (C) is immediate from the fact that every morphism of M, other than 
the identity, factors through either x or y (that is, every nontrivial word in z and 
y must begin with either z or y). It is easy to see that z and y are monic in M, and 
that there are no pairs (z,t) with zz = yt; hence a T-sheaf is just a right M-set A 
(that is, a functor M°P — Set) for which the map a > (a: x,a: y) is a bijection 
A — Ax A. The category Sh(M,T) is sometimes called the Jénsson-Tarski 
topos, in honour of B. Jónsson and A. Tarski who were the first to investigate it 
(as a variety of universal algebras, not as a topos) in [551]; see also [525]. 
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(j) Another monoid-based example: let N* be the one-point compactification 
of the discrete space N of natural numbers, and let M be the monoid of contin- 
uous endomorphisms of N* (i.e. the full subcategory of Sp whose only object is 
N+). We say that a continuous map f: N+ — Nt is non-degenerate if its image 
is infinite (which implies that f maps the non-isolated point oo of N* to itself). 
We now define a family of morphisms (f;: N* — Nt | 2 € I) to be covering if 


(i) it is jointly surjective, and 


(ii) for any infinite subset S of N, there exists ¿ € I such that f; is non- 
degenerate and has image contained in § U {oo}. 


The verification that this defines a coverage on M is straightforward. Now, for 
any space X, the set Sp(Nt+, X) of continuous maps Nt — X (that is, con- 
vergent sequences in X, together with their limits) has an obvious right M-set 
structure by composition of continuous maps; and it is a sheaf for the coverage 
just described. (The key point in verifying this is the well-known fact that if 
(£n) is a sequence of points of X and Zæ a point such that every subsequence 
of (£n) contains a subsequence converging to Zæ, then (z,) converges to £o.) 
In fact the assignment X ++ Sp(N*,X) defines a functor Sp — Sh(M); this 
functor is faithful, and it is full when restricted to the subcategory of sequential 
spaces (i.e. spaces where convergent sequences determine the topology, in the 
sense that any subset closed under limits of sequences is actually closed; note 
that this subcategory contains all metrizable spaces). More detailed information 
about this example will be found in [507]. 

(k) Our final example, also monoid-based, is a ‘recursive’ as opposed to 
‘continuous’ version of (j), due to P. S. Mulry [864]. Let M be the monoid of all 
(total) recursive functions N > N, and define T to consist of all finite families 
(f;: N-+N{|1 <i <n) such that the union of the images of the f; is the whole 
of N. To verify condition (C) in this case, let (f; | 1 < i < n) be such a family, 
and g: N + N an arbitrary recursive function. For each 7, the function 


n= pm. fi(m) = g(n) 


(where um . P, as usual, denotes ‘the least m such that P’) is partial recursive; 
and so if its domain g~1(im f;) is nonempty we can find a total recursive function 
hi which enumerates it. Now the family (h; | g~'(im fi) 4 8) is T-covering, and 
for each such 7 the function 


ki(n) = um . film) = ghi(n) 


is total recursive and satisfies f,k; = gh;. It is easy to verify that the unique 
representable functor M°P — Set is a T-sheaf (though it would not be so if we 
removed the requirement that members of T be finite); more generally, the cat- 
egory En of enumerated sets in the sense of Ershov [336] may be identified with 
a full subcategory of Sh(M,T). An enumerated set is a pair (A,e), where A is 
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a set and e: N — A is a surjection; a morphism f: (A,e) — (B, d) of enumerated 
sets is a function f: A — B which is ‘tracked’ by some recursive function, i.e. 
for which there exists a recursive g: N — N such that 


N— +N 
f 

A-————> B 
commutes. We define a functor F: En — Sh(M,T) as follows: F(A,e) is the 
set of all functions N — A which can be written in the form eh for some total 
recursive h, with M acting by composition, and if f: (A,e) — (B, d) in En, then 
F(f)(eh) = feh (= dgh, where g is some recursive function tracking f). To show 
that every M-equivariant map F(A,e) — F (B, d) is induced by composition with 
a (unique) function f: A — B, we use the fact that the constant maps N —> A 
may be identified as the M-invariant elements of F(A,e); and this f must be 


a morphism of En, because the composite fe belongs to F(B,d). For more 
information about the properties of this topos, see [864]. 


We conclude this section with a couple of methods of constructing new 
toposes from old, which are often useful. 


Example 2.1.12 [the glueing construction] Let F: € — F be a cartesian 
functor between toposes. We define the glued topos of F, GI(F), by taking its 
objects to be triples (A, B,a) where A and B are objects of £ and F respec- 
tively and a: B — F(A) in F, and morphisms (A, B,a) — (C, D, y) to be pairs 
(f: A— C,g: B — D) such that 


commutes. (In other words, G1(F) is just the comma category (F | F), as usually 
defined; cf. B1.1.4(c).) It is immediate that GI(F) is a category; also that it is 
cartesian, with finite limits constructed pointwise. We claim that for any carte- 
sian functor F between toposes, GI(F) is a topos. To construct the exponential 
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(C, D,y)'4:7.™), we form the pullback 


ornes a 


h y? 


F(C4) ae Fic)?“ pase F(C)? 


(where 6 is the exponential comparison morphism defined in Section 
A1.5), and we take (C,D,y)'45 = (C4,E,h); then the evaluation 
morphism (C4 x A, E x B,h x a) > (C,D,y) is (ev,ev(k x 1g)). Given a 
morphism (f,g): (X,Y,&) x (A,B,a) — (C,D,y), its exponential transpose 
(X,Y,£) — (C4, E, h) has f as its first component and the morphism induced by 
g: Y — DP and F(f) o£: Y — F(C4) as its second. For the subobject classifier 
in GIF), we take (Q2, Q, q), where Q is defined by the pullback square 


Q ————> M) 


(p,a) 


x 
Nr x F(Qe) ——> Ar x Nr 


Given a subobject (m,n): (A’, B’,a’) — (A,B,a), the classifying map 
(f.g): (A, B,a) > (Qe, Q,q) of (m,n) is obtained by taking f to be the clas- 
sifying map of m, and g to be the factorization through Q — Nr x F (Qe) of the 
composite 


(1, a) hx F(f) 
B ———> Bx F(A) ———3 e) 


where h is the classifying map of n. The rest is straightforward verification. 


We note that there are projection functors P,:GI(F)—€ and 
P2: GF) — F; the former is logical, but the latter is merely cartesian (unless 
F is (isomorphic to) the constant functor with value 1, in which case Gl(F) 
reduces to £ x F). Both these functors have full and faithful right adjoints: the 
right adjoint of P, sends A to (A, F(A), 1r;a)), and the right adjoint of Pz sends 
B to (1,B,B — 1). 

Many instances of the glueing construction will be of use to us later on; 
one in particular is worth mentioning now. If £ is a locally small topos, then 
the representable functor £ (1, —): E — Set is cartesian: the topos obtained by 
glueing along it is called the scone (or Sierpiński cone) on £, and denoted sen(£). 
In particular if £ itself is Set, then £ (1, —) is (isomorphic to) the identity functor, 
and scn(Set) is just another name for the functor category [2, Set] (equivalently, 
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for the topos of sheaves on the Sierpiński space, i.e. the two-point space with just 
one closed point). But other simple examples of toposes have more interesting 
scones: for example, scn(Set;) will be a fruitful source of counterexamples later 
on (see, for instance, 4.2.4{d) and 4.5.24 below). The scone construction will also 
be of pivotal importance in Section F3.2. 


Example 2.1.13 (the filterquotient construction] Let £ be a topos, and 
let ® be a filter of subobjects of 1 in £, that is, a collection of subterminal 
objects with the property that U € ©® iff U contains the intersection of some 
finite set of members of ®. (Incidentally, we shall often use letters such as U and 
V to denote subterminal objects in a topos; this is motivated by the example of 
Sh(X), where the subterminal objects correspond to open subsets of X.) By a 
-map A — B in E, we shall mean a morphism A x U — B for some U € ®; we 
say that two -maps f: Ax U — B and g: Ax V — B are equivalent if there 
exists W < UNV in ® such that 


AxW >—> AxU 


f 


A Vee 


commutes. Since ® is closed under finite intersections, it is easy to see that this is 
indeed an equivalence relation. -maps can be composed: given f: Ax U — B 
and g: B x V — C, we have a pullback square 


fxi 
AxUxV—>BxV 


Ax U —— B 


where U xV = UNV € &, and so we may define the composite of g and f 
to be g(f x iy). This composition respects equivalence and is associative up to 
equivalence, so we obtain a category Ea, the filterquotient of € modulo ©, whose 
objects are those of € but whose morphisms are equivalence classes of ®-maps. 
And we have a functor Pe: E —> E, which is the identity on objects and sends 
f: A— B to the equivalence class of f regarded as a morphism A x 1 — B. 
We claim that £p is a topos, and the functor Ps is logical. In proving this, the 
key observation is that, since ® is closed under finite intersections, any finite 
collection of morphisms A; — B; in £ may be simultaneously represented by 
morphisms A; x U — Bi in £, for a single U € ©. Using this, it is easy to see that 
A x B is still the product of A and B in &4; and the equalizer of a parallel pair 
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of morphisms A 3 B in £ may be taken to be the equalizer in £ of any pair of 
representing -maps with the same domain A x U. Similarly, the correspondence 
between morphisms C x A x U — B and morphisms C x U — B^ in £ (where 
U is any member of ) respects the equivalence relation, and so B4 remains an 
exponential in E. Monomorphisms in £ turn out to be equivalence classes of 
®-maps which include at least one monomorphism in £, from which it is again 
easy to verify that Q remains a subobject classifier in £p. 


For example, if € = Sh(X), we may take © to be the filter of open neigh- 
bourhoods of some point x € X, to obtain a topos Sh(X)® which we may think 
of as ‘the germ of the topos of sheaves on X at the point x’, in the sense that 
its objects are sheaves on X but its morphisms are what are generally known as 
germs at x of sheaf morphisms. 

Again, if ® is a filter (in the classical sense) on a set J, we may regard it as 
a filter of subobjects of 1 in Set/J, and so form the quotient (Set /J)s, which is 
commonly called a filterpower (or, if ® is an ultrafilter, an ultrapower) of Set. 
For an ultrafilter @, the topos (Set/J)s shares many of the properties of Set; 
for example, it is Boolean (as are all filterquotients of Boolean toposes), and its 
terminal object Ps(1) is a generator and has just two isomorphism classes of 
subobjects. (We sometimes express the latter condition by saying that a topos 
is 2-valued.) As we shall see later (in C1.4.10), these properties suffice to charac- 
terize Set (up to equivalence) amongst toposes which enjoy the non-elementary 
properties of cocompleteness and local smallness; however, if © is a non-principal 
ultrafilter then (Set/Z) is not cocomplete (see [32]), which demonstrates that 
some such non-elementary condition is essential for any characterization of Set 
amongst toposes. 


Suggestions for further reading: Adelman & Johnstone [32], Dubuc & 
Kelly [306], Johnstone [507], Joyal [552], Lawvere [700-704, 712,716], Mac Lane 
& Moerdijk [751,752], Mulry [864]. 


A2.2 The Monadicity Theorem 


The original definition of an elementary topos, as given by Lawvere and Tierney, 
included the requirement that £ be cocartesian as well as cartesian. It was first 
shown by C. J. Mikkelsen [811] that this condition is redundant; subsequently 
R. Paré [927] gave a different proof of the same result, which established on the 
way the remarkable fact that if £ is a topos then £°P is monadic over £. It is 
this proof which we describe in the present section. 

Let £ be a (weak) topos. We begin by noting that the assignment A++ PA can 
be made into the object-map of a functor €°P — £: if f: A — B is a morphism 
of £, we define Pf: PB PA to be the name of the relation Ej — PB x A 
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defined by the pullback square 


E; ————> EB 


1x f 
PB x A —> PB xB 


(It is straightforward to verify that this is functorial.) P may also be made into a 
covariant functor on the subcategory Em of monomorphisms in £: ifm: A = B 
is monic, we define 3m: PA — PB to be the name of the composite 


1xm 
E4 >——> PA x A> PAxB. 


(This notation is deliberately reminiscent of the functors 4; defined for a regular 
category in Section A1.3; once we have seen that a topos is regular, we shall be 
able to extend the definition of 3f to an arbitrary morphism f of £, and we shall 
see that 3f is simply the ‘internal’ version of 3p.) 


Lemma 2.2.1 The functor P: EP — E has a left adjoint, namely P: E€ — £P. 


Proof  Unravelling the definitions, we have to set up a natural bijection 
between morphisms A — PB and morphisms B — PA in £, or equivalently 
between (isomorphism classes of) subobjects of A x B and of B x A. But the 
twist isomorphism A x B — B x A obviously induces such a bijection, which is 
natural in A and B. o 


We next introduce an important morphism from an object to its power object: 
for any object A of £, let {}: A — PA be the name of the diagonal relation 
(14,14): A =— Ax A. ({} is pronounced ‘singleton’; in the topos Set it is the 
mapping from a set to its power-set which sends an element a to the subset {a}.) 


Lemma 2.2.2 Let f: A— B be a morphism in a weak topos. Then 


(i) The composite {} f: A — B — PB names the relation (14, f): A >> A x B. 
(ii) The composite Pf{}: B —> PB — PA names the relation (f,14): A> BXA. 


Proof (i) Consider the diagram 
f 


A ———_> B ———>> EB 
(1, f) (1, 1) 


fxl Q xı 
A x B ——> B x B ———> PB xB 


in which both squares are pullbacks. 
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(ii) follows from (i) and the naturality of the bijection (94,8, say) between 
morphisms A — PB and morphisms B — PA, since Og g clearly maps {} to 
itself. 


Corollary 2.2.3 
(i) {}: B — PB is monic for any B. 
(ii) The functor P: E°P — £ is conservative. 


Proof (i) Suppose f,g: A 3 B are such that {}f = {}g. Then (la, f) 
and (14,g) are isomorphic in Sub(A x B); but this means that we have an 
isomorphism 7: A — A such that gi = f and 141 = 14, whence į = 14 and f =g. 

(ii) Since € (and hence £°P) is a balanced category by 1.6.2, it suffices to prove 
that P is faithful. Let f,g: A 3 B be such that Pf = Pg; then Pf{} = Pg{}, 
so we have (f,14) = (g, 14) in Sub(B x A), and as before this forces f =g. O 


The next result we require is a relation between the morphisms Pf and 3f, 
often called the Beck-Chevalley condition. 


Lemma 2.2.4 Let 


ae 
g h 
k 


C ———> D 
be a pullback square in a weak topos with g and h monic. Then the square 


Pf 


PB ———> PA 


commutes. 
Proof Consider the diagram 
E; —————> EB 


1x f 
PB x A ——> PB xB 
1xg 1xh 


Ixk 
PB x C ——> PBxD 
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in which both squares are pullbacks: the upper one by definition, and the lower 
one because PB x (—) preserves pullbacks. It is easy to see that the left ver- 
tical composite, as a relation from PB to C, is named by both 3g o Pf and 
Pkoðhk: PB > PC. 0 


Corollary 2.2.5 Ifm: A» B is monic, then Pm o Jm = ipa. 


Proof Apply 2.2.4 to the pullback square 


m) 


We digress briefly to note, for future reference, the ‘external’ version of 2.2.5: 


Lemma 2.2.6 In a (weak) topos, any object of the form PA is injective. In 
particular, a topos has ‘enough injectives’; i.e. for any object A there exists a 
monomorphism from A to an injective object. 


Proof Suppose given a pair of morphisms m: B’ — B, f': B’ — PA where 
m is monic. Let R — B’ x A be the relation named by f’, and let f: B — PA 
name the composite 


; mxi 
R>—> B x A > BxA. 


Then fm names the pullback of the latter subobject along m x 1, which is 
isomorphic to R >> B’ x A because m is monic; so fm = f’. The second 
assertion follows from the first and 2.2.3(i). o 


Theorem 2.2.7 For a weak topos €E, the functor P: E°P — E is monadic, i.e. 
it induces an equivalence between E°P and the category of algebras for the monad 
on E induced by the adjunction of Lemma 2.2.1. 


Proof We shall show that P: €°P — £ satisfies the three conditions of 1.1.2. 
The first two were verified in 2.2.1 and 2.2.3(ii) respectively, and €°P certainly 
has coequalizers because € has equalizers. So it remains to prove that P sends 
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equalizers of coreflexive pairs in € to coequalizers. Let 
nm f 
Eo A ee B 
g 


be an equalizer diagram in £ such that the pair (f,g) is coreflexive. Then by 
1.2.10 the square 


feral Lea 


E 
g 
A ———— B 
is a pullback. Applying 2.2.4 to it, we have Pf o 3g = Imo Pm; but we also 


have PmoJm = lpg and Pgo3g = 1p4 by 2.2.5, since m and g are monic, and 
Pm o Pf = Pmo Pg since P is a functor. So the diagram 


Pf Pm 
PB ZZ PA ——> PE, 
Pg 


together with the morphisms 3g and dm, forms a split coequalizer system in £; 
in particular it is a coequalizer diagram, as required. O 


Remark 2.2.8 For the case € = Set, the monadicity result of 2.2.7 is well 
known. It follows from the Lindenbaum—Tarski representation theorem for com- 
plete atomic Boolean algebras that Set°? is equivalent to the category of such 
algebras (equivalently, of completely distributive complete Boolean algebras, or 
of compact Hausdorff topological Boolean algebras; cf. [520, VI 1.16]), and it 
follows from the general results of [726] that such a category is monadic over 
Set. In a general topos €, a power object PA is of course not a Boolean algebra 
but a Heyting algebra; but we shall see in Section C3.5 that there is a sense in 
which power objects may be regarded as ‘complete atomic Heyting algebras’ in 
€, leading to a similar ‘algebraic’ description of £°P. 


Corollary 2.2.9 A (weak) topos is cocartesian. 


Proof Any category monadic over a cartesian category is cartesian, since the 
forgetful functor from a category of algebras creates limits. o 


Corollary 2.2.10 Let F:E— F be a logical functor between weak toposes. 
Then 


(i) F is cocartesian. 
(ii) F has a left adjoint iff it has a right adjoint. 
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Proof Since F is logical, the diagram 


E£0P ae FOP 
P P 
F 


E — >F 


commutes up to natural isomorphism (it is easily seen that, for a general 
cartesian functor F, the canonical morphisms ġ4: F(PA) — P(FA) defined in 
Section A2.1 form a natural transformation between the two composites). The 
first assertion is now immediate: F: €°? — F°P is cartesian because F: € — F 
is and P: FP — F creates limits. For the second, we use 1.1.3: the hypotheses 
of both parts of 1.1.3 are easily seen to be satisfied in the present case, but a left 
(resp. right) adjoint for F: €°? + F°P is the same thing as a right (resp. left) 
adjoint for F: E —+ F. o 


Remark 2.2.11 For future reference, it will be useful to write down the 
explicit construction of a right adjoint R for a logical functor F, given a left 
adjoint L for it: following the construction in [503], we find that for any object 
A of F we have an equalizer diagram 


f 
RA >—> PLPA —S PLPPPA, 
g 


where the morphisms f and g are obtained as follows: f is PLPn,4, where 7 
is the unit of the adjunction of 2.2.1 (i.e. 74: A — PPA names the relation 
€4~> PA x A = Ax PA), and g corresponds under the adjunction of 2.2.1 
to ypa: LPPPA— PPLPA, where y: LPP — PPL is the ‘mate’ under the 
adjunction (L 4 F`) of the natural isomorphism PPF = FPP. 


Suggestions for further reading: Mikkelsen [811], Paré [927]. 


A2.3 The Fundamental Theorem 


The ‘fundamental theorem of topos theory’, so named by P. Freyd [371], is the 
assertion that the property of being a topos is stable under slicing. Whether 
it really deserves this name is perhaps open to debate (we shall see that it is 
not at all hard to prove), but there is no doubt that a great many important 
consequences flow from it, as we shall see in this section. 

We begin with a lemma which is reminiscent of 1.5.2(ii) and 1.6.7 (indeed, 
1.6.7 is a special case of it). 
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Lemma 2.3.1 Let A be an object of a cartesian category E having a power 
object PA. Then, for any object B of €, the object B*(PA), equipped with the 
relation B*(€ 4), is a power object for B*(A) in €/B. 


Proof Let f:C — B be an object of €/B. The square 


fxi 
C x A ———> BxA 


C ———— B 


is a pullback, so Ug(f x B*(A)) is isomorphic to C x A, and hence 
Sube;a(f x B*(A)) = Sube(C x A). 


Thus, given a relation from f to B*(A) in €/B, we may regard it as a relation 
from C to A in £; if g: C > PA is its name in £, then (g, f): f > B*(PA) is 
the unique morphism naming it in €/B. a 


Note that, given the result which follows (and 2.3.7(iii) below), the isomor- 
phism Xg(f x B*(A)) = C x A, which we used in the above proof, is actually a 
special case of 1.5.8. 


Theorem 2.3.2 Let € be a weak topos, B an object of €. Then €/B is a weak 
topos, and the pullback functor B*: E — E/B is logical. 


Proof Given the first assertion, the second follows immediately from 
Lemma 2.3.1. To prove the first, we have to construct a power object for an 
arbitrary object f: A — B of E/B, not just for one which happens to lie in the 
image of B*. But, given f, the morphism (14, f): A — A x B is a monomor- 
phism f — B*(A) in €/B (in fact the unit of the adjunction (Xz 4 B*) at the 
object f), and so we should expect the power object of f to be a subobject (in 
fact a retract, by 2.2.5) of the power object B*(PA) of B*(A). 
Form the pullback 


Oa ee 


(h, k) 


1 x {} 1x Pf 
PAx B ——> PA x PB ——~> PAxPA 


where P, A, the order-relation on PA, is defined as the equalizer of the first pro- 
jection mı and the morphism A4: PA x PA — PA which names the subobject 
™3(E€4) N 134(Ea) Of PA x PA x A (cf. the definition of the order-relation on 
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Q in the proof of 1.6.3). Now, given an object g: C — B of €/B, morphisms 
g — k in €/B correspond to morphisms |: C — Q such that kl = g, or equiva- 
lently to morphisms hl: C — PA such that the pair (hl, Pf {}g) factors through 
P\A ~~ PA x PA, or equivalently again to subobjects of C x A contained in the 
subobject named by Pf{}g. But, by the definition of {}, the latter composite 
names the pullback of the diagonal B — B x B along gx f:C x A — Bx B, 
i.e. the subobject C xg A — C x A. So we have a bijective correspondence 
between morphisms g — k and subobjects of g x f in €/B, which is clearly 
natural in g (and hence induced by pulling back the subobject €s-— k x f which 
corresponds to the identity morphism k — k); so k is a power object for f. D 


Corollary 2.3.3 For any morphism f: A — B in a weak topos £, the functor 
f*: E/B — E/A is logical, and has both left and right adjoints. 


Proof The first assertion follows from 2.3.2 and the isomorphism €/A = 
(€/B)/f. For the second, f* always has a left adjoint by 1.2.8, so it has a 
right adjoint by 2.2.10/(ii). o 


Corollary 2.3.4 A weak topos is locally cartesian closed. 
Proof This follows at once from 2.3.3 and 1.5.3. 0o 


In particular, it follows from 2.3.4 that a weak topos is a topos, as defined in 
2.1.1, so we can now drop the adjective ‘weak’. 


Corollary 2.3.5 A topos is a Heyting category. 


Proof [It is locally cartesian closed by 2.3.4 and cocartesian by 2.2.9, so this 
follows from 1.5.13(ii). D 


In particular, a topos is a regular category; so we may now fulfil a promise 
made in the last section, and extend the definition of the covariant power-object 
functor to arbitrary morphisms of €, rather than just monomorphisms. Given 
f: A — B, we define 3f: PA — PB to be the name of the relation which is the 
image of the composite 


f 


1x 
E4 >——> PA x A— > PAxB. 
Since images are stable under pullback, it follows easily that if g: C — PA names 
a relation R — C x A, then If og names the relation 31, ;(R) — C x B. From 
this in turn we deduce that the assignment f +> If is functorial on £. We note 
a few further properties of this functor: 


Lemma 2.3.6 
(i) {} ts a natural transformation from the identity to the covariant power- 
object functor. 
(ii) The Beck-Chevalley condition of Lemma 2.2.4 remains valid without the 
assumption that g and h are monic. 
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(iii) Jf f: A — B is epic, then 3f o Pf =1pz. 
Proof (i) Given f: A — B, we have to show that 


f 


A———> B 


{} {} 


J 
PA ——> PB 


commutes. By the remarks above, 3f o {} names the image of the composite 


(1,1) 1x f 
A>—> Ax A——> AxB; 
but this composite is already monic, and by 2.2.2(i) it is also named by {}f. 
(ii) The proof is essentially the same as that of 2.2.4; we have to replace the 
vertical composites in the diagram appearing in that proof by their images, but 
the resulting square is still a pullback by regularity. 
(iii) If f is epic, then the top arrow in the pullback square 


ne 


1x f 
PB x A——~> PBxB 


is also epic by regularity (note that, because a topos is balanced, all its epi- 
morphisms are covers by the argument in the proof of 1.4.9). So the image of 
Eş = PB x A — PB x B (which is the relation named by Jf o Pf) is just €p. O 


We have not yet proved, as promised in Section A2.1, that logical functors 
between toposes preserve the cartesian closed structure; but this can be done 
without much further effort. Let F: E — F be any functor between cartesian cat- 
egories; then, for each object B of £, F induces a functor €/B — F/FB (which 
we shall denote F/B) in an obvious way, and for each morphism f: A — B in € 
we have a diagram 


Dy f 
PASS pps A 
F/A F/B F/A 
UF f (ES): 


F/FA ——> F/FB ——> F/FA 
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of which the left-hand square commutes by definition, and the right-hand square 
commutes provided F' preserves pullbacks. 


Proposition 2.3.7 Let F: € — F be a logical functor between toposes. 


(i) For each object B of E, the functor F/B: E/B — F/FB is logical. 
(ii) For each f: A— B in £, the diagram 


iy 
EJA ———> E/B 


FJA F/B 


pry 
F/FA —> F/FB 


commutes up to isomorphism. 
(iii) F is a cartesian closed functor. 


Proof (i) This follows from the construction of power objects in €/B given in 
the proof of 2.3.2, since F preserves all the structure involved in this construction. 

(ii) This similarly follows from the explicit construction of right adjoints for 
logical functors given at the end of Section A2.2: since the functors F/A and 
F/B are logical (i.e. commute up to isomorphism with power-object functors) 
and commute with the logical functor f* and its left adjoint ©;, they preserve 
(in an appropriate sense) everything involved in the construction of the right 
adjoint Ilp. 

(iii) The functor (—)?: € — € may be factored as IIg o B* (cf. the proof of 
1.5.2(i)), so it follows from (ii) that the square 


Gs 
E ——> E 
F F 
(— FB 
F — F 


commutes up to isomorphism, i.e. F preserves exponentials. O 


We have seen that pullback functors in a topos are logical, and that they 
have left (and right) adjoints. The latter property does not quite suffice to char- 
acterize them amongst logical functors between toposes, but there are several 
alternative hypotheses that we can add to obtain such a characterization. The 
next proposition collects together three of these; a fourth such condition will be 
found in C3.5.4(iii). 
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Proposition 2.3.8 Let F:€ — F be a logical functor between toposes having 
a left adjoint L. The following are equivalent: 


(i) There is an object B of E and an equivalence F ~ €/B identifying F and 
L with B* and Xp respectively. 
(ii) L is faithful. 
(iii) L preserves equalizers. 
(iv) L preserves pullbacks. 


Proof (i) = (ii), (iii) and (iv) because £p is well known to have all the stated 
properties. 

(ii) = (i): Let B = L(1). Since F = F/1, L ‘lifts’ to a functor L’: F  €/B; 
and it is easily verified that Z’ has a right adjoint F’, which sends f: A — B to 


the pullback of 
FA 


Ff 


TR; 


(where 7 is the unit of (L 4 F)). In other words, F’ is the composite 


F/B ni 
€£/B ——> F/FB —>F, 


and so it is logical by 2.3.7(i) and 2.3.3. Now L’ preserves the terminal object 
by construction, and F” is cartesian closed by 2.3.7(iii), so applying 1.5.9(i) we 
deduce that the counit L’F’ — 1¢;g is an isomorphism. It now follows from one 
of the triangular identities that L'y: L! — L’F’L’ is an isomorphism, where n’ 
is the unit of (Z’ 4 F’); but L’ clearly inherits faithfulness from L, and so reflects 
isomorphisms since F is balanced. Thus 7’ is an isomorphism, and LZ’ and F' 
form the required equivalence of categories. 

(iii) => (ii): Since L preserves equalizers, its faithfulness (indeed, conserva- 
tiveness, by 1.2.4) will follow provided we can show it preserves properness of 
subobjects. But the fact that F preserves Q implies that we have a bijection, 
for any object A of F, from Sub (A) to Sube(LA). We do not (yet) know that 
this bijection is simply the functor L applied to subobjects of A (but see 2.4.8 
below); however, from the fact that the diagram 


iy ener heme 
LT T 
Lo EQ 


96 TOPOSES- BASIC THEORY A2.3 


commutes (where ¢ is the classifying map of A’ — A, and € is the counit 
of (L4 F)), we deduce that LA’ is contained in the subobject of LA which 
corresponds to A’, and in particular that it must be proper when A’ is. 

(iv) = (iii) follows from 1.2.9. o 


To show that the hypotheses of 2.3.8 are not vacuous, consider the case when 
E = Set and F = |G, Set] for a nontrivial group G, with F: E — F the functor 
which equips a set with the trivial G-action. Using 2.1.4, it is easy to see that 
F is logical; and it has a left adjoint L, which sends a G-set to the set of its 
G-orbits (and a right adjoint which sends a G-set to the set of its G-fixed points). 
But F is not equivalent to a slice of E. By considering this example, one may 
verify that conditions (iii) and (iv) of 2.3.8 may not be weakened to ‘L preserves 
equalizers of coreflexive pairs’ or ‘L preserves pullbacks of monomorphisms’; in 
fact, as we shall see in the next section, these conditions are satisfied by any 
functor left adjoint to a logical functor. 

Before leaving this topic, we record 


Scholium 2.3.9 Jf a logical functor between toposes has a cartesian left 
adjoint, then it is an equivalence. 


Proof In the notation of 2.3.8, L preserves 1, so we have F~E/ISE. D 


Let f:A— B be a morphism in a topos E€. Just as the left adjoint 
Jj: Sub(A) — Sub(B) of f* is ‘internalized’ by a morphism 3f : PA — PB of E, 
so the right adjoint Y; of f* (which exists by 2.3.5) corresponds to a morphism 
Vf: PA— PB. To define the latter, we simply take the name of the relation 
Vix¢(€a) >> PA B. Since the functors Yg commute with pullback by 1.4.11, it 
follows easily that if r: C — PA names a relation R — C x A, then the compos- 
ite (Vf)r is the name of Vy ¢(R) > C x B. From this, we may easily establish 
the analogue for V of the Beck—Chevalley condition of 2.2.4 and 2.3.6(ii): 


Lemma 2.3.10 Let 


A a ae B 
k 

C —— > D 

be a pullback square in a topos. Then the square 

Pf 

PB ——~+> PA 

Vh Yg 

Pk 


commutes. 
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Proof By 1.4.11, each way round names the same subobject of PB xC. O 


For future reference, we also note 


Lemma 2.3.11 Let F: € — F be a cartesian functor between toposes. Then F 
commutes with (external) universal quantification iff the diagram 


(pa) ZOT, prep) 
PA $B 
V(Ff) 


P(FA) ——-> P(FB) 


commutes for all f: A— B in E. In particular, if F is a coherent functor, then 
it is a Heyting functor iff the above diagram commutes for all f. 


Proof The two ways round the diagram name the subobjects F(V\x 5(€4)) 
and Vixrs(F(€a4)) of F(PA) x FB, so if F commutes with universal quantifi- 
cation then they are equal. The converse follows from composing the two ways 
round the diagram with the images under F of morphisms 1 — PA in £. o 


A2.4 Effectiveness, positivity and partial maps 


We have seen that a topos is a Heyting category, and in particular regular; but 
we do not yet know that it is effective as a regular category. Our first task in 
this section is to prove that fact. 

Let (a,b): R:3 A be an equivalence relation in a topos. The pair (a,b) is 
jointly monic, and so is named by a morphism r: A — PA. In Set, r is the 
function which sends an element a of A to its equivalence class modulo R, so it 
is an obvious candidate for a morphism whose kernel-pair is (a, b). We now prove 
that this is true in general. 


Proposition 2.4.1 With the notation established above, (a,b) is a kernel-pair 
forr. 


Proof First we must show that ra = rb: R — PA, or equivalently that the 
subobjects of R x A obtained by pulling back (a,b): R >—> A x A along a x la 
and 6 x 14 are isomorphic. But if T denotes the ‘object of R-related triples’, i.e. 
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Ji 


R——> A 


the pullback 


P 
—> 


(cf. 1.3.6(d)), then it follows easily from symmetry and transitivity of R that 
there are pullback squares 


re aR age ge a 
(p, bq) (a,b) (p, bq) (a,b) 
bx1 
Redo! SAA Rx A——>AxA 


(where t is the morphism expressing transitivity of (a, 6)). 

Now suppose given (f,g): B 3 A with rf = rg. Then the subobjects (f x 
1,4)*(a,6) and (g x 14)*(a, 6) of B x A are isomorphic; pulling them back along 
(18,9): B — B x A, we deduce that (f,g)*(a,6) = (g, g9)" (a,b) in Sub(B). But 
the latter is the whole of B, since (g, g) factors through the diagonal A> Ax A 
and (a,b) is reflexive; so (f, g)*(a, b) is the whole of B, i.e. (f,g) factors through 
(a,b). And the factorization is unique since (a,b) is jointly monic; so (a,b) has 
the universal property of a kernel-pair. 


For an alternative proof of 2.4.1, in a slightly more general setting, see 3.4.8(ii) 
below. 

Note that we have not (explicitly) constructed a coequalizer for (a, b) in the 
proof of 2.4.1, since the morphism r is not epic (we shall see in D4.1.8(i) that 
there can never be an epimorphism A -» PA in a nondegenerate topos). Of 
course, we may obtain a coequalizer for (a, 6) by forming the image factorization 
of r; cf. 1.3.4 (and 3.4.8(iii) below). 

It also seems worth remarking that 2.4.1 includes as a particular case the 
result, already proved in 2.2.3(i), that {}: A — PA is monic, since its kernel-pair 
is (14, la). 

The fact that a topos is positive as a coherent category can be deduced from 
1.5.14 and 1.6.2. However, we shall give a different proof of it, which establishes 
a result of independent interest along the way. Let f: A B, g: A—C be 
a pair of morphisms with common domain, the first of which is monic. Then 
(f,g): A— B x C is also monic, and so is named by a morphism h: B — PC. 


A2.4 EFFECTIVENESS, POSITIVITY AND PARTIAL MAPS 99 


Lemma 2.4.2 With the above notation, the square 
g 
A —— C 


f 


amine! 
a) 


pats po 


is a pullback. 


Proof The square commutes, because we have a pullback square 


Jenti ; 


(1,9) (f,9) 


fxi 
AxC—>BxC 


and so Af names the subobject (1,g) of A x C, which is also named by {}g 
(2.2.2(i)). Given any pair of morphisms k: D — B, l: D — C with hk = {}1, we 
similarly see that there must be a pullback square 


D ———_> A 


(1,2) (f,9) 


BeOS Bee 


The top edge of this square is the required factorization of (k,!) through (f, g) 
(and it is unique, because (f, g) is monic). o 


Corollary 2.4.3 Given a pushout square 


T eee 


C 
f l 
|; EERE 5 


in a topos with f monic, the morphism d is monic and the square is also a 
pullback. 
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Proof Let h: B — PC be the name of (f,g), as in 2.4.2. By the universal 
property of pushouts, we have a factorization of (h, {}) through (e, d); since {} 
is monic, this forces d to be monic, and the fact that the square of 2.4.2 is a 
pullback forces this one to be a pullback too. o 


Corollary 2.4.4 Coproducts in a topos are disjoint. 
Proof We have a pushout square 


0——>B : 


A ——> A+B 


but the morphisms 0 — A and 0 — B are trivially monic, since 0 is a strict 
initial object by 1.4.1 (or by 1.5.12). So the result follows directly from 2.4.3.0 


Combining 2.4.1 and 2.4.4, we have 
Corollary 2.4.5 A topos is a pretopos. o 


We remark that 2.4.4 applies also to infinite coproducts, to the extent that 
they exist in £; also, a locally small topos is well-powered, since subobjects of A 
correspond to morphisms A — 2. Thus a locally small, cocomplete topos is an 
oo-pretopos. In fact there is a partial converse to this result, as we shall see in 
C2.2.8: an oo-pretopos which has a separating set of objects is a topos. 

We devote the rest of this section to a couple of mild digressions, which are 
included at least partly for historical reasons. The first concerns representability 
of partial maps: this condition originally formed one of the axioms for an elemen- 
tary topos, when Lawvere and Tierney began their investigation of the subject. 
By the time the first published accounts of topos theory appeared, it had been 
realized that the condition could be deduced from a special case of itself (namely, 
the existence of a subobject classifier) in the presence of cartesian closedness, and 
so it nò longer appeared as an axiom; but partial-map representers continued to 
play an important rôle in early accounts of the subject (e.g. [649], [1167], [504]) — 
in particular, in the proof of the Fundamental Theorem, where they were used 
in the construction of exponentials in €/B. As we have seen in the preceding 
section, they are no longer needed for this purpose; but their existence and 
properties are still of interest. 

A partial map from A to B in a cartesian category € is a special sort of 
relation from A to B, namely a subobject R >— A x B whose composite with the 
first projection A x B — A is still monic. Equivalently, we may think of it as a 
pair of morphisms (m: R — A, f: R — B) of which the first is monic. We shall 
write ‘(m, f): A — B’ (or sometimes simply ‘f: A — B’, suppressing the name 
of the monomorphism) for ‘(m, f} is a partial map from A to B’. 
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Partial maps can be composed: given (m, f): A — B and (n, g): B — C, we 
form the pullback 


“a 


This composition is associative up to (coherent) isomorphism, and so (provided 
we identify isomorphic relations) the objects and partial maps of € form a cat- 
egory Part(€). There is a conservative functor € — Part(€) sending an object 
A to itself and a morphism f: A — B to (14, f): A — B. 

We say that € has representable partial maps if this functor has a right 
adjoint: i.e. if, for each object A of £, we are given an object A and a partial map 
(ta,ga): A — A such that every partial map B — A factors uniquely through 
(ia, ga) by a morphism B — A in £. This definition can be simplified: by an 
easy generalization of the argument used to prove Lemma 1.6.1, it can be shown 
that the second component g4 of such a universal partial map is necessarily an 
isomorphism, and so we may as well take it to be the identity on A. Thus we 
define a partial-map representer for A to be a monomorphism i4: A — A such 
that, given a partial map (m, f): B — A, there is a unique morphism f: BoA 


making the square 
R 
B 
a pullback. 
A partial map B — 1 is (essentially) the same thing as a subobject of B, 


so a partial-map representer for 1 is just another way of describing a subobject 
classifier. Moreover, the proof of Lemma 1.6.5 immediately generalizes to yield 


Bats 


tA 


> << > 


=. vere 


Lemma 2.4.6 A Boolean pretopos has representable partial maps. 


Proof Define A = AU 1, and let i, be the first coprojection. Given a partial 
map (m, f): B — A, let m’ be a complement for m in Sub(B), and define f as 
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the unique morphism making 


/ 
Re Bae ee 


~ 


f f 


Ae Se a ey 


commute (where 7 is the second coprojection). The remaining details are as 
in 1.6.5. o 


In a topos, we have already seen (in Lemma 2.4.2) that if we take A = PA 
andi, = {}, we get the ‘existence’ but not the ‘uniqueness’ part of the definition 
of a partial-map representer. The trouble is that PA is ‘too big’ (in Set, if A 
is an n-element set, PA has 2” elements, whereas by 2.4.6 we want something 
with n + 1 elements), and so we need to cut it down somehow. 


Proposition 2.4.7 A topos has representable partial maps. 


Proof Given an object A of E & E/1, let us write 7: A > Q for the object 
T+(A) of €/Q. We note that since T is monic, the unit of the adjunction (Er 4 
T*) is an isomorphism, and hence so is the counit of (T* 4 II7); in particular, 
T*(r) & A, and so we can write the subobject of A classified by 7 as i4: A >> A. 
Given any partial map 


f 


B —A , 


B 
let x:B—Q be the classifying map of m; then the morphism 


f: B! = T*(x) — A transposes to yield a morphism f:x— rT in €/Q such that 
Tf) = f, i.e. the square 


f 


Bi ———> A 


m tA 


f 


B ——> À 


is a pullback. Conversely, if this square is a pullback, then we have 7 f = x by 
unigueness of classifying maps, and f must be the transpose of f. E 
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As an application of partial-map representers, we prove a result promised 
after 2.3.8: 


Lemma 2.4.8 Let F: € — F be a logical functor between toposes having a left 
adjoint L. Then L preserves monomorphisms; and in fact, for every object A 
of F, the assignment m +> Lm is a bijection from (isomorphism classes of) 
subobjects of A in F to subobjects of LA in E. 


Proof We clearly have a bijection between subobjects of A classified by 
morphisms ¢: A — NF = FNg and subobjects of LA classified by morphisms 
¢: LA — Ng; the problem is to show that it is induced by the action of L on 
monomorphisms. For the moment, let us write AA’ — LA for the subobject 
corresponding to m: A’ — A. Since the diagram 


€ 
LA ———> Ll zog 1 
Lm LT T 
Lo EQ 


LA ——> Lü; ———> Qe 


is readily seen to commute (where € is the counit of {L 4 F’)), we have a canonical 
morphism L.A’ — A’; to show that this is an isomorphism, we shall demonstrate 
that A’ has the same universal property as LA’. 

For any object B of £, morphisms A’ — FB can be regarded as partial maps 
A — FB, and so correspond to morphisms f: A — FB satisfying df = ġ, where 
ô: FB > Qr classifies ipg: FB = FB. But since F is logical, it commutes 
with everything involved in the construction of 2.4.7; so we have an isomorphism 
FB ~ F(B), which identifies ô with F(é’) where 6’: B — Ne classifies ig. Thus 
morphisms A’ — FB correspond to morphisms f: LA — B satisfying é’f = $ 
and these in turn correspond to morphisms AA’ — B, as required. 


We note in passing that, since L induces isomorphisms of subobject lattices, 
it preserves intersections of subobjects (i.e. pullbacks of monomorphisms), and 
hence also equalizers of coreflexive pairs, as we claimed in the last section. 

We have already observed that 1 is another name for Q. The following 
identification is also useful: 


Lemma 2.4.9 In a cartesian category with subobject classifier 9, the partial- 
map representer N (exists and) is isomorphic to the order-relation 9, defined in 
the proof of 1.6.3. 


Proof By definition, a partial map A — 2 corresponds to a subobject A’ — A 
(the domain of the partial map) and a subobject A” —+ A’ — equivalently, to a 
pair of subobjects (A” >=> A,A’ > A) of which the first is contained in the 
second. But such a pair is classified by a morphism A — 9. More explicitly, 
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let ig: Q — Q classify the pair of subobjects (T: 1 — Q, 1o: Q — Q); then ip 
is monic, since its composite with the first projection is the identity on Q (the 
classifying map of T), and given a partial map (A’ — A, f: A’ — 9) we let 
f: A — Q classify the pair (A”, A’) where A” is the subobject of A’ classified 
by f. It is then straightforward to verify that f is the unique morphism making 


f 


A’ —— 2 


a pullback. o 


Remark 2.4.10 There are a number of alternative constructions of A which 
are sometimes of use. We mention two of them here, both of which are based on 
the idea that A may be identified with the ‘object of subobjects of A having at 
most one element’. 

(a) Let s: PA > PA be the name of ({},14): A PA x A, and define A > 
PA to be the equalizer of s and 1p,. (In Set, s is the map which sends each 
singleton subset of A to itself, and everything else to the empty set.) First we 
note that there is a pullback 


iit eg 
(1,1) ({}, 1) 
{} x1 


Ax A——> PAxA 


because {} is monic, and so s{} = {}; hence {} factors through A PAbya 
monomorphism 24: A — A. More generally, if kh: B — PA names a partial map 
(m, f): B — A, then it follows immediately from 2.4.2 that the square 


Pe ey 
(m, f) ({},1) 


hxi 
B x A —> PAxA 


is a pullback; so sh = h, and h factors through A — PA by a morphism 
f: B — A. And the fact that the square in the statement of 2.4.2 is a pullback 
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R nar 

f 

B———_> 
is a pullback too. 


It remains to show the uniqueness of f . Let g: B > A be another morphism 
making the last square above a pullback, and write g’ for the composite of g with 
A >= PA, Then it is easy to see that the second square above, with h replaced by 
g’, must also be a pullback, and so sg’ names the subobject (m, f): R— B x A. 
But sg’ = g’ since g’ factors through A — PA, and so g’ names this subobject; 
hence g' = h, and so g = f since A > PA is monic. 

We remark that, by similar methods, the square 


implies that the square 


tA 


d << 


an 


({}-1) ({}, 1) 


1 
PAx A——-> PAXxA 


can be shown to be a pullback; hence s is idempotent (i.e. s? = s), and we could 
equivalently have defined Aas the image of s. In particular, A is a retract of PA, 
and is thus injective (this could have been proved directly from the definition of 
a partial-map representer, by methods similar to those of 2.2.6). 
(b) For the second alternative construction, let q: PA — P(A x A) name the 
relation 
Wi2(Ea) N Tis (Ea) >> PAX AXA 


(in Set, q is the map which sends a subset A’ of A to A’ x A’ C A x A), and let 
d: 1 — P(A x A) name the diagonal map A — Ax A1 x Ax A. Form the 
pullback 

A —— P(A x A) 


xd 
PAS PAx1 2% P(Ax A) x P(Ax A) 
where P\(A x A), as before, denotes the order-relation on P(A x A). (We may 
think of A as ‘the object of subobjects of A whose squares are contained in the 
diagonal’. 
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To show that this definition has the required universal property, let 
h: B — PA name a relation (g, f): R + B x A. We shall show that h factors 
through A — PA iff (g, f) is a partial map (i.e. iff g is monic). 

But, by definition, h factors through A — PA iff mia (R) N Th (R) — B x 
A x A is contained in the subobject 1 x A: B x A> B x A x A; equivalently, 
iff the top left square in the diagram 


R >———> ThB) ——> R 


(9, f) 
* 12 


T13 Tı 


(9, f ) Wy 
>——— Bx A——--> B 
(where the arrows R — aj,(R) and R — a}j3(F) are the splittings of the pro- 
jections induced by 1 x A: B x A >= B x Ax A) is a pullback. And the other 
three squares in this diagram are pullbacks; so this happens iff the outside is a 
pullback, i.e. iff g is monic. 


Our second (and much shorter!) digression concerns the construction of 
exponentials from power objects: we have seen that a weak topos must have 
exponentials, but we have not given an explicit procedure for constructing them. 
(Such a construction could in theory be extracted from 2.2.11 and 1.5.2, but it 
would be too complicated to be of much use.) The first proof of this result (due 
to A. Kock [636]) did yield an explicit construction of exponentials from power 
objects, which is based on the idea that in Set the exponential B4 may be iden- 
tified with a subset of P(A x B), namely the set of subsets of A x B which are 
the graphs of functions A — B. Kock’s construction produces, for each pair of 
objects (A, B) in a (weak) topos, a subobject of P(A x B) which we may think 
of as ‘the object of graphs of morphisms A — B’; it is sometimes useful to have 
such an explicit construction, and so we give it here. 

First we let r: P(A x B) x A — PB name the relation 


E(AxB) >> P(A x B) x (Ax B) = (P(A x B) x A) x B, 
and then we form the pullback 
Q ——————> B 


Q 


P(A x B)x A ——> PB 
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(In Set, Q is the set of pairs (R,a) such that a is R-related to exactly one 
element of B.) Now let q: P(A x B) — PA be the name of the relation Q, and 


form the pullback 
E ———— 1 


TA 


P(A x B) —> PA 


where T4 names the top element of Sub(1 x A). 


Proposition 2.4.11 The object E constructed above has the universal property 
of an exponential B4. 


Proof It suffices to show that a morphism h: C — P(A x B) factors through 
E — P(A x B) iff the relation (m,n, p): D—C x Ax B which it names 
has the property that (m,n): D — C x A is an isomorphism, i.e. iff it is the 
graph of a morphism C x A — B. But such a factorization exists iff gh 
names the top element of Sub(C x A), which is equivalent to saying that 
hx1la:C x A— P(A x B) x A factors through Q — P(A x B) x A, and this 
in turn is equivalent to saying that r(A x 14) factors through {}: B >=> PB. But 
r(h x 14) names the subobject D > C x A x B, considered as a relation from 
Cx A to B; so this says that there is a (unique) morphism f: C x A — B making 


D ————_-> B 


(m,n, p) A 


fxi 
C x AxB — BxB 


a pullback, i.e. that (m, n, p) is (isomorphic to) the graph of f. Cl 


In passing, we remark that 2.4.11 provides an alternative proof of 2.3.7(iii): 
for a logical functor preserves all the structure used in the construction of the 
object E. 


Suggestions for further reading: Kock & Mikkelsen [636], Kock & 
Wraith [649]. 


A2.5 Natural number objects 


The axioms for a topos, as set out in Section A2.1, permit one to carry out a 
great many mathematical constructions ‘inside’ a topos, as we shall see later on 
in this book. However, lots of constructions in mathematics (in particular, those 
which involve the formation of recursively defined collections of entities) require 
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some form of axiom of infinity, and the topos axioms do not imply any such 
axiom (as is shown by the fact that Sets is a model for them). So, for many 
purposes, we shall need to add such an axiom to the topos axioms. 

The neatest categorical formulation of the axiom of infinity was found by 
F. W. Lawvere [695]: 


Definition 2.5.1 Let C be a category with a terminal object 1. A natural num- 
ber object in C is an object N equipped with morphisms o: 1 — N ands: N+ N 
such that, given any object A with morphisms a: 1 — A and t: A — A, there is 
a unique f: N — A such that 


1°» N-—-_>N 


a 


E A 


commutes. 


The definition says that (N,0,s) is an initial object in the category of 
T-algebras in C, where T is the finitary algebraic theory freely generated by 
one nullary and one unary operation. (Later we shall see that, in a topos, the 
existence of a natural number object implies the existence of free algebras for 
arbitrary finitely-presented algebraic theories.) It is thus immediate that a natu- 
ral number object, if it exists, is unique up to canonical isomorphism. (Note that 
we shall frequently abuse notation by saying ‘N is a natural number object’, with- 
out mentioning the morphisms o and s. It should also be mentioned that many 
authors write ‘natural numbers object’ rather than ‘natural number object’.) 

In Set, the usual set N of (Cantor/von Neumann) natural numbers serves as 
a natural number object, with o and s taken to be ‘zero’ and ‘successor’. The 
definition is a particular case of the assertion that functions may be defined by 
primitive recursion, but the general case follows from it: 


Proposition 2.5.2 Let (N,o,s) be a natural number object in a cartesian 
closed category. Then, given objects A and B equipped with morphisms g: A~ B 
and h: Ax Nx B— B, there exists a unique f: Ax N—B such that the 
diagram 


J xo 1x 
Aj 2 SAN eS AS 


f (1, f) 


A ————> B <——_—__ AxNxB 
commutes. 
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Intuitively, f is the function defined recursively by f(a,0) = g(a) and 
f(a,n+1)=h(a,n, f(a,n)) . 


Proof First we consider the case A = 1. By definition, we have a unique 
f’: N— N x B such that 


(sm, h) 
N x B —> NxB 


commutes; but the composite mı f': N — N must be the identity, because it 
commutes with both o and s. So f’ is the graph of a morphism f = ma f': N > B, 
which is clearly the unique morphism with the required properties. 

The general case is reduced to this one by cartesian closedness. Let g: 1 => B4 
be the exponential transpose of g, and R: N x B4 — B^ the transpose of 


13, MI EVON 
wepia TO Nen ESR, 


Then it is easy to see that f: Ax N — B makes the diagram in the statement 
commute iff its transpose f: N + B^ makes 


E e; 


2 f (1, f) 
J 


commute. g 


Remark 2.5.3 We note that the first half of the proof of 2.5.2 made no use 
of cartesian closedness. If we wish to study naturai number objects in cartesian 
categories which are not cartesian closed, it makes sense to adopt a stronger 
form of the definition: (N, 0, s) is a natural number object if, given any diagram 
(A+ B = B), there exists a unique morphism f: A x N — B making the 
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diagram 
la x 14 xs 
Avis Ae N ——> AxN 


Tı f f 


A ———— B ————__ > B 

commute. (An equivalent condition would be to demand that N is a natural 
number object (in the sense of 2.5.1) ‘stably under slicing’, i.e. that A*N is a 
natural number object in C/A for any A.) With this modification, the result of 
2.5.2 remains true for natural number objects in arbitrary cartesian categories. 
However, we shall be almost exclusively concerned with natural number objects 
in toposes (or in quasitoposes; cf. Section A2.6 below); so we have chosen to 
adopt the simpler form of the definition given in 2.5.1. 


By taking B = N and A to be a finite power of N in 2.5.2, we deduce 
that every primitive recursive function of k variables is ‘realized’ as a morphism 
N* — N, in any cartesian closed category with a natural number object. 


Examples 2.5.4 We define addition, multiplication and exponentiation on N 
recursively by the diagrams 


1 
Ne Nene aa 
+ (L+) 


ST 
N <«——— NxXxNxN 


(i.e m +0 =m, m+(n+1)=(m+n)+1), 


ixo 1xs 
N ——~> Nx N <e NXN 
x (1, x) 
o +(73, 71) 


1 ————> N «e Nx Nx N 
(ie. mx 0=0, m x (n+ 1) = (mx n) + m), and 


Nes ay ee NaN 


(1, [—]) 


xX (13, 7 
> N e (ram) NxNxWN 
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(ie. [m = 0] = 1, [n = (n+ 1)] = [m — n) x m - we use the notation [m — n] 
rather than the more familiar m” in order to avoid confusion with exponen- 
tials within our category). Moreover, the usual inductive proofs translate, via 
the uniqueness clause of 2.5.2, into proofs that these operations satisfy the dia- 
grammatic forms of the usual (associative, commutative, distributive) laws of 
arithmetic. 


Two further morphisms which we shall require are the predecessor map 
p: N — N defined by 


t aN eN 


p (1, p) 


Tı 
N <—_ NxN 


and the truncated subtraction —: N x N — N defined by 


1 
A en ee 


= (1, >) 


NEL IONANN 


Note that the definition of p requires only the first half of the proof of 2.5.2, so 
this morphism exists in any category with finite products and a natural number 
object — in particular, s: N — N isa (split) monomorphism in any such category. 
This is essentially the fourth of Peano’s five postulates characterizing the natural 
numbers in Set (the first two are just the existence of the morphisms o and s). 

We now turn to a different characterization of natural number objects in 
toposes, due to P. Freyd [371]. 


Lemma 2.5.5 Let (N,o,s) be a natural number object in a category C. 
(i) If C has either binary coproducts or binary products, then 
jo Ne 
is a coproduct diagram. 
(ii) For any C, 


1 
NN 1 
83 


is a coequalizer diagram. 
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Proof (i) IfC has binary coproducts, then a natural number object in C is the 
same thing as an initial object in the category of T-algebras, where T: C — C is 
the-endofunctor 1 + (—). So by 1.1.4 the morphism 1+ N — N induced by o 
and s is an isomorphism. 

In the case when C has binary products, suppose given an object A and mor- 
phisms f: 1 — A, g: N — A. Consider the unique morphism (h,k): N> NxA 
making 


jie ee Ne N 
(h, k) (h,k) 
(0, f) 
s, g) 
vea O A 


commute: we have ho = o and hs = sh, so h = ly by uniqueness, and hence k 
is a factorization of (f, g) through (o, s). And it is unique, by the uniqueness of 
recursively defined morphisms. 

(ii) If f: N — A is any morphism satisfying fs = f, then f is equal to the 
composite 


f 


N ———> 1-5 N34, 


because each of them makes the diagram 


1—“"_s N—"-5N 


fo 


ae ee 


commute. So f factors (uniquely, since N — 1 is split epic) through N — 1. O 


Freyd’s characterization asserts that in a topos the converse of Lemma 2.5.5 is 
true: if N is any object equipped with morphisms o and s such that the diagrams 
in the statement of 2.5.5 are colimits, then it is a natural number object. We 
postpone the proof of this result until Section D5.1, because it requires the 
development of the internal logic of a topos; nevertheless, we make use of it in 
part (ii) of the next result. 


Lemma 2.5.6 LetC andTD be categories with terminal objects, and F: C > D 
a functor preserving the terminal object. Suppose C has a natural number object 
(N,o,8). If either (i) F has a right adjoint, or (ii) C has finite products or 
coproducts, D is a topos and F is cocartesian, then (FN, Fo, Fs) is a natural 
number object in D. 
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Proof (i) follows directly from the definition: given a: 1 — A and t: AA 
in D, we have 


ieri 7s pats Ra 


in C, where R is the right adjoint of F; so we get a unique morphism f: N — RA 
making the appropriate diagrams commute, and its transpose FN — A is the 
morphism we seek. 

(ii) (V, 0,8) satisfies the colimit diagrams of 2.5.5, and F preserves them; so 
the result follows from D5.1.2. a 


The hypothesis that D is a topos in 2.5.6(ii) cannot be weakened; see C4.2.4. 

Using 2.5.6, we may give many more examples of natural number objects. 
For example, [C, Set] has a natural number object for any small C, namely the 
constant functor with value N: the functor A: Set — [C, Set] sending a set to 
the corresponding constant functor preserves 1 and has a right adjoint, which 
sends a functor C — Set to its limit. Again, if N is a natural number object in 
a properly cartesian closed category C then A*(N) is a natural number object 
in C/A for any A, by 1.5.2(i) (cf. 2.5.3 above). 

In any category with a terminal object and countable coproducts, it is easy 
to see that the countable copower }_ „ey 1 satisfies the condition of 2.5.1. This 
provides an alternative proof of 2.5.6(i) for functors between categories with 
countable coproducts. However, even in a topos a natural number object need 
not be a countable copower of 1; see, for example, D5.1.7 below, or alternatively 
F2.1.6(iv). 

For future reference, we note the following result about countable colimits: 


Lemma 2.5.7 Let C be a o-pretopos (as defined at the end of Section A1.4), E 
a topos with countable coproducts and F: C — E a coherent functor. Then F is 
o-coherent (i.e. preserves countable coproducts) iff it is cocartesian (i.e. preserves 
coequalizers). 


Proof One direction follows directly from 1.4.19: if F is o-coherent, then it 
preserves all the structure involved in the construction of coequalizers in C. Con- 
versely, suppose F preserves coequalizers. Then it preserves the natural number 
object, by 2.5.6(ii); equivalently, by the discussion above, it preserves the count- 
able copower of 1. But, given any countable family (An | n € N) of objects of C, 
the coproduct A = Ll nex An is characterized up to isomorphism by the existence 
of morphisms vn: A, — A and p: A — N making 


Vn 
Ayn —————> 


> 


2<— > > 
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a pullback for each n; F preserves these pullbacks, and hence preserves the 
coproduct of the A,. m 


Next, we note a useful consequence of Lemma 2.5.5(i): 


Corollary 2.5.8 ZLet(N,0,s) be a natural number object in a positive coherent 
category. Then the monomorphisms a:1 >> N and s: N = N are complemen- 
tary in Sub(N). O 


In particular, we note that the subobjects o and s are disjoint — that is, 
Peano’s third postulate is satisfied. To complete the discussion of the Peano 
postulates, we have 


Lemma 2.5.9 In any category, a natural number object satisfies Peano’s fifth 
postulate: i.e. if N! — N is a subobject such that both 0:1— N and the 
restriction of s to N’ factor through N', then N’ is the whole of N. 


Proof Let 0’: 1 — N’ ands’: N! > N’ be the factorizations. Then there exists 
f:N—N' such that 


(Ny 
* f f 

i 
N’ —2-> NI 


commutes; and the composite N — N’ »= N must be the identity since it 
commutes with both o and s. So N’ — N is split epic, and hence an isomorphism. 
o 


Once again, we shall see in Section D5.1 that the Peano postulates suffice to 
characterize the natural number object in a topos. 

Until further notice, we shall work in a cartesian closed, positive Heyting 
category €. One important consequence of 2.5.8 is that, since complements are 
preserved under pullback, it allows us to construct lots of other complemented 
subobjects of (finite powers of) N. For example, we define the order-relation 
N, = N x N by the pullback 


N, ———-——> 1 
0 
NxN——>N 


Of course, it requires proof that this is an order relation on N; we shall indicate 
briefly how this may be done. First we note that, since the identity n — n = 0 is 
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provable in primitive recursive arithmetic, it is valid in our context; that is, the 
diagonal A: N > N x N factors through N. 


Proposition 2.5.10 The intersection of Nı — NXN and the opposite relation 
NÈ ~ N XN is the diagonal. 


Proof After the preceding remark, we have to show that Ni N NP < A, or 
equivalently that ((Ni N Np) > A) is the whole of N x N. Let N’ — N be 
the image of the latter subobject under Vy, : Sub(.V x N) — Sub(V) (in set- 
theoretic terms, N’ is the set 


{me N| (Yne N\((m-n=0=n-m)>(m=n))}); 


it suffices to show that N’ is the whole of N, and we shall do this using 
2.5.9. Firstly, o factors through N’, since n+0 = n, and so the intersection 
of ox 1: 1x N= NxN with Np is just (0,0): 1 — N x N. To show that s 
restricts to a morphism N’ — N’, we shall argue informally in terms of ele- 
ments (the logical underpinnings of this informal argument will be found in 
Section D1.3). Suppose m € N’, and suppose sm -n = 0 = n sm. By the first 
equation, n Æ 0, so n is a successor, say n = sk. But sm — sk = m > k for any 
m and k, so we deduce m +k = 0 = k — m, whence m = k (since m € N’) and 
so sm = sk = n. Thus sm € N”. 


Corollary 2.5.11 In any topos, the natural number object (if it exists) is decid- 
able (cf. 1.4.15); i.e. the diagonal subobject N —=> N x N is complemented in 
Sub( N x N). 


Proof N, is complemented in N x N because it is a pullback of o: 1 >=> N, 
which is complemented by 2.5.8. But a finite intersection of complemented sub- 
objects is complemented, in any coherent category. 0 


Proposition 2.5.12 The union of N, and Nf is the whole of N x N. 
Proof Let E — N x N be the equalizer of mı: N x N — N and 


— T 
nen LE E EN 


(i.e. the set {(m,n) | (m> n) +n = m}); we shall show that N, UE is the whole 
of N x N and that E < Np. (Note, incidentally, that E is a complemented 
subobject of N x N, since it is expressible as a pullback of the diagonal; but we 
shall not need this fact.) 

For the first assertion, let N’ >= N = Ym (Ni UE — N x N); we shall argue 
as in the proof of 2.5.10 to show that N’ is the whole of N. It contains o (in fact 
Yz (E) contains o, since m 0 = m + 0 = m). If n € N’, then for any m € N 
we have either m + n = 0, whence m — sn = p0 = 0, or (m> n) +n = m. But in 
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the latter case we have 
(m— sn) + sn = s(p(m—n) +n) = sp(m—n)+n, 


and m—n is either 0 (which reduces to the previous case) or a successor (in 
which case sp(m >n) = m—n, so (m — sn) + sn = m). Thus sn € N’. 

For the second assertion, it suffices to observe that n > (n+ k) = 0 for all n 
and k, so if (m,n) € E we have 


n-~m=n—(n+(m—n))=0, 
i.e. (m,n) E€ NÌ. O 


Proposition 2.5.13 The relation Nı — N x N is transitive (in the sense of 
1.3.6(d)). 


Proof Once again, we argue by induction, using 2.5.9. Let N’ — N be 
Vora ((Ti2(N1) N 723 (N1)) > 713(N1)), 


ie. the set {m E€ N | (Wn,k € N\((min = n-k = 0) > (mk = 0))}. 
We have o € N’, since 0—k = 0 for all k. Suppose m € N’, and suppose 
smn =n+k = 0. Then n Æ 0, so n must be a successor (say n = sn’), and 
similarly k = sk’. Now we have m — n’ = sm — sn’ = 0 and similarly n’ — k’ = 0, 
so m > k’ = 0; but this implies sm — k = 0. So sm € N”. O 


Putting together the last four results, we have shown that N; — N x N 
is a (trichotomous) linear order on N, in any cartesian closed positive Heyting 
category € with a natural number object. 

Next, we consider the composite 


T 
Ni >—> N x N ——> N25 N 


as an object of E/N. We denote this object by C, and call it the generic finite 
cardinal in £. If p: A — N is any morphism with codomain N (which we think 
of as a ‘natural number in the category €/A’ — recall that A*(N) is a natural 
number object in this category, by 2.5.6(i)), we shall write |p] for the pullback 
of C along p, and call the objects which arise in this way finite cardinals in 
E/A. (Thus C itself could alternatively be denoted [1y].) The reason for this 
terminology is that, in the case € = Set, C is the N-indexed family whose pth 
member is the set {0,1,2,...,p— 1}, i.e. it is the cardinal p itself as defined by 
von Neumann. In general, we have 


Lemma 2.5.14 [o] = 0 and [sp] = 1 I [p] for any p. 


Proof The first assertion follows from the disjointness of o and s (2.5.8). For 
the second, it suffices to prove the result for p = 1y, since pullback functors pre- 
serve coproducts. But since s is monic, [s] is the object 72: N= Nx NN. 
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And we know that, as a subobject of N x N, N; is the coproduct of the diagonal 
N — N x N and the strict order relation (the complement of N?); and the latter 
may easily be identified with 


1 
Ne SHEN SSN 


by an argument like those already given. Composing with 72, we obtain a coprod- 
uct decomposition [s] = 1y IC in E/N. o 


Further properties of finite cardinals will be studied in Section D5.2. 
We conclude this section with a brief discussion of list objects. 


Definition 2.5.15 A list object over an object A in a category E with 
finite products is an object LA equipped with morphisms o4:1 — LA and 
84: A x LA — LA which is initial in the category of objects equipped with such 
structure; i.e. given any object B with morphisms b: 1 — B and t: A x B —> B, 
there is a unique f: LA — B such that 


pa Sa AS A 
f Lx f 


t 
B «—— AxB 
commutes. 


The notation A* is also commonly used for the list object over A. Clearly, in 
the case A = 1 the definition reduces to that of a natural number object; we can 
think of list objects as standing in the same relation to natural number objects 
aS power objects do to subobject classifiers. In Set, we may take LA to be the 
set of all finite lists of elements of A, with o, picking out the empty list and s4 
corresponding to the operation of appending an element of A to the head of a 
list. We note that, if E has list objects, then the operation A ++ LA becomes 
a functor € — £E: given f: A — B, we define Lf to be the unique morphism 
making 


QA SA 
1 ——> LA ———__ AxLA 


Lf 1x Lf 


S x l 
ip cee) acre 


commute. In particular, any list object comes equipped with a morphism 
l: LA — L1 = N which ‘measures the length of a list’. 
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Many of the formal properties of natural number objects, established in this 
section, are inherited by list objects. For example, in analogy with 2.5.5, we 
have a coproduct decomposition LA ¥ 1+ (A x LA) provided coproducts exist 
in E, and the coequalizer of s4 and m2: Ax LA — LA is LA — 1. Similarly, 
generalizing 2.5.6(i), if F: E — F is a functor which preserves finite products and 
has a right adjoint, and LA is a list object over A € ob €, then F(LA) is a list 
object over FA. We omit the proofs, which are straightforward generalizations 
of those already given. 

In establishing the existence of list objects in suitable categories, we shall 
need to use the following ‘section criterion’ due to G. C. Wraith [549]. 


Lemma 2.5.16 Let E be a cartesian category having a natural number object 
N, and let f: AN be an object of E/N. If we are given morphisms 
a:1—~o0*(f) nE andt: f — s*(f) nE/N, then there exists a unique h: Ly > f 
in E/N such that o*(h) = a and s*(h) = th. 

1 

l 

( 


N 
o*(A) ——> A 
l Oo 


ee NY ee 


Proof Consider the diagram 


0 
ee NN 


dD 


RE 6 E 


f 


in which the morphism h exists uniquely by the definition of N, and the com- 
posite fh is the identity by the uniqueness clause of 2.5.1. Moreover, a standard 
diagram-chase shows that the upper left square of the diagram and the square 


N 
| 
A 


are pullbacks. o 


N——> 
ha 
s*(A) ——> 


Theorem 2.5.17 Let € be a locally cartesian closed, positive coherent category 
with a natural number object. Then € has list objects. 
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Proof We define LA to be the partial product P(A,C) as defined in 1.5.7, 
i.e. the object Uy((N*A)°), where the exponential is calculated in €/N; and 
we shall write l: LA — N for the object (N*A)°© itself. We note first that, by 
2.5.14 and the fact that pullback functors preserve exponentials (1.5.2(ii)), we 
have o*(1) = All = A? = 1 and s*(l) S (N* Aj] = N*AGNUC) = N*A x l; so 
the pullbacks of o and s along / provide the required structure maps 1 ~ LA 
and A x LA — LA. 

Now suppose given an object B with morphisms b: 1 — B and t: A x B~ B. 
We need to construct a morphism Ey(N*A°) —> B in £, or equivalently a 
morphism ly —> (N* BNA‘) in €/N. But by 2.5.14 we can construct this 
from a morphism 1 + o*((N*B)(N"4°)) in £ and a morphism (N*B)(N*4°) _, 
s*((N*B)%"4%) in €/N. And, again using the fact that pullback functors 
preserve exponentials, we have o*((N*B)(N"4°)) = BUA") œ Bl % B, so we 
may take the former to be the given morphism b; and s*((N*B)(N°A°)) = 
(N*B)(NAXN*AS) so for the latter we take the transpose (u, say) of 


. 72, eV O T13) N* 
(N* BNA x N*A x N*AC l BS N*A x N*B OS NB. 

Moreover, a straightforward diagram-chase shows that a morphism LA —> B 
satisfies the conditions of 2.5.15 iff its transpose h: Ly — (N*B)(N"4®) satisfies 
o*(h) = b and s*(h) = uh, so this is the unique morphism with the required 
property. o 


For a simpler, but less explicit, proof of 2.5.17 in the case when £ is a topos, 
see D5.1.5. Further properties of natural number objects and of list objects will 
be explored in Chapter D5. 


Suggestions for further reading: Freyd [371], Johnstone & Wraith [549]. 


A2.6 Quasitoposes 


Although one great merit of the topos axioms is their wide applicability, as 
demonstrated by the list of examples in Section A2.1, there are certain con- 
texts in which they appear inconveniently restrictive. The problem arises from 
Corollary 1.6.2, which told us that a category with a subobject classifier (in 
particular, a topos) is necessarily balanced, whereas we do sometimes encounter 
examples of non-balanced categories which are sufficiently ‘topos-like’ to share 
many important properties with them. The notion of quasitopos, introduced by 
J. Penon [954,956], succeeds in capturing these examples (as well as all toposes); 
we devote this section to studying it. 

Clearly, one way of weakening the notion of subobject classifier is to demand 
classifying maps not for arbitrary subobjects but only for the members of some 
restricted class of subobjects (which might as well be stable under pullback). 
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Whatever class we choose, it will follow from the argument of 1.6.2 that all 
monomorphisms in this class are regular; so one possibility would be to take 
the class of regular monomorphisms. However, for axiomatizing the notion of 
quasitopos it is more convenient to take the class of cocovers (more usually 
called strong monomorphisms); we shall then obtain for free the information that 
every cocover is regular monic (compare 1.3.4, where a substantial argument was 
required to establish the dual result in a regular category). 

Accordingly, we define a weak subobject classifier in a cartesian category € 
to be an object Q equipped with a morphism T:1— Q such that, given any 
cocover A’ >> A in £, there is a unique f: A — N making 


1 
i 
Q 
a pullback. (There is no danger in re-using the notation of Section A1.6, since a 
subobject classifier — if it exists — is necessarily a weak subobject classifier.) 


 <—_——_< 


f 
> 


Definition 2.6.1 A quasitopos is a locally cartesian closed, cocartesian cate- 
gory with a weak subobject classifier. 


We note that, in compensation for weakening the notion of subobject classi- 
fier, we have had to strengthen our axioms in two other respects. First, we have 
had to assume the existence of finite colimits, instead of proving it as in Sec- 
tion A2.2: we may, of course, define the weak power object PA of an object A to 
be 24, and as previously we can make P into a functor €°P — E; but this functor 
is no longer monadic, because (although it is still faithful, by the argument of 
2.2.3(ii)) it fails to be conservative. (In fact we shall see that if f is any morphism 
which is both monic and epic in a quasitopos, then Pf is an isomorphism.) Sec- 
ondly, we have had to strengthen ‘properly cartesian closed’ to ‘locally cartesian 
closed’, in order to ensure that the property of being a quasitopos is stable under 
slicing: although we can argue just as in 2.3.2 to show that the possession of weak 
power objects is inherited by €/B from €, we cannot reconstruct exponentials 
from weak power objects. (The point at which the construction of 2.4.11 breaks 
down is that, although we can define the singleton map {}: A — PA (since the 
diagonal A > A x A is regular monic, being the equalizer of the two projections) 
and we can prove as in 2.2.3(i) that {} is monic, it is not a cocover in general; 
hence (in the notation of 2.4.11) the relation Q => P(A x B) x A need not be a 
cocover, and so need not have a name.) 


A2.6 QUASITOPOSES 121 


Despite the foregoing, it follows immediately from 2.2.9 and 2.3.4 that a topos 
is a quasitopos. Conversely, we shall see shortly that a balanced quasitopos is a 
topos. First we need a quasitopos version of 2.4.3: 


Lemma 2.6.2 Given a pushout square 


r one 


C 
Ba =p 


in a quasitopos where f is a cocover, the morphism d is also a cocover and the 
square is also a pullback. 


Proof The morphism (f,g): A+ B xC is a cocover, since f is (this is 
where we use the fact that we defined weak subobject classifiers in terms of 
cocovers rather than regular monomorphisms), and so is named by a morphism 
h: B — PC. Exactly as in 2.4.2, we may show that the square 


insg 


f {} 


fy ey 


(commutes and) is a pullback; hence, as in 2.4.3, the pair (A, {}) factors through 
(e, d), and the square in the statement of the lemma is a pullback. We may also 
conclude that d is monic, since {} is; but, as indicated above, we must do some 
further work to prove that d is a cocover. 

Let k: B — Q be the classifying map of f; then the diagram 


yee eee 


f T 


OE E 


commutes (since both ways round classify the top element of Sub(A)), and hence 
induces a morphism l: D — 2. Let d’: C’ — D be the (regular) subobject classi- 


fied by J; then d’ > d in Sub(D), and the pullback of d’ along e is isomorphic to f. 
We shall show that the factorization C ~ C” of d through d’ is an isomorphism, 
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so that d is a cocover. Since D is a pushout, we have a regular epimorphism 
B +C — D; pulling this back along d’, we get a regular epimorphism (in par- 
ticular, a cover) A + C — C”, since pullback functors preserve coproducts and 
regular epimorphisms. But A — C” factors through C — C’; so the latter must 
be a cover, and hence an isomorphism since we already know it is monic. o 


Corollary 2.6.3 
(i) A quasitopos is coregular. 


(ii) A balanced quasitopos is a topos. 


Proof (i) This is immediate from 2.6.2 and 1.3.5, since we already know that 
cocovers coincide with regular monomorphisms. 

(ii) By (i), every monomorphism in a quasitopos factors as a monic epimor- 
phism followed by a cocover; if the quasitopos is balanced, the first factor is an 
isomorphism, and so every monomorphism is a cocover. Thus the weak subob- 
ject classifier is a subobject classifier. a 


Before we develop the theory of quasitoposes any further, we should look at 
some examples which are not toposes. 


Examples 2.6.4 (a) Any Heyting algebra (more generally, any cartesian 
closed preorder which is cocartesian) is a quasitopos. We saw in Section A1.5 
that such a category is locally cartesian closed; and since its only cocovers are 
isomorphisms, the terminal object will serve as a weak subobject classifier. 

(b) We have already noted that Sp is not cartesian closed; however, various 
constructions have been proposed for embedding Sp in a cartesian closed cate- 
gory C. It is clear that if such an embedding is reasonably well-behaved (at least 
if it preserves monomorphisms and epimorphisms and reflects isomorphisms) 
then C cannot be a topos, since it will inherit non-balancedness from Sp; but 
we can hope for it to be a quasitopos, and in fact several of the proposals put 
forward for C are quasitoposes. One such, which is canonical in a certain sense 
(see [1247]), is the category Choq of Choquet spaces (convergence spaces in the 
sense of Choquet [240]), and we now describe this example. A Choquet space is 
a set X equipped with a binary relation between (proper) filters ® on X and 
points z € X, written ® — z (and read ‘® converges to x’), such that 


(i) For every x € X, the principal ultrafilter {X’ C X | xz € X"} converges 
to T. 


(ii) If — z and W 2 9, then Y - z. 


(iii) If ® is a filter and z a point such that every filter containing © is itself 
contained in a filter converging to z, then ® — z. 


A continuous map of Choquet spaces is a function between their underlying 
sets which preserves convergence, in the obvious sense; this defines the category 
Chog. The usual (Cartan [235]) notion of convergence for filters on a topological 
space defines a functor Sp — Chog, which is well known to be full and faithful. 
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It is straightforward to verify that Choq has finite limits and colimits, and 
that the forgetful functor to Set preserves them; in fact the latter has both left 
and right adjoints, which respectively equip a set X with the discrete Choquet 
structure in which only principal ultrafilters converge, and with the indiscrete 
structure in which every filter converges to every point. The monomorphisms 
(resp. epimorphisms) in Choq are just the injective (resp. surjective) continuous 
maps, from which it follows that the cocovers are just those injections i: Y — X 
for which Y has the ‘subspace Choquet structure’ (i.e. every filter on Y which 
maps to a filter converging in X to i(y) converges in Y to y), and hence that 
the indiscrete two-point Choquet space is a weak subobject classifier. We shall 
not give the proof that Chogq is locally cartesian closed, since it would occupy 
too much space — for more details of this and related examples, see the papers 
of Wyler [1247] and Dubuc [297]. 

(c) There is a sequential version of the last example, which essentially goes 
back to M. Fréchet [369]. By a Fréchet space we mean a set X equipped with a 
relation of convergence between sequences in X and points of X, such that 


(i) For every x € X, the constant sequence (x) converges to x. 
(ii) If (£n) —> z, then every subsequence of (z,,) converges to x. 


(iii) If (zn) is a sequence and z a point such that every subsequence of (Zn) 
contains a subsequence converging to x, then (£n) —> 7T. 


(Fréchet’s original definition also included the ‘separation axiom’ that every 
sequence has at most one limit, but for our present purposes it is essential to 
omit this requirement, since we need to have the indiscrete spaces in our cate- 
gory.) The category of Fréchet spaces (with convergence-preserving functions as 
morphisms) will be denoted Fre; the proof that it is a quasitopos is similar to 
that for Choq, and we omit the details. The usual notion of sequential conver- 
gence in topological spaces gives a functor Sp — Fre, which is faithful but not 
conservative; however, it is full when restricted to the subcategory of sequential 
spaces (cf. Example 2.1.11(7)). Moreover, since the space N* is sequential, it 
is easy to verify that the functor Sp (Nt, —): Sp — Sh(M) of 2.1.11(7) extends 
to a functor Fre (N+, —): Fre — Sh(M); the latter is full and faithful (note 
that continuous maps N+ — X in Fre correspond to instances of the conver- 
gence relation in X). For more discussion of this example, see [507], and also 
C2.2.14(b). 

(d) Let (C,T) be a site, as defined in 2.1.9. We say a functor F: C°P — Set is 
separated if it satisfies the ‘uniqueness’ but not the ‘existence’ part of the sheaf 
axiom for T-covering families; equivalently, if for every (fj: Ai + A|ie€ I) € 
T(A) the function 

F(A) — |] Fi4d) 
ie 
induced by the F(f;) is injective. We write Sep(C,T) < [C°?, Set] for the full 
subcategory of T-separated functors; we claim that Sep(C,T) is a quasitopos. 
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The proof that Sep(C, T} is closed under finite limits and an exponential ideal 
in [C°?, Set] is similar to that for Sh(C, T). It is not closed under finite colimits, 
but it: has them by virtue of the fact that it is reflective in [C°P, Set]; we shall 
not prove this here, since it is a special case of a general result which we shall 
establish in Section A4.4. For local cartesian closedness, we recall from 1.1.7 that 
any slice [C°P, Set]/F is equivalent to [¥°P, Set] for a suitable category F; and 
we can ‘lift’? the coverage T on C to a coverage Tp on F, in such a way that 
Sep(C,T)/F is identified with Sep(7,T;). Finally, a monomorphism F’ —> F 
in Sep(C, T) is easily seen to be a cocover iff it has the property that, given any 
A € obC and any s € F(A), if there exists a covering family (f;: A; > A|ie€ J) 
such that F(f;)(s) € F’(A;) for each i € J, then s € F'(A). Using this, it is easy 
to see that the subobject classifier Qr of Sh(C, T), as constructed in 2.1.10, is a 
weak subobject classifier for Sep(C, T). 

(e) Our next example is a modification of the category Pos(L), for a dis- 
tributive lattice L, which we described in 1.4.6. Suppose that L is irreducible, 
i.e. that the meet of any two nonzero elements of Z is nonzero. Then since any 
sequence of elements of L, regarded as an object of Pos(Z), is isomorphic to 
the sequence obtained by deleting all occurrences of 0 from the original one, 
and since the nonzero elements of L have no nontrivial disjoint decompositions, 
we may simplify the description of Pos(Z) as we did for the particular case 
L = 2 in 1.4.6 itself: it is equivalent to the concrete category whose objects are 
finite sets A equipped with a function a: A — L \ {0}, and whose morphisms 
f: (A,a) > (B, B) are functions satisfying a(a) < 6(f(a)) for all a € A. We 
denote this category by Fuz;(Z), and we claim that if L is an irreducible Heyting 
algebra then Fuz;(L) is a quasitopos. It is easy to verify that Fuz,(L) has finite 
limits and colimits (and the forgetful functor Fuz;(Z) — Set; preserves them). 
It is cartesian closed: (B, 3)'4:~) may be taken to be the set F of all functions 
f: A— B with structure map ¢: F — L \ {0} given by 


(f) = \{a(a)=> BF (a) | a € A} 


(note that this meet is never 0, since A is finite and L is irreducible), and 
the exponential adjunction is easy to verify. Exponentials in slices of Fuzs(L) 
are similarly defined. Cocovers in Fuz;(Z) are injective functions for which the 
inequality in the definition of a morphism becomes an equality; from this one 
sees easily that a two-element set, with a sending both its elements to 1, is a 
weak subobject classifier. 

In the particular case L = 2, this construction in fact yields a topos, as we 
saw in 1.4.6; but in general it does not — for example, if Z is a totally ordered 
set with more than two elements, then we saw after 1.4.6 that Pos(Z) is not 
effective as a regular category, and so it cannot be a topos. If L is a complete 
irreducible Heyting algebra, then we may embed Fuz;(L) in a larger quasitopos 
Fuz(L), simply by removing the finiteness restriction on the underlying sets of 
its objects (and by modifying the definition of exponentials: we have to cut down 
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to the subset F’ C F of functions f such that ¢(f) Æ 0). This category is one 
version of the category of L-valued fuzzy sets, as studied by fuzzy-set theorists; 
another version in common use, which allows the possibility e(a) = 0, may be 
identified with Fuz(Z,), where L, is the lattice obtained from L by adjoining 
a new bottom element below the original one (note that such a lattice is always 
irreducible). Of course, we may define the categories Fuzș (L) and Fuz(L) for any 
lattice L, irreducible or not; but for Heyting algebras which are not irreducible 
we do not get a cartesian closed category. (The reason is that the definition of 
products has to be modified: in forming the product of (A, a) and (B, 3) we have 
to exclude the pairs (a, b) with a(a) A G(b) = 0. See also [1248, Chapter 8], for a 
more detailed discussion of the categories which have been proposed by fuzzy-set 
theorists.) 

(f) One further example, which we shall not discuss here, but which will be 
of importance in Chapter F2, is the category Ass(A) of A-valued assemblies, 
where A is a partial Schénfinkel algebra — see F2.1.6. 


We now revert to the general theory of quasitoposes. As well as being coreg- 
ular, a quasitopos is regular (in fact, a Heyting category) by 1.5.13; it need not 
be effective or positive (indeed, if it has both properties then it is a topos, by 
1.4.9 and 2.6.3(ii)), but it can have either property without having the other. For 
example, a Heyting algebra is (trivially) effective, but not positive unless it is 
degenerate, while the quasitoposes Choq and Fre are positive but not effective. 
Regarding positivity, we have the following result: 


Lemma 2.6.5 A quasitopos has disjoint coproducts iff the unique morphism 
0 — 1 is a cocover. 


Proof If 0 — 1 is a cocover, then 0 cannot have any proper epimorphic 
images, and disjointness of coproducts follows from 1.5.14 (or alternatively it 
can be deduced from 2.6.2 just as we deduced 2.4.4 from 2.4.3). Conversely, if 
1+ 1 is disjoint, then 0 — 1 is an equalizer of the two coprojections 1 3 1+1.0 


Quasitoposes satisfying the conditions of 2.6.5 have been called solid by 
G. P. Monro [861]. Let £ be a general quasitopos, and let 0 -» 0 — 1 be 
the factorization of 0 — 1 into a (monic) epimorphism followed by a cocover. It 
is easy to see that the analogue of 1.6.7 holds for weak subobject classifiers, and 
hence that any slice category of a quasitopos is a quasitopos; in particular, £ /0 is 
a quasitopos. But it is also a preorder; in general, any cartesian closed category 
in which 0 — 1 is epic must be a preorder, since two distinct morphisms A 3 B 
would yield distinct morphisms 1 =3 B4. Moreover, since 0 is a subobject of 1, 
we may identify €/0 with a full subcategory of £, whose objects are those objects 
of E which admit a morphism to 0 (equivalently, those subobjects of 1 contained 
in 0); we call these thin objects of £. More surprisingly, we have 


Lemma 2.6.6 With the above notation, the co-slice category 0\E is a positive 
quasitopos. 
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Proof Again, since 0 — 0 is epic, we may identify 0\€ with a full subcategory 
of €, consisting of those objects which admit a morphism from 0; we call these 
solid objects. As such, it is easily seen to be closed under finite limits and 
pushouts, and to have an initial object 15; and it is an exponential ideal since, 
for any two objects A and B, there exists a morphism B — B^, namely the 
transpose of the first projection. It is locally cartesian closed, because slicing 
commutes with co-slicing: for any solid object A, we have 


(O\E)/(0 > A) = (0 > A)\(E/A), 


but (0 — A) is the ‘object’ of €/A, and so the category on the right is cartesian 
closed. And 0\€ inherits a weak subobject classifier from £, by the argument of 
1.6.8. Finally, 0 — 1 is a cocover in 0\€, since it is one in £; so by 2.6.5 O\E is 
positive. E 


Thus, in a sense, the study of quasitoposes ‘bifurcates’ into the study of 
those which are preorders and those which are positive. By this we do not mean 
to imply that, given an arbitrary quasitopos €, we can reconstruct it up to 
equivalence from the preorder P = €/0 and the positive quasitopos Q = 0\E: 
there are at least two ways of putting these together, namely the product P x Q 
and the ordinal sum P @ Q (the latter being obtained from the disjoint union by 
identifying the terminal object of P with the initial object of Q), and these are 
distinct provided both P and Q are non-degenerate. However, it turns out that 
one further piece of information suffices to recover € from P and Q: 


Proposition 2.6.7 Let € be a quasitopos, and let P and Q be defined as above. 
Then there is a cartesian functor F:P— Q such that E is equivalent to the 
category GI(F) obtained by glueing along F', as in 2.1.12. 


In passing, we remark that the glueing construction applied to a cartesian 
functor between quasitoposes always yields a quasitopos; this can be proved 
directly by methods like those of 2.1.12, but will follow from a more general 
result to be proved in Section A4.2. 


Proof To simplify the notation, we identify P and Q with the appropriate 
full subcategories of £. For a thin object A, we define F(A) = A? + 0; we note 


that since A? is subterminal by 1.5.10, we have 


Ax TZ AxDZA 
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by one of the Heyting semilattice identities of 1.5.11 and the fact that A < 0 in 
Sub(1). And since 0 — A is epic (being a pullback of 0 — 0), the square 


is a pushout; equivalently, by 1.4.3, F(A} is the union of A? and 0 in Sub(1). 
So the distributive law (1.4.2), plus the fact that (—)° is a right adjoint, ensures 
that F preserves finite intersections as a functor Sub(0) — Sub(1), and hence 
that it is cartesian as a functor P — Q. Note also that, since F takes values 
in the lattice of subterminal objects of Q, we may identify GI(F) with a full 
subcategory of P x Q, consisting of those objects (A, B) such that the support 
of B (the image of the unique morphism B — 1) is contained in F(A). 

Now let C be an arbitrary object of £. We define the thin part C; and the 
solidification C, of C to be the product C x 0 and the coproduct C + 0; clearly 
they are respectively thin and solid. We have a morphism 


62040 Ss (Cx 40 Fo) 


(where 7 is the unit of ((—) x 0 4 (—)®)), so the pair (Ce, Cs) is an object 
of GIF); and the assignment C ++ (C;,C;) is clearly functorial. In the other 


direction, we have a functor GLF) — €E sending (A, B) to A® x B. Now if (A, B) 
is an object of GIF), we have 


(AŤ x B) = A°x BxOZAOXOZA 
using the fact that B is solid and A is thin; and 
(A? x B), = (AÏ x B) +0 & (A? x B) + (Ox B) & (A°40) x B= F(A)x BY B 


where the last step uses the fact that F(A) contains the support of B. Similarly, 
for any object C of £, we have 


Co x C, = C8 x (C +0) = (CP x C) + (CP x 0) = (CPx C) +C, 
= (C? x C) + (O x C) = (+0) x C=F(G,)xC2C 


where the last step uses the fact that F (C+) contains the support of Cs, and 
hence also of C. The above isomorphisms are clearly natural in A, B and C; so 
we have an equivalence of categories, as claimed. D 
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Remark 2.6.8 If a quasitopos £ is effective as a regular category, then it is 
easy to see that the same property is inherited by its solid part; hence, by 1.4.9 
and 2.6.3(ii), the latter is a topos. It is also easy to verify that effectivity is 
preserved by the glueing construction; thus, as a special case of 2.6.7, we may 
deduce that effective quasitoposes are exactly the categories obtained by glueing 
along cartesian functors from Heyting algebras to toposes. In this sense, they 
form a ‘minimal common generalization’ of toposes and Heyting algebras. 


In Example 2.6.4(d), we noted that the quasitopos Sep(C,T) contains as a 
full subcategory a topos Sh(C,7'), whose subobject classifier provides the weak 
subobject classifier of the quasitopos. This state of affairs is typical, as we shall 
see shortly. First we note 


Lemma 2.6.9 Suppose f: A» B is both monic and epic in a quasitopos. 
Then pullback along f induces a bijection between (isomorphism classes of) 
cocovers B' — B and cocovers A’ — A. 


Proof Since isomorphism classes of cocovers A’ — A correspond to morphisms 
A — Q, the fact that f is epic ensures that pullback along it is an injective 
operation on cocovers. To prove that it is surjective, let A’ — A be a cocover, 
and form the epimorphism-cocover factorization A’ >» B’ — B of the composite 
A!’ >> A>» B. We must show that the square 


A ———> B' 


f 


A ———> B 


is a pullback. Let A” be the pullback of B’ along f; since the square above 
commutes, we certainly have a morphism A’ — A”, which is a cocover because 
A’ — A is. But it is also epic, because it is the pullback of A’ — B’ along 
A” — B’; hence it is an isomorphism, and the square above is a pullback. O 


Corollary 2.6.10 A morphism f: A— B in a quasitopos is both monic and 
epic iff Pf: PB — PA is an isomorphism. 


Proof If f is both monic and epic, then so is lc x f:C x A—C xB for 
any C (since monomorphisms and epimorphisms are stable under pullback). By 
2.6.9, this implies that any morphism C — PA factors uniquely through Pf, 
so Pf must be an isomorphism. The converse holds because P is faithful, as 
mentioned earlier. E 


We define an object C of a quasitopos E£ to be coarse if ‘it cannot detect the 
lack of balance of £’; i.e. if, whenever f: A — B is both monic and epic, each 
morphism A — C factors uniquely through f. Thus Lemma 2.6.9 says that Q is 
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coarse. Every coarse object is solid, because 0 — 0 is monic and epic, and the full 
subcategory Cs(€) of coarse objects is (trivially) closed under finite limits in £. 
It is also an exponential ideal, because functors of the form (—) x C preserve 
monics and epics. Thus, to complete the proof that Cs(&) is a topos, it suffices 
to show 


Lemma 2.6.11 Let f: A’ — A be a monomorphism in a quasitopos, where A 
is coarse. Then A’ is coarse iff f is a cocover. 


Proof First suppose f is a cocover. Then, given 


B' Sy A’ 
g f 
B A 


where g is monic and epic, we get a factorization of fh through g by coarse- 
ness of A, and then we get a factorization of k through g by (the dual of) 
Lemma 1.3.2(ii). Conversely, suppose A’ is coarse, and form the epic-cocover 
factorization A’ >» A” > A of f. By coarseness, 1,4, factors through A’ — A”, 
i.e. A’ -» A” is split monic; but it is also epic, and hence an isomorphism. O 


Proposition 2.6.12 In a quasitopos E, the full subcategory Cs(E) of coarse 
objects is a topos, and the inclusion Cs(E) — E has a left adjoint. 


Proof That Cs(€) is a topos, with Q as its subobject classifier, follows from 
2.6.9, 2.6.11 and the discussion preceding the latter. To obtain the reflection in 
Cs(E) of an arbitrary object A, form the epic-cocover factorization A => A — 
PA of the singleton morphism {}: A — PA. PA is coarse by 2.6.9 and the fact 
that the coarse objects form an exponential ideal, so A is coarse by 2.6.11; but 
A>» A is monic as well as epic, because {} is monic, and so by definition every 
morphism from A to a coarse object factors uniquely through it. E 


In the case when € is the quasitopos Sep(C,T) for a site (C,T), it is not 
hard to verify that Cs(€) is the topos Sh(C, T). (We shall do this, in a more 
general context, in Section A4.4.) For € = Chogq or Fre, the coarse objects are 
just the indiscrete spaces, so that Cs(€) is isomorphic to Set; and similarly in 
Example 2.6.4(e) we find that Cs(Fuz,;(Z)) = Sets. (Fuzzy-set theorists are 
accustomed to call these objects crisp rather than coarse.) 


Suggestions for further reading: Adámek et al. [22], Dubuc [297], 
Höhle [454], Johnstone [507], Monro [861], Penon [954,956], Wyler [1247, 1248]. 


A3 
ALLEGORIES 


A3.1 Relations in regular categories 


The object of this Chapter is to present an alternative approach to the notion of 
topos, different from (but not entirely independent of) the one which we followed 
in Chapters Al and A2, via categories of relations. While this approach will not 
play a very important rôle in our subsequent exposition of topos theory, it is of 
some interest in its own right, and it certainly provides an appealing route to 
the construction of some of the leading examples of toposes, which can make 
their structure easier to understand. The advantages of this approach have been 
forcefully argued over some years by P. Freyd, and a fuller account of it will be 
found in [381]. There are a number of other approaches to the study of categories 
of relations — notably that developed by A. Carboni and R. Walters [232], which 
makes heavier use of 2-categorical ideas — but we shall not describe them here. 

By a relation from A to B in a cartesian category C, we mean a subobject 
of A x B. (In this context — in order to define an associative composition of 
relations ~ it is essential to interpret ‘subobject’ as meaning an isomorphism 
class of monomorphisms, rather than an individual monomorphism; but as usual 
we shall not labour the point.) We shall use the notation A % B for a relation 
from A to B, and we shall generally use Greek letters to denote relations. If 
(f:T > A,g:T — B) is a pair of morphisms such that the induced morphism 
(f,9):T — A x B is monic and belongs to the isomorphism class ¢: A 9 B, we 
shall call (f, g) a tabulation of ¢. 

The composite of two relations ¢: A q- B and w: B % C can be defined in 
any cartesian category with images: having chosen tabulations (f,g) and (h, k) 
of ¢ and y, we form the pullback 


Pea ce y 


T ———-> B 


and define Yọ to be the relation tabulated by the image of (fp, kq): P > A x C. 
This is clearly independent of the choice of tabulations of ¢ and yw. 
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Lemma 3.1.1 LetC be a cartesian category with images. The composition of 
relations just defined is associative iff C is regular. 


Proof First suppose C is regular; let x: C % D be a third relation, tabulated 
by (l,m): V — C x D. Form the pullbacks 

V 

C 


u 
TERA o eE 


R Q 
S: 
Pa ey | ee ee 
tt 
T ———>B 


We claim that x(q) is tabulated by the image of (fpt, msu): R — A x D; by 
symmetry (xw)¢ is also tabulated by this pair, so they are equal. 
To prove the claim, let 


(a,c) 


Pa S y a SS Ae 


be the image factorization of (fp, kq), and form the pullback 


z 
| 


H 


SS 


ae 


Now there exists a pullback square 


iS 


ee aes 
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and by regularity d is a cover; so, since 


oe a 


(fpt, msu) orn 


Ax D 


commutes, the images of the two maps into A x D coincide. But the image of 
(ax, my) is by definition a tabulation of x(wW¢). 

Conversely, suppose composition of relations is associative: let e: A—> B be 
a cover in C, and f: C > B an arbitrary morphism. Let ¢:C } B, y: BHA 
and x: A% 1 be tabulated by (1c, f), (e,14) and (14, A) respectively (recall 
our convention that A also denotes the unique morphism A — 1); then since 
(e, A): A — B x 12 Bis a cover the composite xy is tabulated by (1g, B), and 
(xv)¢ is tabulated by (1lc,C). But a similar calculation shows that (yd) is 
tabulated by the image of (h, P): P + C x 1, where 


Pia eee C 
g f 


E 5, 


is a pullback; so if these two relations are equal then h is a cover, and thus covers 
in C are stable under pullback. c 


Corollary 3.1.2 If C is a regular category, then relations in C form the 
morphisms of a category Rel(C) having the same objects as C. 


Proof Having defined composition of relations and verified that it is associa- 
tive, it remains to establish the existence of identities. But it is trivial to verify 
that the relation ¿4 tabulated by the diagonal map (14,14): Æ >> A x A serves 
as the identity relation on A. C 


Of course, Rel(C) is more than just a category. Since the relations A 9 B are 
the subobjects of a fixed object of C, they have a natural partial ordering (in fact 
a meet-semilattice ordering, since C is cartesian), and this ordering is preserved 
under composition (that is, œ < w implies dx < wy and 0¢ < Oy whenever the 
composites are defined ~ although we do not in general have (6NwW)x = xN YX, 
since the operation of taking images does not commute with intersection). Thus 
Rel(C) is a particular case of a 2-category (for which see Section B1.1): in fact 
a locally ordered 2-category, i.e. one in which there is at most one 2-cell between 
any two given 1-cells, and the only invertible 2-cells are identities. 
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We shall not need to use very much 2-categorical machinery in this chapter 
(and in particular, we shall not have to worry about diagrams commuting only 
up to 2-isomorphism), but we shall need the notion of adjointness of 1-cells in a 
2-category. What this amounts to in Rel(C) is that ¢: A% B is left adjoint to 
y: BG A iff dv < tg and ¿4 < Wo (the triangular identities being vacuous). 

The other structure which Rel(C) possesses is an anti-involution: if ¢: A% B 
is a relation tabulated by (f,g), the opposite relation ¢°: B % A is that tabu- 
lated by (g, f). It is elementary to verify that the mapping @ + ¢° defines a 
2-functor Rel(C) — Rel(C)°? (where the superscript ‘op’ signifies that 1-cells 
are reversed but 2-cells are not), and that its square is the identity. 

If f: A— B is any morphism of C, then the graph of f, i.e. the pair 
(14, f): A — A x B, is monic (we have already used this in the proof of 3.1.1), 
and so tabulates a relation which we denote fe: A 9 B. The assignment f + fe, 
plus the identity map on objects, defines a faithful functor C — Rel(C) (for its 
faithfulness, cf. the proof of 2.2.3). We shall write f°: Bq A for (f.)°. 


Proposition 3.1.3 

(i) For any morphism f of C, fe is left adjoint to f° in Rel(C). 

(ii) Let 6: A9- B be a morphism in Rel(C) having a right adjoint. Then 
there is a unique morphism f: A— B inC such that d= fe. 


Proof (i) f*f. is tabulated by the kernel-pair of f, and the reflexiveness of 
this relation (cf. 1.3.6) is precisely the inequality ¿4 < f° fe. On the other hand, 
fe f° is the image of (f, f): A — B x B, and since this morphism factors through 
the diagonal we have fef? < tz. 

(ii) Let y be the right adjoint of ¢, and let (a,b): T > AxB, 
(b,a): T’ > B x A be tabulations of ¢ and y. First, the inequality 14 < Yọ 
says that the image of (ap, a’p’): P —> A x A contains the diagonal, where 


| eee 

p’ b 
b 

T’ ———> B 


is a pullback. So ap factors as a cover followed by a split epimorphism; hence it is 
a cover, and in particular a is a cover. Similarly, a’ is a cover, so in the pullback 
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both q and q’ are covers. But the inequality ¢y < tg means that bg = b'q'; and 
since we also have ag = a'g this forces Yy = ¢°, since each relation is tabulated 
by the image of (bg, aq) = (b'q',a'q'). Thus we can now drop the primes on T”, 
a’ and b’. Also, since: we have aq = aq’ and bq = bq’ and the pair (a,b) is monic, 
we must have q = q’. But (q, q’) is the kernel-pair of a; so a is monic, and hence 
an isomorphism. Thus, putting f = ba~!, we have ¢ = fe, since a itself is an 
isomorphism from the given tabulation of @ to the graph of f. The uniqueness 
of f follows from the faithfulness of f — fe. Oo 


Thus the regular category C is recoverable (up to isomorphism) from the 
2-category Rel(C), as the subcategory containing all the objects of Rel(C) but 
only those morphisms which have right adjoints. Because of this, a morphism 
in a locally ordered 2-category which has a right adjoint is often called a map; 
of course, such morphisms always form a subcategory. In the next section, we 
shall consider how to characterize categories of the form Rel(C) amongst locally 
ordered 2-categories. 

In passing, we note 


Scholium 3.1.4 Let f: A— B be a morphism in a regular category. 
(i) f is monic iff the unit ta < f°f, is an identity. 
(ii) f is a cover iff the counit fe f° < tp is an identity. 
Proof Both observations are immediate from the construction of these inequal- 


ities in the proof of 3.1.3(i). o 


We have already remarked that composition of relations does not in general 
distribute over the finite intersections which exist in the ‘hom-posets’ of Rel(C). 
The following result, known as the modular law (for reasons which will become 
clear in 3.2.2(a@) below), provides a substitute in some cases for the missing 
distributive law. 


Proposition 3.1.5 Let¢d: A% B,w~: BC andy: A C be three relations 
in a regular category. Then 


PENX S(YNX’)o- 


Proof Let (f,g):T — Ax B, (h,k): U — Bx C and (I,m): V > Ax C be 
tabulations of the three relations. First form the limit of the diagram 
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by definition, this is an object P with morphisms x: P—T, y: P—U and 
z: P — V satisfying fx = lz, gx = hy and ky = mz (and universal among 
such). (If we were working with relations in the category Set, P would be the 
set of all triples (a, b,c) such that a is ¢ related to b, b is w-related to c and a is 
x-related to c. The underlying idea of this proof is that if (a, c) is in (a tabulation 
of) YN x, then there must exist b such that (a,b,c) is in P; but then (b,c) is 
in ~Nx¢° and (a,b) is in æ.) 

We claim first that the image of (lz,mz): P— Ax C is a tabulation of 
wo N x. To prove this claim, note that a tabulation of yọ N x is obtained 
by constructing the pullback 


g= 1 37: , 
i f 

h 
U ———-> B 


the image factorization Q —> I — A x C of (ft, ku), and then the pullback 


R >_> I 


(l m) 
V>—> AxC 


But an elementary diagram-chase shows that there is a pullback 


P ——~> Q ’ 


z (ft, ku) 


l,m 
\ ) AxC 


so since image factorizations are stable under pullback there is a cover P — R. 

Similarly, the image of (hy, ky): P — B x C (say S > B x C) is a tabulation 
of YNxg?. Now the cover P — S, together with the morphism z: P — T, induce 
a morphism from P to the pullback 


W —— sS 


T ———> B 
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this is easily seen to be a morphism over A x C', and hence to induce an inequality 
between the images of the two morphisms into A x C. But the image of W — 
T x S + Ax C is by definition a tabulation of (Y N x¢°)¢. o 


Corollary 3.1.6 Let ọ be a morphism of Rel(C) such that $¢° < tp (where 
B is the codomain of }). Then the distributive law 


(YNx)d = pN xe 
holds whenever the composites are defined. 


Proof We have (Y Nx)¢ > (YN xop) = HON xe by the given assumption 
and the modular law, but the reverse inequality always holds because (—)¢ is 
order-preserving. Oo 


The inequality ¢¢° < tg holds in particular when ¢ is a map, by 3.1.3; but 
in this case it would be easy to prove the distributive law directly. 


Suggestions for further reading: Carboni [218], Carboni & Walters [232], 
Freyd & Scedrov [381], Kelly [584]. 


A3.2 Allegories and tabulations 


Abstracting from the categories of relations considered in the last section, we 
define an allegory to be a locally ordered 2-category A whose hom-posets have 
binary intersections (note that we don’t, for the moment, require them to have 
top elements), equipped with an anti-involution ¢ > ¢° and satisfying the 
modular law 


PENX S(YNXG")d 


whenever this makes sense. As is already apparent, we shall maintain continuity 
with the last section by using Greek letters (and the notation ¢: A % B) for 
arbitrary morphisms in an allegory, reserving italic letters and ordinary arrows 
for maps (i.e. morphisms with right adjoints). We write Map(A) for the subcat- 
egory of maps of A; note that it contains all isomorphisms of A (and they are 
isomorphisms in Map(A)), since an isomorphism is both left and right adjoint 
to its inverse. (Partly for this reason, we shall denote identity morphisms in A 
by 1 rather than z.) 

Because of the anti-involution, the apparent asymmetry in the modular law 
is only apparent; it implies (indeed, is equivalent to) the law 


PENX <V(ONY’x) . 


A further immediate deduction from the modular law, which we note for future 
reference, is 
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Lemma 3.2.1 For any morphism p in an allegory, we have ¢ < o¢°¢. 
Proof We have ¢=146N¢ < (11 4¢°)¢ < 4¢°¢. Oo 
Before proceeding further, we arm ourselves with a couple of new examples. 


Examples 3.2.2 (a) Let L be a lattice. We may regard L as a locally ordered 
2-category with one object, interpreting composition as join in L (and intersec- 
tion as meet in L); since join is commutative, the identity mapping A° = À is an 
anti-involution. In this context, the modular law is equivalent to the law familiar 
to lattice-theorists under that name, i.e. 


Apu <XXpNv) whenever à< v; 


the point is that the latter inequality is equivalent to v = vA°. Note that any 
lattice satisfies å < AA°A (in fact we have equality here), so this inequality does 
not imply the modular law. In fact, as is well known, any lattice generated by 
two elements is modular (and even distributive), so the modular law cannot be 
equivalent to any collection of identities in two or fewer variables. 

(b) Let L be a distributive lattice. The allegory Mat ,;(Z) of finite Z-valued 
matrices has finite sets (or natural numbers, if you prefer) as objects, and mor- 
phisms A + B are ‘B x A matrices with entries in L’, i.e. functions B x A— L, 
the partial ordering (and binary intersection) on hom-sets being computed point- 
wise. Composition is defined by matrix multiplication, as in 1.4.6: the composite 
of ¢: A% B and y: B > C is the function x defined by 


x(c,a) = \/{v(c, b) A ob, a) | b € B}, 


the distributive law in L being precisely what is needed to verify that this com- 
position is associative. The identity matrix 14 is what you think it is, and ¢° 
is simply the transpose of ¢. The verification of the modular law is straightfor- 
ward. Moreover, if L is a frame (i.e. L has arbitrary joins, and binary meets 
distribute over them), we may embed this example in a larger allegory Mat(L) 
whose objects are arbitrary sets, the rest of the definition remaining unchanged. 

If L is the two-element lattice 2 = {0,1}, then Maty(Z) is simply another 
description of the allegory Rel(Set;), using the bijection between subsets of 
A x B and their characteristic functions. For a more general L, however, it is 
instructive to compute the maps A — B in Mat, (L) (or in Mat(Z)): using 3.2.3 
below, they are easily shown to be those matrices whose columns are disjoint 
decompositions of 1 in L, i.e. those ¢ such that (b,a) A ¢(b’,a) = 0 whenever 
bb, and 

\V{9(b, a) |b€ B} =1 

for each a € A. Thus we see that Map(Mat,(L)) is equivalent to a full sub- 


Category of the category Pos(L) of 1.4.6, consisting of those objects which are 
copowers of 1 in Pos(L). (We shall see how to recover the ‘missing objects’, 
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by splitting cores in Mat (JZ), in 3.3.7 below.) Note in particular that if 1 has 
no nontrivial disjoint decompositions in L (for example, if L is totally ordered), 
then any map in Mat,;(Z) has all its entries 0 or 1; thus Map(Mat,;(L)) = 
Map(Mat ;(2)) = Set, although Mat; (L) is clearly not equivalent to Mat (2) 
unless L = 2. 

(c) The axioms for an allegory do not include anything (such as the existence 
of limits or power-objects) which allows us to construct new objects from old; 
it is therefore immediate that any full subcategory of an allegory is an allegory, 
no matter how bizarre its collection of objects. For example, the collection of all 
sets with either 23 or 35 elements, and all relations between them, is an allegory. 
At first sight, this may look like a defect of the allegory axioms, but it is actually 
one of their strengths: the point is that many allegories of interest to us — that 
is, allegories whose categories of maps are toposes — may be generated (by the 
idempotent-splitting processes described in the next section) from quite small 
(in the non-technical sense) full sub-allegories, and these sub-allegories are often 
easier to describe explicitly than the large categories in which we are interested. 


Next, we prove an important, result (already established for allegories of the 
form Rel(C) in the course of proving 3.1.3), which is the first indication we have 
had of the real power of the modular law. 


Lemma 3.2.3 
(i) In an allegory, the right adjoint of a map f is necessarily f°. 
(ii) The maps in an allegory are discretely ordered (i.e. f < g implies f = g). 


Proof (i) Let f: A— B be a map. We can certainly write its right adjoint as 
g° where g is a map, since an adjunction f 4 ¢ yields an adjunction ¢° 4 f°. 
Now we have 
(PAG )F = (9 NI1af?)f Zf =la 

by the modular law and the fact that g° f > 14; and clearly f(f° Ng°) < fg° < 
lg. So f° Ng° = (f N g)° is also right adjoint to f, but by the uniqueness of 
adjoints this forces (f N g)° = 9°, ie. f Ag = g. By symmetry, we also have 
fNg=f,sof=g. 

(ii) Let f and g be maps such that f < g. Then we also have f° < g°, and 
hence 

9S 9P FS OP F SSF, 

so f=g. Oo 


Let @: A P B be a morphism in an allegory. By a tabulation of ¢ we mean 
a pair of maps f:T — A, g: T —> B with common domain satisfying ¢ = gf° 
and f°f Ng°g = Ir. It is easy to see that tabulations in Rel(C) are (modulo 
the isomorphism Map(Rel(C)) = C) exactly what we called tabulations in the 
previous section; the second condition in the definition says that the pair (f, 9) 
is jointly monic (cf. 3.1.4(ii)). Thus every morphism in Rel(C) has a tabulation; 
we call an allegory tabular if this condition holds. 
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Clearly, not all allegories are tabular. In a modular lattice L, regarded as 
an allegory as in 3.2.2(a), only the identity morphism (which is the element 0 
of L!) is a map, and hence it is the only morphism with a tabulation. Again, if 
L is a distributive lattice with more than two elements, the allegory Mat ;(L) 
of 3.2.2(b) is not tabular: if À is any element of L other than 0 or 1, then it 
is easy to verify that the matrix ¢: {a} % {a} defined by ¢(a,a) = À cannot 
be tabulated. However, in the next section we shall see how to embed certain 
allegories (including Mat ;(Z)) into tabular allegories in a canonical way. 

In passing, we note that in a tabular allegory the operation ¢ + ¢° may 
be derived from the other structure; for if f is a map then f° must be its right 
adjoint, and if ọ = g f° then ¢° = fg°. In theory, therefore, one could axiomatize 
the notion of tabular allegory without mentioning this operation; but in practice 
it would make the statement of the modular law intolerably opaque. 


Lemma 3.2.4 Suppose ¢: A B has a tabulation (f,g), and let z: C —> A, 
y: C — B be maps. Then yx° < ¢ iff there exists a map h such that x = fh and 
y= gh. Moreover, if such anh exists it is unique. 


Proof If«z= fh and y = gh, then yr° = ghh° f° < gf° = ¢ since h is a map. 
Conversely, suppose yx° < ¢; define 0 = f°x N g°y: C % T. Since x and y are 
maps, we have lo < y °yz°x < y°gf°x. Now 


0°90 = (2° f Ny gf rN (f £N g°y)) 
> (2°f Ny g) rN lc 
> le Ny gf rN le = ic 


by two applications of the modular law, and 


00° = (fox N g°y)(2° f N y°g) 
< f xx? f N g’yy’g 
< ffNg°’9 = 17, 
so 6 is a map (and we can now denote it by h). Next, 


fh=f(fPrng’y) <fr 


whence fh = x by 3.2.3(ii), and similarly gh = y. Finally, if k is any other map 
Satisfying fk = xz and gk = y, then 


k= (f° fg’ g)k < fo feng’ gk = fPrng’y=h, 
whence h = k by 3.2.3(ii) again. O 


Corollary 3.2.5 Any two tabulations of a given morphism in an allegory are 
uniquely isomorphic. 


Proof Apply 3.2.4 each way round to the two tabulations. o 
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Lemma 3.2.6 Let f: A— B be a map in a tabular allegory A. The following 
are equivalent: 


(i) f is monic in Map( A). 
Gi) f° f =la. 
(iii) (J, f) is a tabulation of f f°. 
Proof (ìi) => (ii): Suppose f is monic, and let (h,k) be a tabulation of f° f. 
From the equality f°f = kh° we deduce fk < fkh°h = ff°fh = fh, whence 
fk = fh by 3.2.3(ii), whence k = h since f is monic. Sol, < f° f = hh® < Ia. 
(ii) > (iii) is immediate from the definition of a tabulation. 


(iii) => (i): If (f, f) is a tabulation of anything, then it follows from the 
uniqueness clause of 3.2.4 that f is monic in Map(.A). O 


Proposition 3.2.7 Let A be a tabular allegory. Then Map(A) is locally 
regular (that is, it has pullbacks and images, and covers are stable under 
pullback). 


Proof First we observe that if f,g,h,k are four maps satisfying hf = kg, 
then gf° < k°kgf? = k°hf f° < k°h; and conversely if gf° < k°h, then kg < 
kof°f < kk°hf < hf, whence hf = kg by 3.2.3(ii). Thus 3.2.4 tells us that 
we may construct a pullback for a pair of maps (h,k) with common codomain 
by finding a tabulation of k°h. Next, to form the image of a map f: A — B, 
consider a tabulation of f f°, say (g9: T — B,h: T > B). Since f f° < 1g and 
isg is tabulated by (1g, 1g), 3.2.4 yields a unique map k such that 1 gk = g =h; 
that is, g = h. It now follows from 3.2.6 that g is monic in Map(.A), and 3.2.4 
yields a unique map q: Á — T with gq = f. Moreover, if m: C = B is any 
monic map through which f factors, then gg° = ff° < mm?®; since (m, m) 
tabulates mm° by 3.2.6, 3.2.4 now yields g < m in Sub(B). So (q, g) is an image 
factorization of f. 

In particular, we see that f is a cover in Map(.A) iff f f° = 15. Now suppose 
that we have a pullback square 


f 


A ———~> B 


g h 


ee» 


in which h is a cover; then 1c < k°k = k°hh°k = gf° fg° (the last step by the 
construction of pullbacks given earlier). So 


lc = lc Ngf?’ fg? < (gN gf? fg? = 99° 


using the modular law and the inequality 14 < f° f; thus g is a cover. ü 
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Thus we have shown that tabular allegories are essentially the same thing as 
categories of relations of locally regular categories. (The proof given in the last 
section that Rel(C) is an allegory goes through almost unchanged if C lacks a 
terminal object (for example, if C is the category LH of spaces and local home- 
omorphisms, cf. 1.2.7), although we have to strengthen the definition of ‘locally 
regular category’ slightly. The notion of relation must of course be redefined if 
C fails to have binary products; we must think of a relation as an isomorphism 
class of jointly-monic pairs rather than a subobject of a product — note that in 
general the poset of relations from A to B will fail to have a top element. In 
order to construct binary intersections of relations, we need to have equalizers 
as well as pullbacks in C, and in order to define composition we need images for 
pairs of morphisms as well as single morphisms — i.e. we need to know that each 
pair of morphisms with common domain factors in a best possible way through a 
jointly-monic pair. In fact it is not hard to verify, by arguments similar to those 
in the proof of 3.2.7, that both these extra conditions are satisfied by Map(A) 
for any tabular allegory A; they are also satisfied by LH.) 

To characterize the categories Rel(C) where C is regular, we merely have 
to add some condition to the axioms for a tabular allegory A which ensures 
that Map(A) has a terminal object. The following definition is only slightly less 
blatant than demanding this explicitly, but it seems to be the best we can do: 
an object U of A is called a unit if ly is the largest morphism U % U, and for 
every object A there is a morphism ¢: A % U with ¢°¢ > 14. 


Lemma 3.2.8 If U is a unit in an allegory A, then it is a terminal object in 
Map(A). The converse holds if A is tabular. 


Proof If U is a unit, then there certainly exists a map A — U for every A, 
since every ¢: A % U satisfies 6¢° < ly. But if f and g are two such maps, then 
we have fg° < ly, so f < fg°g < g, whence f = g by 3.2.3(ii). 

Conversely, if U is terminal in Map(A), then the second condition for being 
a unit is clearly satisfied. For the first, let ¢: U % U; then in any tabulation 
(f,g) of @ we have f = g, whence @ = gf? = ff° < lv. Oo 


For future reference, we also note 


Lemma 3.2.9 Ifan allegory has a unit, then each of its hom-posets has a top 
element. 


Proof Let U be a unit in A, and for each object A let pa denote the unique 
map A — U. If ¢: A®U, then ¢ < dp4pa < pa since dp < ly. Now for any 
p: A 3 B we have ppw < pa, so that Y < p$pa by adjointness, i.e. pppa is the 
top element of A(A, B). O 


We are now ready to put everything together: 


Theorem 3.2.10 
(i) If C is a regular category, then Rel(C) is a tabular allegory with a unit, and 
C = Map(Rel(C)). 
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(ii) If A is a tabular allegory with a unit, then Map( A) is a regular category, 
and A S Rel(Map(.A)). 


Proof The only part of this that we have not already verified is the isomor- 
phism A = Rel(Map(.A)). But the definition of tabularity and 3.2.5 ensure that 
we have a bijection between morphisms A + B in A and relations A +} B in 
Map(.A); and the construction of pullbacks and images in Map(.A), given in 
the proof of 3.2.7, make it straightforward (though not entirely trivial) to verify 
that this bijection commutes with composition — the fact that it preserves the 
partial order on hom-sets is immediate from 3.2.4, and that it commutes with 
ot œ is trivial. O 


Theorem 3.2.10 can be made functorial. If we define a (unital) morphism of 
allegories to be a functor commuting with intersections and with g+> ¢° (and 
preserving the unit), then it is easy to see that unital morphisms between tabular 
allegories with unit correspond to regular functors between regular categories. 
(It is possible, though less straightforward, to make it 2-functorial; the right 
notion of ‘natural transformation’ between morphisms of allegories is not just 
that of a natural transformation of functors, but something more complicated. 
We shall not go into details here.) 

It is also possible to obtain more restricted equivalences from 3.2.10, by 
adding further categorical or allegorical structure to each side. We give two 
examples of this here; further examples will occur in the next two sections. 


Proposition 3.2.11 Let A be a tabular allegory with a unit. 


(i) Map(A) is a coherent category iff the hom-posets of A have finite unions 
(including least elements) and composition distributes over these. 


(ii) Map(A) is a positive coherent category iff A has the structure described in 
(i), together with finite products (equivalently, finite coproducts). 


Proof (i) It is clear that if C is a coherent category then the hom-posets of 
Rel(C) have finite unions; and the fact that unions are stable under pullback 
in C makes it easy to verify that composition in Rel(C) distributes over unions. 
Conversely, if Rel(C) has the extra structure indicated, we deduce that C is 
coherent by observing that subobjects of A in C correspond to relations A > 1, 
and pulling back subobjects along f: A — B corresponds to composing such 
relations with f,. 

(ii) If A and B have a disjoint coproduct AL B, then for any object C we have 
C x (AL B) = (C x A) Il (C x B), and any subobject of C x (AH B) decomposes 
uniquely as the disjoint union of a subobject of C x A and a subobject of C x B. 
Using this, it is easy to see that A II B is both a product and a coproduct of A 
and B in Rel(C) — and the initial object of C, being strict, is both initial and 
terminal in Rel(C). 

Conversely, let A be an allegory satisfying the conditions of (i), and suppose 
two objects A and B have a coproduct A+ B in A. (Of course, it is also their 
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product, because of the anti-involution.) Let ny: A A+B, ve: B} A+B be 
the coprojections; let 71: A + B + A be the unique morphism satisfying 7,1) = 
l4 and m2 = 0, and let m2: A+ B% B be defined similarly. Consider the 
morphism 0 = rm1 U vana: A+ Bq A+ B; since composition distributes over 
unions, we have fr = n U0 = v and Org = 0 U rs = v2. So by the uniqueness 
of morphisms out of coproducts, we have 6 = 14+ 3; in particular, 1:7) < 1448. 
But mıvı = 14, so v is a map, and m = v? by 3.2.3(i). In fact vı is monic in 
Map(.A), since it is split monic in A. Similarly, v2 is a monic map; so we can 
regard A and B as subobjects of A+ B in Map(.A). To compute the intersection 
of these subobjects, the construction of 3.2.7 tells us to tabulate the morphism 
V5V1 = T = 0; but this clearly yields the smallest subobject of A+ B, so A and 
B are disjoint. It now follows from 1.4.4 that A and B have a disjoint coproduct 
in Map(.A) (in fact, though we don’t need this, the coproduct is A + B itself, 
since the equation viv? U vava = 1 indicates that the union of the two given 
subobjects is the whole of A + B). Oo 


If A is an allegory (not necessarily tabular) whose hom-posets have finite 
unions which are distributed over by composition, we shall call it a union allegory. 
If A additionally has finite (co)products, we call it a positive union allegory. Part 
of the preceding proof can now be recast: 


Scholium 3.2.12 If two objects of a union allegory A have a coproduct, it is 
also their coproduct in Map(A). 


Proof Suppose the coproduct A+ B exists in A. The argument in the proof of 
3.2.11(ii) shows that the coprojections are maps (and we shall now denote them 
by nı: A — A+B, no: B — A+ B), so any map with domain A + B induces 
by composition a pair of maps with domains A and B. Conversely, suppose we 
are given maps f: A > C, g: B — C; we must show that the unique morphism 
ġ: A+ B %} C induced by f and g is a map. But we have ¢ = ¢(nynj Ungn3) = 
fn? U gn3, so 
°$ = (nı f? Unag®)(fnz U gn) 
> m f? fni U nag’ gna 
> nın Ungns = la+s, 


and 
p° = (Fn? U gn3)(n1 f° U nag?) 
= fnini f? U fnnag? U gnani f° U gngnag° 
= ff? VUOUDUgGY <ic. 


O 


A similar argument shows that the initial object 0, if it exists, in a union alle- 
gory A is also initial in Map(.A) (in fact strict initial, since the unique morphism 
A > 0, being the opposite of a map, can only be a map if it is an isomorphism). 
The argument also extends readily to infinite coproducts, provided the hom- 
Posets of A have infinite unions and composition distributes over them. In the 
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context of a particular model of set theory, we shall call an allegory geometric 
if its hom-posets are complete lattices (i.e. small posets with arbitrary unions) 
and composition distributes over arbitrary unions, and positive geometric if it 
additionally has arbitrary set-indexed coproducts. Of course, geometric alle- 
gories which are tabular and have units correspond to the geometric categories 
introduced at the end of section A1.4 (and positive geometric allegories similarly 
correspond to oo-pretoposes, provided their equivalences split — cf. 3.3.9 below.) 


Example 3.2.13 For a distributive lattice L, Mats(L) is a positive union 
allegory. Unions of matrices are computed pointwise, and it is easy to verify that 
composition distributes over them; the coproduct of two objects A and B is just 
their disjoint union, since a matrix ¢: C x (A II B) — L is uniquely determined 
by the pair (¢lcx4,¢\cx se). Similarly, if L is a frame, then Mat(L) is a positive 
geometric allegory. 


It should be noted that, in our definition of union allegory (and of geo- 
metric allegory), we have not required that intersection of morphisms should 
distribute over union. For tabular union allegories, this distributive law auto- 
matically holds, by 3.2.11 and 1.4.2, but in general it may fail: a counterexample 
is provided by any modular lattice which is not distributive, regarded as an alle- 
gory as in 3.2.2(a). Freyd and Scedrov [381] use the term ‘distributive allegory’ 
for a union allegory in which intersections distribute over unions. 


Suggestion for further reading: Freyd & Scedrov [381] 


A3.3 Splitting symmetric idempotents 


In this section we shall be concerned with various classes of endomorphisms in 
an allegory. We begin with a clutch of definitions. 


Definition 3.3.1 Let ¢: A % A be an endomorphism in an allegory. 
(a) We say ¢ is reflexive if 14 < ¢. 
(b) We say ¢ is symmetric if @ < ¢° (equivalently, ¢ = ¢°). 
(c) We say ¢ is transitive if od < ¢. 
(d) We say ¢ is an equivalence if it has all three of the above properties. 
(e) We say dis a core if ġ < 1,4. (The name ‘core’ is short for ‘coreflexive’; but 


there is no connection with the coreflexive pairs considered in 1.2.9.) 


It is clear that, in an allegory of the form Rel(C), the first three properties are 
equivalent to those (of tabulations of the relations) given the same names in 1.3.6. 
Thus equivalences in Rel(C) are the same thing as equivalence relations in C, as 
defined there. (We remark that there is little danger of confusion between this 
use of the word ‘equivalence’ and its usual meaning in a 2-category (for which 
see Section B1.1); since the only 2-isomorphisms in an allegory are identities, 
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the only equivalences in the 2-categorical sense are isomorphisms, and we do not 
need a separate name for them.) 

We also, of course, call ¢ idempotent if pġ = ¢. Clearly, an equivalence 
(indeed, any reflexive transitive morphism) is idempotent; but we also have 


Lemma 3.3.2 
(i) A symmetric transitive morphism is idempotent. 
(ii) A core is symmetric and idempotent. 


Proof (i) From 3.2.1, we have ¢ < ¢¢°¢; so if ¢ is symmetric and transitive, 


we have $ < A$ < o¢ < ¢. 
(ii) The inequality @ < ¢° also follows from ¢ < ¢$°¢, given @< 1,4. Anda 
core is trivially transitive, and hence idempotent by (i). E 


We recall the construction of the category CIE], where € is a class of idem- 
potents in a category C, described in 1.1.8. What is of interest to us now is that 
if C is an allegory and £ consists of symmetric idempotents, then C[&] also has 
an allegory structure. Before proving this, we need 


Lemma 3.3.3 Letec: A% A be a symmetric idempotent in an allegory. 


(i) Any splitting of € must have the form ġo? for some ġ: BY A. 
(ii) If € is a core, then any splitting of it must have the form f f° where f is 
a map (in fact a monomorphism in Map(A)). 
(iii) Ife is an equivalence, then any splitting of it must have the form g°g where 
g is an epimorphism in Map( A). 


Proof (i) Suppose e is split as dy, where Yy = 1g. Then by 3.2.1 we have 
P = pop < Pe bod? = O'R” = (by)? = bod = Y 


and similarly Y° < ¢, so y = ¢°. 

(ii) If € is a core and splits as #¢°, then we have ¢¢° < 1 and ¢°¢ = 1, so ġ 
is a map; and it is split monic in A and hence monic in Map(.A). 

(iii) is similar to (ii). 


If A is a tabular allegory, then by 3.1.4 the epimorphism g in (iii) must 
actually be a cover. Thus we can think of the process of splitting cores in A as 
adjoining ‘missing subobjects’ to Map( A), and similarly splitting equivalences 
in A adjoins ‘missing quotients’ to Map(.A). We shall make this more precise 
shortly. 


Theorem 3.3.4 Let A be an allegory and E a class of symmetric idempotents 
in A (including all the identities). Then AJE) has an allegory structure making 
the functor I: A— AJE) of 1.1.8 a morphism of allegories. Moreover, 

(i) If A has a unit, then so does AJE] (and I preserves it). 

(ii) If A is a union allegory, then so is A[E] (and I preserves unions). 
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(iii) If A is a positive union allegory and E is either the class of all cores or 
the class of all equivalences in A, then A[E] is a positive union allegory 
(and I preserves finite coproducts). 


Proof The allegory structure is defined in the obvious way: the opposite of 
h: P € is f°: e= ° % 6° = ô, and the intersection of ġ and Y: 6 I} cis Ny. 
Of course, it needs verification that the latter is a morphism 6 % €. But (¢Nw)d < 
06 N Yå = NY, and 


(PN y) = (GN Ye) = GSN YS = ONY, 


so (@¢NW)éd = Ny, and similarly «(6N Y) = dN wy. The validity of the modular 
law in A[Š] is immediate from its validity in A. And it is immediate from the 
definitions that J is a morphism of allegories. 

(i) If U is a unit in A, then 1z(y) is the greatest morphism [(U) % I(U), 
since J is full (and preserves the ordering on hom-posets); and if €e € E has 
domain (and codomain) A, then the composite pae: € % F(A) % I(U) satisfies 
le =€ < €°pypae. Thus (U) is a unit in A[é]. 

(ii) Unions in the hom-posets of A[E], like intersections, are constructed in 
the ‘simple-minded’ way; and since composition in A distributes over unions, it 
is immediate that if @ and y are both morphisms 6 9> € in A[E], then so is pU% 
(and that composition distributes over unions in A[é]). 

(iii) First suppose that £ is the class of cores. If € and 6 are cores on objects A 
and B respectively, consider the morphism 0 = nyen? Un2gdn3: A+B A+B 
(where the n; are the coprojections, as in 3.2.12). Clearly, we have 0 < nynj U 
n2ns = 1448, 80 0 is a core. Moreover, 


Onie = njeni nie U N2dngnie = nie UN = nye, 


so n € is a morphism (in fact a map) e 9> 0 in A[&]. Similarly, nod: ô % 0; 
we claim that these morphisms make 0 into a coproduct of € and 6 in A[Ă]. 
For if ġ:c%} y and Y4: 6% y are two morphisms in A[Š], there is a unique 
x: A+B%C in- A (where C = dom y = cod y) satisfying yn, = ¢ and 
Xn2 = Y; then yx = x by uniqueness, and 


XO = pen? U dng = dny U yna = x(mıni U nang) = x, 


so x:0% y in AJE]. And x is easily seen to be the unique such morphism 
satisfying yn € = @ and yn2d = w. Once again, it is clear from the construction 
that J preserves binary coproducts; and a similar argument shows that if 0 is 
initial in A then J(0) is initial in A[é]. 

The argument in the case when € is the class of equivalences is identical, 
except that we have to verify that 0 is an equivalence rather than a core. It is 
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clearly reflexive and symmetric if € and 6 are; for transitivity, we have 


00 = (nyen? Un2dn3)(men? U n2dn3) 
= meen UOU OU n2ddn3 
<9, 


by transitivity of € and ô. 0 


It will be observed that (iii) of the above theorem could also be proved under 
the hypothesis that £ is the class of all symmetric idempotents of A; but we 
shall not need this result (and cf. 3.3.12 below). For the rest of this section, we 
shall use the letters X, Q and S to denote respectively the classes of cores, of 
equivalences and of all symmetric idempotents in a given allegory A. 

Rather remarkably, 3.3.4 is best possible in a certain sense: once we have 
split the symmetric idempotents, we have split all the idempotents that can be 
split in an allegory (which is not to say that all such idempotents are symmetric, 
of course). 


Proposition 3.3.5 Let « be an idempotent in an allegory A, and suppose there 
is a faithful morphism of allegories F: A — B such that F(e) splits in B. Then 
I(€) splits in AJS]. 


Proof Consider the morphism 6 = eMe°. 6 is clearly symmetric, but it is also 
transitive since 66 = (eNe°)(eNe°) < ee N e°? = eN e°. Hence ô is a symmetric 
idempotent by 3.3.2(i). We claim that ede = e and ded = ô; since the validity of 
these equations is reflected by the faithful functor F, it suffices to prove them 
under the assumption that € is already split, say € = dy with yọ = 1. But then 


we have 
cde = ploy N Wd" oy 
> elpy ¢°) ov 
> ovoveN 1) 


= = € 


since ~¢@ = 1, and 


ded = (PN WS" oy(oy N yp.) 
< (GN PPP) Podgly NGPA) 
= (NYP APN g) 
< gy Npp = ð; 


the reverse inequalities are trivial since € = é? > ede and 6 = 6? < ôeô. It now 
follows that any morphism of allegories which splits 6 will also split ¢; for if 
ô = aß with Ba = 1, then € = (ea)(Ge) and 


(Ge)(ea) = Bea = (85)e(da) = Pda = Ba=1. 
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It follows easily from 3.3.3(ii) that a core has a splitting f f° iff it has a tabula- 
tion (f, f); so, in a tabular allegory, ail cores must split. However, this condition 
is not sufficient for tabularity: consider the allegory of all sets of cardinality 
at most 10 and relations between them. In this, all cores (indeed, all symmetric 
idempotents) split, but not every morphism has a tabulation, essentially because 
the category of maps lacks products. 

We say an allegory is pre-tabular if every morphism is contained in one which 
has a tabulation. If the hom-posets of A have top elements (e.g. if A has a unit, 
cf. 3.2.9), then this is equivalent to demanding that these top elements have 
tabulations. Further, if the greatest morphism A % B has a tabulation, the 
latter is a product of A and B in Map(.A), by 3.2.4. (The converse may fail: if 
A and B have a product in Map(A), the most we can say about the morphism 
A B tabulated by the product projections is that it is the greatest which can 
be factored in the form gf°.) 


Proposition 3.3.6 

(i) An allegory is tabular iff it is pre-tabular and cores split. 

(ii) If A is a pre-tabular allegory, then A|K] is tabular, and the functor 
I: A > AK] is universal among morphisms from A to tabular allegories. 


Proof (i) One direction is obvious. Conversely, suppose A is pre-tabular 
and cores split; let 6: Aq B be a morphism and (f,g) a tabulation such that 
@<gf°. Then, if T = dom f = dom g, the domain of the maps in a tabulation 
of ¢ ‘ought to be’ a subobject of T, so we should be able to obtain it by splitting 
a suitable core on T. Consider the core 1r N g° df; suppose it splits as hh°. We 
claim that (fh, gh) tabulates ¢. First, we have 


(gh)(fh)° = ghh’ f° < go’ off? <o 


and 
$=$NGf? <g9(GONSF*) gl of N1r)f? = ghk fe. 
Then 
(fh)°( fh) N (gh)°(gh) =h° f° fh N h?g’ gh 
l a Ng°g)h 
= o = 1, 


using 3.1.6 and its dual at the second step. 

(ii) Let @: € > 6 be a morphism in A(X], and let (f, g) be a tabulation of a 
morphism which contains ¢ in A. Let 0 = f°ef N g°dg; then 0 < f° f Ng°g = 1, 
so 6 is a core and we have ef: 4% e, 5g0: 0 % 6 in A[K]. Moreover, 


€f00° fre = €fO fre < ef fre<e 


and 


0° fee fO = Of ef°O > 000 =9, 
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so €f@ is a map in A[X], and similarly 5g@ is a map. Now 


(599) (€f0)° = 590 f°e 
= 69(f°ef N gôg) fre 
> (ðgf ef Ng) fre 
2 Og f? Ngfre 
29ON b= ONG= 


d 
= (<f8)°(€f0) N (596)°(5g0) =0 fe fO N 0g°5g0 
=6( fref N g°ô0)0 
<(f°f n.9°9)8 = 8 


(where we have again used 3.1.6 at the second step), so (€f0,6g0) tabulates a 
morphism containing ¢. Thus A[K] is pre-tabular. But any core in .A[K] is also 
a core in A and so splits in A{K]; thus A[X] is tabular by part (i). The last 
assertion is also immediate from (i), since if T: A — B is any morphism from A 
to a tabular allegory, it must send cores in A to (split) cores in B. O 


Example 3.3.7 The allegory Mat;(L) of 3.2.2(b) is pre-tabular; the greatest 
morphism A % B is the matrix with all entries 1, which lies in the sub-allegory 
Mat ;(2) and has a tabulation there. So if we split its cores we obtain a tab- 
ular allegory, which has a unit and is a positive union allegory since Mat ;(L) 
enjoys these properties. What is the corresponding (positive coherent) category 
of maps? Working through the definitions, we see that a core « in Mat,(L) 
is a diagonal matrix, i.e. one satisfying e(a,a’) = 0 whenever a £ a’, so an 
object of Mat ;(L)[K] may be identified with a pair (A,a) where A is a finite 
set and a: A — L an arbitrary function. A matrix p: B x A — L defines a mor- 
phism (A,a) + (B, 8) iff ¢(b,a) < (b) A a(a) for all a and b; it is a map if, 
in addition, its ath column is a disjoint decomposition of a(a@) for each a, i.e. 
o(b, a) A d(b’,a) = 0 whenever b £ b’, and 


VV {9(6, a)|be B} =ala). 


Thus we see that Map(Mat ;(L)[K]) is none other than (a category equivalent 
to) the positivization Pos(L) of L, which we considered in 1.4.6; equivalently, 
Mat ¢(L)[K] is equivalent to Rel(Pos(L)). 


Example 3.3.8 The core-splitting construction may also be used to give an 
alternative description of the regularization of a cartesian category, constructed 
in 1.3.9. We recall from 3.1.1 that if C is merely cartesian, then relations in C do 
not in general form a category, since they do not possess an associative composi- 
tion. However, we do always have a bicategory (cf. Section B1.1) Gpan(C) whose 
objects are those of C, whose morphisms A — B are spans in C (that is, diagrams 
of the form (A — T — B), without any requirement that the pair be jointly 
monic), and whose 2-cells (A + T — B) — (A + T’ — B) are morphisms 
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T — T’ commuting with the given morphisms to A and B. (Equivalently, the 
hom-category Gpan(C) (A, B) may be regarded as the slice category C/A x B.) 
The composite of two spans (A + T — B) and (B e U — C) is defined by 
forming the pullback 

V ——> U 


T ———~> B 


and composing its projections with T — A and U — C. (Note that this compo- 
sition is not strictly associative in general; thus Gpan(C) is only a bicategory, 
not a 2-category.) If we form the preorder reflections of the hom-categories 
Gpan(C) (A, B) (that is, we form their quotients by the equivalence relations 
which identify all parallel pairs of 2-cells), we still do not have a 2-category, 
but if we further identify all isomorphic pairs of 1-cells, then composition does 
become strictly associative. Thus we have a locally ordered 2-category Gpan(C) 
whose objects are those of C and whose morphisms are equivalence classes of 
spans (under the equivalence relation which identifies two spans if there exist 
morphisms between them in both directions), with ordering defined by the rela- 
tion that there exists a morphism of spans from (a span representing) the first 
morphism to the second. 

Moreover, Gpan(C) is an allegory: binary intersections of morphisms are 
given by (the equivalence classes of) pullbacks in C/A x B, the opposite of a 
morphism is defined just as in Rel(C), and the modular law is proved in the 
same way that we proved it for relations in 3.1.5 (though the proof is easier, 
because we do not have to take images). Also, the terminal object of C is eas- 
ily seen to be a unit in Gpan(C); and, for any morphism f: A — B of C, the 
morphism fe: A 9+ B of Gpan(C) represented by the graph of f (that is, the 
span (1, f): A— Ax B) is a map (though we do not claim that all maps in 
Gpan(C) are of this form), from which it follows easily that the top element 
of Gpan(C) (A, B) (which is of course represented by the span consisting of the 
product projections from A x B) has a tabulation. So Gpan(C) is pre-tabular; 
hence if we split its cores we obtain a tabular allegory with a unit — equiva- 
lently, by 3.2.10, we obtain an allegory equivalent to the category of relations of 
a regular category. 

Now if F: C — D is any cartesian functor from C to a regular category, we 
obtain a morphism of allegories from Gpan(C) to Rel(D) by sending the equiva- 
lence class of a span (T — A x B) to the image in D of (FT — FA x FB). Thus 
the universal property of 3.3.6(ii) tells us that Map(Gpan(C)[K]) is universal 
amongst regular categories into which C can be mapped by cartesian functors; 
hence it must be equivalent to Reg(C) as constructed in 1.3.9. (In fact it is not 
hard to show directly that Gpan(C)[K] ~ Rel(Reg(C)).) 
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Example 3.3.7 shows that Map(.A[K}) is not in general effective as a regular 
category; to achieve this, we need to split some more symmetric idempotents, 
namely the equivalences. We call an allegory effective if its equivalences split. 


Proposition 3.3.9 

(i) A tabular allegory with a unit ts effective iff its category of maps is 
effective regular. 

(ii) For any allegory A, A{Q] is effective, and the functor I: A— A[Q| is 
universal among morphisms from A to effective allegories. Moreover, if A is 
tabular, so is A[Q]. 


Proof (i) We have already observed that equivalences in Rel(C), for a regular 
category C, are the same thing as equivalence relations in C, as defined in 1.3.6. 
Moreover, an equivalence relation (a,b): R= A is the kernel-pair of a cover 
q: A B iff it is a tabulation of q°qe: A H A (by the proof of 3.2.7), iff it has 
a splitting in Rel(C) (by 3.3.3(iii)). 

(ii) As in 3.3.6(ii), it is clear that any equivalence in A[Q] is an equivalence 
in A and hence splits in A[Q]; so the first two assertions are immediate. Now 
suppose A is tabular; let ø: €} ô be a morphism of A[Q], and let (f,g) be a 
tabulation of ¢ in A. Define 0 = f°ef N g°dg; then 0 > f° f N9°9g = 1, so 0 is 
reflexive. It is also clearly symmetric, and 


00 = (fre f Ng SgS Ef N 9°59) 
<f eff ef N gigg ig 
< frefing’dg =0 


since f and g are maps, so @ is transitive. Now ef < ef f°ef < ef, but we also 
have ef0 > ef since 0 is reflexive; so ef is a morphism 0 % € in A[Q]. In fact it 
is easily seen to be a map, and similarly 6g: 0 — 6. Now ¢ = ġe = (dg)(ef)°, 
and 


(ef) (Ef) N (ôg) (69) = FEF 9°59 = 9, 
so (ef, 6g) is a tabulation of 6: € + 6 in A[Q]. o 


Corollary 3.3.10 Let C be a regular category. Then there exists an effective 
regular category Eff(C) and a full and faithful functor C — Eff(C) which is 
universal among regular functors from C to effective regular categories. Moreover, 
if C is a (positive) coherent category, so is Eff(C), and the functor C — Eff(C) 
is then coherent. 


Proof Define Eff(C) to be Map(Rel(C)[9]), and translate 3.3.9 via the equiv- 
alence of 3.2.10. (Note, incidentally, that a full and faithful morphism of allegories 
automatically restricts to a full and faithful functor between categories of maps.) 

oO 
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Remark 3.3.11 Let C bea regular category having coequalizers of equivalence 
relations. Then the inclusion C — Eff(C) has a left adjoint. For if ¢: A% A is 
an equivalence in Rel(C) — that is, an object of EF (C) -~ and A — R — Aisa 
tabulation of ¢, then it is easy to see that maps ¢ > tg in Rel(C)[Q} are exactly 
the maps A — B in C having equal composites with R = A. So we obtain the 
left adjoint by sending ¢ to the coequalizer of this pair in C. 


We note that the two splitting constructions, of cores and equivalences, could 
have been combined into one: 


Lemma 3.3.12 In an allegory, all symmetric idempotents split iff cores and 
equivalences split. 


Proof One direction is trivial. Conversely, suppose that cores and equiv- 
alences split, and let e€: A % A be an arbitrary symmetric idempotent. Then 
cN 1, is a core, and so we can split it as f f° for some f: B — A. Now consider 
ô = f°ef: BW B. We have 


ô> PENAS = fF fof =18, 


so ô is reflexive; it is clearly symmetric, and it is transitive since 
65 = fef f ef < frecef = Pef. 


Thus 6 is an equivalence, and we can split it as g°g for some g: B — C. Now 
Jg gf = fof? = ff ef f° < esince f f° < 1,4, but we also have 


Ff fPef fo =(EN la)e(eN 1a) 
= (e° N 1g )ec(e°E M14) 
> (eN e)(eN e) = &; 


and g f° fg° = gg° = 1c, so (g f°, fg°) is a splitting of e. O 


Thus, for any allegory A, we may identify A[X][Q] with A[S]; though one 
should beware that in this identification Q denotes the class of all equivalences 
in A{K], and not just those in A. In particular, we obtain 


Corollary 3.3.13 
(i) For any cartesian category C, Map(Gpan(C)[S}) is an effective regular 
category, and there is a full embedding C + Map(Gpan(C)[S]) which is 
universal amongst cartesian functors from C to effective regular categories. 
(ii) For any distributive lattice L, the category Map(Mat;(L)[S 5}) is a preto- 
pos, and there is an embedding L — Map(Mat;(L)[S]) which is universal 
amongst coherent functors from L to pretoposes. 


Proof The first assertion is immediate from 3.3.8 and 3.3.10; the second from 
3.3.7 and 3.3.10. a 
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We may describe the pretopos of 3.3.13(ii) explicitly as follows. A symmet- 
ric idempotent in Mat,(Z) is a function é: A x A — L satisfying €(a1,a2) = 
€(@2,@,) and €(a1, a2) A €(@2,@3) < e(a1, a3) for all a1, a2, a3 € A; that is, it is an 
‘L-valued partial equivalence relation’ on A. (‘Partial’ because we don’t demand 
the reflexivity condition e(a@,a) = 1.) And a map (A, €) —> (B,6) is a function 
¢:BxA-L satisfying 


p(b, a) < 5(b, b) A €(a, a), 
b(bi, a1) A (b1, b2) A e(a1, a2) < $(b2, a2), 
p(b1,a) A $(b2, a) < 6(b1, b2), 


and 
e(a,a) < \/{$(b,a) |b € B} 


for all a,aı,@2 € A and all b, b1,b2 € B. As usual, composition of maps is defined 
by matrix muliplication; it is not hard to verify directly that the four conditions 
above are stable under composition. 

In the case when L is a frame, and with the finiteness restrictions on A and 
B removed, the above formulae define the category of maps of Mat(L)[S]; once 
again this category is a pretopos, and we shall see in the next section that it is 
actually a topos. We denote it by Set(Z), and call it the category of L-valued 
sets. 


Suggestion for further reading: Freyd & Scedrov [381]. 


A3.4 Division allegories and power allegories 


Next, we consider the extra structure on an allegory which corresponds to that 
of a Heyting category. 


Definition 3.4.1 An allegory A is called a division allegory if, for each 
p: A B and each object C, the order-preserving map (~)¢: A(B, C) > A(A,C) 
has a right adjoint, which we call right division by ¢ and denote (—)/¢. 


Of course, the anti-involution ensures that if we have right division we also 
have left division ¢\(—) (right adjoint to ¢(—)), since ¢\y may be defined as 
(4° /p° Y. We note that the operation (—)/¢ satisfies the identities 


(YN xp = ¥/ONX/¢; 
polep z y 


and 
(x/¢)b <x 


whenever the expressions involved in them are defined; and indeed these identi- 
ties suffice to characterize (—)/¢ġ. A further useful identity is the associativity of 
double division: (¢\w)/x = ¢\(#/x) whenever this makes sense, for each of the 
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two expressions defines the largest 0 satisfying 60x < y. Similarly, we may verify 
that (¢/~)/x = ¢/(xw) whenever this makes sense, and that o(w/x) < (¢)/x. 
Also, if the division operations exist, then composition automatically distributes 
over any unions which may exist in the hom-posets of A; conversely, a geometric 
allegory (as defined at the end of Section A3.2) is a division allegory, by the 
poset version of the adjoint functor theorem. 


Lemma 3.4.2 Let A be a tabular allegory with a unit, whose hom-posets have 
finite unions. Then A is a division allegory iff Map(.A) is a Heyting category. 

Proof As usual, this is entirely straightforward if we translate it, via 3.2.10, 
into a statement about a regular category C and the corresponding allegory 


Rel(C). If (f,g) is a tabulation of ¢: A + B, then (—)@ may be regarded as the 
composite 


Sub(B x C) ao Sub(T x C) A Sub(A x C), 


and so has a right adjoint Vigx1)°(f x 1)* if C is a Heyting category. The converse 
is obtained by identifying Subc(A) with Rel(C)(A, 1). n 


Lemma 3.4.3 Let A be a division allegory, E a class of symmetric idempotents 
of A. Then AJE] is a division allegory, and the canonical functor I: A — ALE] 
preserves division. 


Proof Let ¢:€% 6 and y: €% y be morphisms of A[E]. Then y(%/p)6 is 
certainly a morphism ô % y; and 


b/)56 = h/p < w=. 


But if x: ô% 7 is any morphism satisfying x < w, then we have x < w/¢ in 
A, whence x = 7x6 < 7({/0)6. So y(—/¢)6 is right adjoint to 


AEI, y) 2S Allen). 


The second assertion is immediate. o 


If L is a Heyting algebra, then Mat,;(L) is a division allegory: if 6: A + B, 
w: AC and x: BUC are three L-valued matrices, we have xf < y iff 
x(c, b) A (b,a) < y(c,a) for all a,b,c, iff x(c,b) < (¢(b,a) > y(c,a)) for all 
a,b,c, and so we may define the matrix Y/¢ by 


hiel b) = Mb, a= ple a)) |a € A}. 


Thus we may deduce from the two preceding results that Pos(L) and 
Eff (Pos(L)) are Heyting categories. Similarly, if L is a complete Heyting algebra 
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(equivalently, a frame), then the category Set(L) defined at the end of the last 
section is a Heyting category. 

In fact, as we remarked earlier, Set(L) is a topos; so the next step is to seek a 
condition on an allegory which is equivalent, in the tabular case, to its category 
of maps being a topos. In one sense, there is a simple answer: 


Lemma 3.4.4 Let C be a regular category. Then C is a topos iff the functor 
C — Rel(C) which sends f to fe has a right adjoint. 


Proof Such aright adjoint, if it exists, will send each object A of C to an object 
PA equipped with a relation € 4; PA % A such that any relation ¢: B % A can 
be factored as E4 fe for a unique map f: B — PA. But, if we identify relations 
with subobjects of products, E4 fe is just another name for (f x 1,4)*(€a); so 
this is exactly the definition of a power object as given in Section A2.1. O 


Thus we could define a power allegory as an allegory A for which the inclusion 
Map(A) — A has a right adjoint. It turns out, however, that in the presence 
of the division operation this description may be further simplified, since the 
‘naming’ operation from morphisms to maps is definable in terms of the other 
structure. (And since a topos is a Heyting category by 2.3.5, it is not unreason- 
able to demand that the division operation should be present in a power allegory.) 
Before giving the definition, however, we need to introduce a ‘symmetrized’ ver- 
sion of the division operation: if 6: B + A and w: C % A are morphisms in a 
division allegory with the same codomain, we write (o|) for 


(ANY) 9 (8°/ 0") = W) ON A CH B. 


(In the allegory Rel(Set), (|y) relates a pair of elements (c, 6) iff they are 
related by % and ¢ respectively to exactly the same elements of A.) In general, 
we note that (1d) = (d|W)°; and if x: D + A is a third morphism, then 


(dld) (lx) S ANAYO NA (°/ 0°) (W°/X?) 
< (@\x) N (¢°/xX°) = (Alx) - 


Also, since ¢\¢ > 1 for any ¢, it is easy to see that (¢|¢) is always an equivalence. 
Conversely, if € is an equivalence in a division allegory, then transitivity yields 
c€ < e\e, whence e < (ele) by symmetry; but we also have e = 1\e > e\e > (ele), 
since € is reflexive and —\— is order-reversing in its first variable. Thus the 
equivalences in a division allegory are exactly the morphisms of the form (¢|¢). 


Definition 3.4.5 A division allegory A is called a power allegory if there is 
an operation assigning to each object A a morphism Ea: PA A satisfying 
(€a|E4) = lpg and 

lp < (¢\E4)(Ea \d) 


for any ¢: BY A. 
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If A is a power allegory, we shall call PA the power object of A, and write 
ro’ for (€4|¢): B+ PA, which we call the name of the morphism ¢: B % A. 
(Intuitively, we can think of the two conditions in the definition as allegorical 
versions of the extensionality and comprehension axioms: in Rel(Set), the con- 
dition (€A | €a) = 1pa says that two subsets of A which have the same members 
are equal, and the second condition says that for any œ: B + A and any b E€ B 
there is a subset, of A whose members are precisely the elements of A related to 
b by ¢.) To justify the terminology just introduced, we need 


Lemma 3.4.6 (i) For any ¢, "¢" is a map. 
ii) The assignment A> PA extends to a functor A — Map( A), right adjoint 
to the inclusion. 


Proof (i) We have "¢"¢” = (€41¢)(¢| Ea) < (€4] Ea) = 1 Pa, and 


oP b'=((Ea\d)°N (P\ Ea) CEA \b) NA (P\ EA)? N (E4 \9)) 
> 1p N((Ea\d)?N (o\Ea))(Ea\9) 
> lp Nlpn(¢\e€a)(Ea\d) = 1B 


by two applications of the modular law. 
(ii) First, we claim that €4"¢' = ¢ for any ¢: B + A. For 


Eao? <e€a(Ea\d) < 6, 


and 

SP ONTOS ANEAN? SEAT". 
Now if f: B — PA is any map satisfying €4 f = ¢, we have f < (€14 \¢ġ), and 
also f¢° = f f° €4<€4, whence f < (E) /¢°). So f <T’, whence f =" ¢' by 
3.2.3(ii). Thus f HE, f is a bijection 


Map( A) (B, PA) —> A(B, A); 
the remaining details are straightforward. O 


Corollary 3.4.7 If A is a tabular allegory with a unit, then A is a power 
allegory iff Map(.A) is a topos. 


Proof One direction is immediate from 3.4.4 and 3.4.6. Conversely, if C is a 
topos, then Rel(C) is a division allegory by 2.3.5 and 3.4.2, and it is not hard 
to verify directly that the membership relation on A, as defined in Section A2.1, 
satisfies the identities of 3.4.5. 0 


Not every power allegory is tabular: for a simple counterexample, take the 
full subcategory of Rel(Set ;) whose objects are sets whose cardinality is a power 
of 2. (This example is pre-tabular, since the corresponding category of maps has 
finite products, but it can easily be modified to make it not even pre-tabular.) 
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However, once we split the cores in this example, we obtain an allegory which is 
not only tabular but effective, and this is typical: 


Proposition 3.4.8 Let A be a power allegory. Then 
(i) If K is the class of cores in A, A|K] is a power allegory. 
(ii) Every equivalence in A can be factored as f° f where f is a map. 


(iii) If cores split in A, so do equivalences (and hence AJK] ~ A[S], where S is 
the class of all symmetric idempotents in A). 


(iv) If A is nonempty and effective and its hom-posets have bottom elements, 
then it has a unit. 


Proof (i) Let 6: A% A be a core in A. Define 7 = "6E€,4'N1pa; then 7 is 
a core on PA, and we shall show that it serves as a power object for 6 in A[X]. 
First, we have cam <€4°b6E€4'=6Ea,, and so 


EAT =EANNn LEAN LEAT 


using the fact that both 6 and 7 are cores. Thus €47: % 6 in A[K); we shall 
show that it satisfies the conditions of 3.4.5. First, 


EAT(EaT\Eam)m SEanT S SEA, 
so 7(E47\E4m)a < (Ea \8 Ea). Thus 


m(EaAr\ Eann N (rleanr\Earyn) < (ea \ĵEa)N (Ea \dEa 
< (ea \ĵEA)N (Ea \ Ea)? 
= (€a \EA) N (S EA\ Ea N (EA\EA) 
N(€a\Ea)? 
= fca N lpA = 7; 


using the fact that 6 is a core and the order-preserving properties of —\—. 
Now if ¢: y % 6 is any morphism of A[X], we have 6¢ = ¢, and hence 


EACEA" =dE€4' b'=56=4, 
so'd€4"¢'="¢' by 3.4.6(ii). Thus by 3.1.6 we have 
mg! = (EEA N \pa)'¢! = rca Aint ¢! = ro? i 
Next, we have 
oye” <eavo™ go” =eaTr og” <EA T, 
so TH” < (6\Eam7). And (EaT\g) > (€4\¢) > Tġ? since 7 is a core, so 
Vg\EaT)m(EaT\9)7 21 OI OY =op a N, 


as required. 
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(ii) Let 6 be an equivalence. Then we have 
POTS) = (dE 4)(Ea l8) < (818) = 6, 
but also 
5 = (816) < (818) FPS = (EEE Ea)" < (Blea) = EVN, 


so we may take f to be "6". 

(iii) Again, let 6 be an equivalence, and factor it as f° f. Then f f° is a core, 
so we can split it (by 3.3.3(ii)) as gg° with g°g = 1. Let ọ = g°f; we have 
Od = fag? f = FF FPF = 66 = 0 and ġe? = 9 f F° g = 9° G9°9 = 1, so Po is a 
splitting of 6 (and ¢ is a map). The second assertion follows from (i) and 3.3.12. 

(iv) Let A be an object of A, and let 04 denote the bottom element of 
A (A, A). Since composition in A preserves bottom elements, it is easy to see 
that ra = 04/04 (or, if you prefer, 14/04) is the top element of A(A, A), and 
hence that it is an equivalence. If we split it as g°g where g: A— U (using 
3.3.3(iii)), then ly is the top element of A(U,U), since if œ: U % U then 


p = 99° 099° < gTAG? = 99° 99° = ly . 


So U will be a unit provided every object of A admits a map to it; but A admits 
a map to U, and if we take A = PB for some B then every object admits a map 
to A, viz. the name of the zero morphism to B. o 


Corollary 3.4.9 IfA is a nonempty pre-tabular power allegory whose hom- 
posets have bottom elements, then Map(A[K]) is a topos. 


Proof By 3.4.8(i), A[K] is a power allegory, and by 3.3.6(ii) it is tabular. 
By 3.4.8(iii) it is effective, and its hom-posets inherit bottom elements from <A; 
hence it has a unit by 3.4.8(iv). So the result follows from 3.4.7. o 


We conclude this section with a non-elementary result which enables us to 
recognize certain particular allegories as being power allegories. First we need 
some new terminology. We call a morphism ¢: A % B in an allegory a partial 
map if it satisfies $¢° < 1g, but not necessarily ¢°ġ > 14. We shall use the 
notation ¢: A — B for partial maps; it is easy to see that in Rel(C) partial 
maps are exactly what we called by that name in Section A2.4. And we shall 
call an object G of an allegory a strong separator if, given any two morphisms 
o,y: A% B with og Æ, there exists not merely a morphism but a partial map 
x: G — A with ox £ Yx. 


Lemma 3.4.10 IfG is a strong separator in an allegory A, then I(G) is a 
strong separator in A[E], for any class E of symmetric idempotents in A. 
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Proof Let ¢,~:€% ô be two morphisms of A[E] with ¢ 4 %. Then there 
exists x: G — A in A (where A = dom e) with dx # px; now ex is a morphism 
I(G) % ein AJE] which has distinct. composites with ¢ and Y, and 


(ex)(ex)® = exx°e < e =e, 
So €x is a partial map. 0 


Theorem 3.4.11 Let A be an effective positive geometric allegory with a strong 
separator. Then A is a power allegory. 


Proof Let G be a strong separator, and let A be an arbitrary object of A. 
Form the coproduct, H of copies of G indexed by A (G, A), and let 6: H 9 A be 
the unique morphism satisfying Ong = ¢ for all 6: GY A (here ng is, as usual, 
the ¢th coprojection). (6|@) is an equivalence (recall that a geometric allegory is 
necessarily a division allegory), so we can split it as g°g for some g: H — K; let 
c€ = 6g°: K % A. We shall show that this latter morphism satisfies the conditions 
of 3.4.5. 
First, we note that eg = 0g°g > 0 but also 


eg = 0(0|0) < A(A\8) < 9, 
so eg = 6. Now 
Ag°(e\e)g = €(e\e)g < €g = 9, 
so g°(e\e)g < (A\9), whence g°(ele)g < (0/0) and so 
(ele) = 99° (ele)99° < 9(8|0)9° = 99°99" = 1k . 

Next, suppose we have a factorization e = 6h where k: K — L is a map and 
(6|6) = 1z. Since h°h is symmetric and hh°h = h, we have h°h < (hjh) < (ele) = 
lx, so h°h = 1x. Suppose hh° Æ 1,; then there is a partial map x: G — L with 
hh°x # x. But now 6x: G % A, so we can factor dx as Ons, = dhgns, = hf, 
say, where f is a map. Thus 6 > dyx° = dhfx°, so hfx° < (6\6); but similarly 
ô > dxf°h®, so xf°h? < (6\d) and hence hfx° < (6/6) = 1z. Now we have 


xSxF K hf < hf; 
we claim that x = Af (x° xN 1a). For we have 
hf(x xN 1a) <hAfXXSX 


and 

hf(xX xN 1G) 2 X0CXN1e) Z XNX=X. 
So hh°x = hh°hf (x° xNla) = hf (x xN la) = x, contradicting our assumption. 
Thus we have shown that, if there is a factorization € = 6h with (6|6) = 1, then 
necessarily h is an isomorphism. 
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Now let ¢: B % A be an arbitrary morphism of A, and consider the morphism 
w:B+K A defined by yn; = ¢, Yno = e. By splitting the equivalence (|W) 
as k°k and arguing as before, we can factor y as wk, where (wlw) = 1. But then 
wkni = €, so kn, is an isomorphism, and hence ¢ = wkng = €(kn)°kna; that 
is, we can factor ¢ as el where l is a map. Now (€\¢) > l, and also @l° = ell? < e 
whence (¢\¢) > 1°, so 


(o\e)(e\d) = II 2 1, 


as required. Oo 


Corollary 3.4.12 If A is a pre-tabular positive geometric allegory with a 
strong separator, then Map(A|S)) is a topos. 


Proof Putting together all our results on splitting symmetric idempotents, 
we know that .A[S] is tabular, effective and positive geometric, and that it has a 
strong separator. Hence by 3.4.11 it is a power allegory, and by 3.4.8(iv) it has 
a unit. So the result follows from 3.4.7. O 


In particular, we may now prove a long-promised result: 
Corollary 3.4.13 For any frame L, Set(L) is a topos. 


Proof By definition, Set(L) is the category of maps of Mat(L)[S], and we 
know that Mat(L) is pre-tabular and positive geometric, so it is enough to show 
that Mat(L) has a strong separator. But any singleton set {+} will do for this; 
for if ọ and 4% are two distinct B x A matrices, they must differ at some particular 
entry, say $(b9, ao) Æ (bo, ao), and then the matrix 


x(a, *)=1 if a = ap 
=0 otherwise 


defines a map (not just a partial map) {x} — A satisfying dx # wx. 0 


We shall see in B3.1.9 that there is a very large class of toposes which may 
be constructed in this way (that ìs, by splitting all symmetric idempotents and 
then cutting down to the subcategory of maps) from relatively ‘simple’ positive 
geometric allegories having sets as objects, disjoint union as coproduct and a 
singleton set as strong separator. 


Suggestions for further reading: Carboni et al. [225, 226}, Freyd & 
Scedrov [381]. 


A4 
GEOMETRIC MORPHISMS - BASIC THEORY 


A4.1 Definition and examples 


In Chapters Al and A2 we took the view that, whenever we introduced a new 
class of categories, there should be a corresponding class of functors, namely 
those which preserve the structure involved in the definition of the given class of 
categories. Thus the notion of topos, when we introduced it, came with the asso- 
ciated notion of logical functor. However, an important aspect of topos theory, 
and one which particularly reflects the confluence of geometrical and logical influ- 
ences in the origins of the subject, is the fact that it has another natural notion 
of ‘morphism of toposes’ associated with it: the notion of geometric morphism. 
We shall see in 4.1.18 below (and again in B2.2.7) that geometric morphisms are 
also ‘structure-preserving functors’ in a suitable sense, although the structure 
they preserve is different from that involved in our original definition of a topos. 

In fact a geometric morphism is not a single functor but an adjoint pair of 
functors. One could, of course, define a geometric morphism to be a functor 
possessing an adjoint rather than an adjoint pair: since the adjoint of a functor, 
if it has one, is unique up to canonical natural isomorphism, this would not 
make any essential difference, but in practice it is generally convenient to have 
both the left and right adjoints explicitly specified as part of the definition of a 
geometric morphism, and we shall take this approach. 


Definition 4.1.1 (a) Let E£ and F be toposes. A geometric morphism 
f: F — E consists of a pair of functors f,: F — E (the direct image of f) and 
f*:& — F (the inverse image of f) together with an adjunction (f* 4 f+), such 
that f* is cartesian (i.e. preserves finite limits). 

(b) Let f and g: F — E be geometric morphisms. A geometric transformation 
a: f — g is defined to be a natural transformation a: f* — g*. 


A few comments on this definition are in order. We do not have to demand, 
in part (a), that the direct image functor f, should be cartesian, since this 
follows from its possession of a left adjoint — indeed, it preserves whatever limits 
exist in F. Similarly, the inverse image f* preserves whatever colimits exist in 
E. The fact that a geometric morphism f is regarded as pointing in the same 
direction as f, is a purely conventional choice; we shall see later that, in a 
sense, fẹ ‘embodies the geometric aspects’ of the morphism f, and f* embodies 
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its algebraic aspects. There is a similar arbitrary choice of direction involved 
in part (b) of the definition: in general, if we are given adjunctions (f* 4 fa) 
and (g* 4 g,) between the same two categories, then natural transformations 
f* — g* correspond bijectively to natural transformations g, — f,. Both choices 
are justified on pragmatic grounds: they minimize the amount of dualization we 
shall have to do later on in the discussion of particular examples (see, for instance, 
4.1.4 and 4.1.11 below). 

It is also possible to consider geometric morphisms and transformations 
(defined in exactly the same way) between quasitoposes rather than toposes. At 
various points in this chapter, we shall prove results about geometric morphisms 
between quasitoposes rather than toposes (when we can do so without significant 
extra effort), but we shall not develop the full theory of such morphisms here. 

We shall write Top for the 2-category of toposes, geometric morphisms and 
geometric transformations. Although this is a 2-category, we shall generally treat 
it as if it were a bicategory: that is, we shall not expect diagrams of toposes and 
geometric morphisms to commute ‘on the nose’, but only up to (specified) invert- 
ible geometric transformations. For instance, when we refer (as we frequently 
shall) to the slice category Top/E, we shall mean the 2-category whose objects are 
morphisms f: F — E in Yop, whose 1-cells (g: G — E) — (f: F — E) are pairs 
(h, a) where h is a geometric morphism G — F and a is a 2-isomorphism g — fh, 
and whose 2-cells (h, a) — (k, 8) are geometric transformations y: h — k such 
that fyoa = 8. Normally, we shall take such details for granted, but we shall 
spell them out in full (as here) whenever there is a serious danger of confusion. 
(See also Section B1.1 for a more detailed discussion of our general attitude to 
diagrams in 2-categories. ) 

The primary justification for studying geometric morphisms is that examples 
of such morphisms occur ‘all over the place’ when we look at particular 
examples of toposes. We devote most of this section to introducing some of 
these examples. 


Example 4.1.2 Let f: A— B be a morphism in a topos £. By 2.3.2, the 
slice categories €/A and €/B are toposes, and by 2.3.3 the pullback functor 
f{*: €/B — €/A is the inverse image of a geometric morphism, whose direct 
image is the functor denoted I, in 1.5.3. In this way the assignment At> €/A 
becomes a functor E — Yop (but only in the ‘pseudo’ sense indicated above: 
if g: B — C is another morphism of £, we do not expect to have an equality 
(gf)* = f*g*, but only a canonical isomorphism between these functors). In 
fact, recalling the isomorphism € = €/1, we can regard A > £€/A as a functor 
E — Top/E. 


The geometric morphisms which arise as in Example 4.1.2 are rather special, 
in that their inverse image functors are logical. In 2.3.8 above, we saw that this 
does not quite suffice to characterize them amongst geometric morphisms; in 
Section C3.5 we study what can be said in general about geometric morphisms 
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with logical inverse images. Incidentally, 2.3.9 shows that the only geometric 
morphisms with logical direct images are equivalences. 


Example 4.1.3 Let f: F — € be a geometric morphism, and A any object 
of €. Then we have a geometric morphism (which we shall denote by (f/A)) 
from the slice category F/f*A to €/A: explicitly, (f/A)* is simply f* applied to 
morphisms of £ with codomain A, and (f/A).(g: B — f*A) is the left vertical 
morphism in the pullback 


CSF fB 


f.(9) 


A— 5s ffA 
where 7 is the unit of (f* 4 f4). It is straightforward to verify that (f/A)* is 
cartesian and left adjoint to (f/A),. Note also that, for any morphism g: A — B 


in E, the square 
F/f*A—>F/f*B 


(f/A) (f/B) 


£€/A —> €/B 


(where the horizontal morphisms are defined as in 4.1.2 from f*g and g 
respectively) commutes up to natural isomorphism. 


We shall say that a property of geometric morphisms is stable under slicing 
if it is inherited by morphisms of the form (f/A) from any morphism f which 
possesses it. 


Example 4.1.4 Let f:C — PD be a functor between small categories. Then 
composition with f defines a functor f*: [D, Set] — [C, Set], which has adjoints 
on both sides, the left and right Kan extensions along f: for example, the right 
Kan extension lim; sends a functor F: C — Set to the functor whose value at 
an object B of D is the limit of the diagram 


CA | eee ee 


(here (Bf) is the comma category whose objects are pairs (A,¢) with 
$: B — fAin D, and U is the forgetful functor from this category to C). Thus 
f* is the inverse image of a geometric morphism [C, Set] — [D, Set], whose 
direct image is lim;. Moreover, any natural transformation a: f — g between 
functors C — D induces a natural transformation f* —> g* (whose value at F 
is the natural transformation Fa: Ff — Fg), i.e. a geometric transformation 
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(lim ;, f*) — (lim,, 9*). Thus the assignment C + [C, Set] can be made into a 
functor (that is, a 2-functor) from the 2-category Cat of small categories, functors 
and natural transformations into Top (in fact into Lop/Set). 


The particular case of 4.1.4 in which the categories C and D are discrete 
corresponds, via the equivalence Set? ~ Set/B of 1.1.6, to the particular case 
E = Set of 4.1.2. In B2.3.22 we shall generalize Example 4.1.4, replacing Set 
by an arbitrary topos € (and the notion of small category by that of an inter- 
nal category in €); for the present, we may note that it admits an immediate 
generalization from Set to Set; (taking C and D to be finite categories). 

We note that the geometric morphisms which arise as in 4.1.4, though not as 
special as those of 4.1.2, still have the property that their inverse image functors 
have left adjoints as well as right adjoints. We call a geometric morphism f 
essential if it has this property; we normally write fı for the left adjoint of f*. 
With the aid of this notion, we can prove a partial converse to 4.1.4: 


Lemma 4.1.5 LetC andD be small categories such that D is Cauchy-complete 
(cf. 1.1.10). Then every essential geometric morphism f: |C, Set] — [D, Set] is 
induced as in 4.1.4 by a functor C > D. 


Proof Since fi has a right adjoint f* which preserves epimorphisms, it is 
straightforward to verify that fı preserves projectives; similarly, the fact that f* 
preserves coproducts implies that f, preserves indecomposable objects. Hence by 
1.1.10 f; maps representable functors in (C, Set] to representables in [D, Set], i.e. 
it restricts via the Yoneda embeddings to a functor fo: C — D. The adjunction 
(f: 4 f*), and the Yoneda lemma, then yield isomorphisms 


f*(F)(A) & [C, Set] (C (A, —), f*(F)) 
[D, Set] (D (fo(A), —), F) 
= F(fo(A)) 


ie I 


for every F: D — Set and every A € ob C; so f* is identified with the functor 
‘compose with fo’, and hence fi and f, are respectively isomorphic to lim jf and 
lim j- l 


We note that 4.1.5, like 4.1.4, can be made 2-functorial: if a: f >g is a 
geometric transformation between essential geometric morphisms |C, Set] = 
[D, Set], then œ induces a natural transformation g) — fı, which restricts 
(since the Yoneda embeddings are contravariant) to a natural transformation 
fo — go. In B3.2.8(c) we shall characterize arbitrary geometric morphisms 
between toposes of the form [C, Set]: they do not all arise as in 4.1.4, but they 
may all be constructed from certain ‘generalized functors’. (Cf. also 4.1.10 below.) 

We now return to our list of examples of geometric morphisms. 


Example 4.1.6 Let G be a (nearly discrete) topological group. In 
Example 2.1.6 we noted that the inclusion Cont(G) — [G, Set] is cartesian 
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and has a right adjoint; so it is the inverse image of a geometric morphism 
IG, Set] — Cont(G). 


Example 4.1.7 Let f: G — H bea continuous homomorphism between topo- 
logical groups. If we regard f as a functor between small categories, then it 
induces a functor f*: [H, Set] — [G, Set] as in 4.1.4; and it is easy to see that this 
functor maps continuous H-sets to continuous G-sets, since if B is an H-set then 
the stabilizers of points of f* B are just the inverse images under f of their stabi- 
lizers as points of B. Thus we can regard f* as a functor Cont(H) — Cont(G); 
as such, it is clearly still cartesian, and it has a right adjoint obtained by 
applying lim; and then coreflecting into Cont(H). In this way the assignment 
G ++ Cont(G) becomes a functor from the category Gp(Sp) of topological 
groups to Top. 


Example 4.1.8 Let (C,T) be a small site, as defined in 2.1.9. The inclusion 
functor Sh(C,T) — |C°?, Set] has a cartesian left adjoint (the associated sheaf 
functor — this is a special case of a result which we shall prove in 4.4.8 below); 
so it is the direct image of a geometric morphism. 


Example 4.1.9 By 1.4.7, there is (up to canonical isomorphism) exactly one 
cartesian and cocartesian functor A: Set; — E for any topos E, namely the 
functor which sends an n-element set to the n-fold copower of 1. If it has a right 
adjoint T, then for any object A of E we have 


TA & Set; (1, PA) & £ (Al, A) & £ (1, A); 


taking the right-hand side as a definition, it is easy to verify that it does define a 
functor € — Sets right adjoint to A, provided £ (1, A) is finite for every object 
A of E. But since E is cartesian closed, this is equivalent to requiring that every 
hom-set € (A, B) & E (1, BÂ) be finite. Thus we have shown that, up to unique 
isomorphism, a topos admits at most one geometric morphism to Sets, and it 
admits such a morphism iff it is locally finite (i.e. has finite hom-sets). A similar 
argument shows that there is at most one geometric morphism E — Set for 
any E, and such a morphism exists iff E is locally small and has arbitrary set- 
indexed copowers of 1. (The latter condition does not imply that E£ has all small 
coproducts; compare Example 2.1.7.) 


Example 4.1.10 Let C and D be small cartesian categories, and f:C — D 
a cartesian functor. We shall show that in this case the left Kan extension 
functor lim ¢: [C°P, Set] — [D°P, Set] is also cartesian, so that it is the inverse 
image of a geometric morphism (D°P ,Set] — [C°P, Set], whose direct image is 
f* (compare 4.1.4). To verify this, note that for any B € ob D, the functor 
lim ¢(—)(B): [(C°P, Set] — Set may be described as the composite 


y" lim 
[C°P, Set] —> [(B] f)°?, Set] —> Set 
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where U:(B\|f) -C is the forgetful functor, as before. Now U* is cartesian, 
since it has a left adjoint, so the composite will be cartesian (by a well-known 
property of filtered colimits in Set) provided we can show that the category 
(Bf)? is filtered: But this follows easily from the cartesianness of f: it is 
nonempty because there is a morphism B — 1 & f1, and if (¢: B — fA) and 
(¢': B — fA’) are any two objects of (B | f), then 


p DP pax fale (Ax A) 


is a third object which maps to both of them. Similarly, if y and a’: A — A’ both 
define morphisms in (B | f) from (¢: B — fA) to (¢’: B — fA’), then ¢ factors 
through fy”, where y”: A” — A is the equalizer of and w’. Thus (B| f)°? 
is filtered, and so lim ;(—)(B) is cartesian; but from the definition of limits 
in [D°P, Set] this implies that lim is cartesian. Note that, since we are here 
dealing with contravariant functors, a natural transformation a: f — g between 
cartesian functors C — D induces a natural transformation g* — f* and hence 
a natural transformation lim p — lim ,; so, if we write Cart for the 2-category of 
small cartesian categories, cartesian functors and natural transformations, the 
assignments C + [C°P, Set] and f + (f*, limp) define a functor Cart’? Top 
(where the superscript ‘op’ indicates that 1-cells but not 2-cells are reversed). 


Example 4.1.11 Let f: X —Y be a continuous map of topological spaces. 
Then we may regard f~! as a functor O(Y) — O(X). From the fact that f7! pre- 
serves arbitrary unions and finite intersections, we see that if F: O(X)°? — Set 
is a sheaf on X then the composite F o f~!: O(Y)°? — Set is a sheaf on Y, i.e. 
the functor (f~!)* : [(O(X)°P, Set] — [O(Y)°?, Set] maps the subcategory Sh(X) 
into Sh(Y). We write f.: Sh(X) — Sh(Y) for the restriction of (f~')*. As a 
functor between presheaf categories, (f~!)* has a cartesian left adjoint lim ,-1, 
by 4.1.10; this adjoint does not in general map sheaves to sheaves, but by com- 
posing it with the associated sheaf functor [(O(X)°?, Set] — Sh(X) (cf. 4.1.8) we 
obtain a cartesian left adjoint f*: Sh(Y) — Sh(X) for f,, and hence a geometric 
morphism Sh(X) — Sh(Y). Once again, this construction is functorial — that 
is, X + Sh(X) defines a functor Sp — Top. 


It was, of course, Example 4.1.11 which motivated both the definition and 
the name of geometric morphism. We shall return to it in C1.4.3, where we shall 
see that the inverse image f* has an alternative (and simpler) description in 
terms of local homeomorphisms, and also that there is an appropriate notion of 
2-cell in Sp to which the above functor can be extended; and in Section C2.3 we 
shall see how the example may be extended to a suitable notion of ‘continuous 
functor’ between more general sites. 


Example 4.1.12 Let F: € — F be a cartesian functor between toposes, and 
let GI(F) be the topos obtained by glueing along F, as in 2.1.12. The two 
projection functors Pi: GUF) — E and Pz: GUF) — F are both cartesian, and 
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both have right adjoints: the right adjoint R, of P, sends an object A of £ to 
(A, FA,1lra), and the right adjoint Rz of Po sends B to (1,B,B — 1 = F1). 
Thus we have geometric morphisms from both € and F to GI(F’). Note that the 
composite 


E ——> GIF) — F 
ìs simply F; thus we have shown that any cartesian functor between toposes 
can be factored as a direct image functor followed by an inverse image functor. 
Note also that the functor P, is logical, by the constructions of 2.1.12 (although 
P is not in general); in fact, as we shall see in 4.5.6 below, E is equivalent in 
Yop/GI(F) to the slice category GI(F)/(1,0,0 - 1). 


Remark 4.1.13 Occasionally in this book (particularly in Part B), we shall 
wish to consider adjunctions between toposes of which the left adjoint preserves 
pullbacks (and hence all finite connected limits, by 1.2.9), but not the terminal 
object. We shall call these pre-geometric morphisms, and use the same notation 
for them as for geometric morphisms. (In [1041], they are called ‘partial geometric 
morphisms’.) We note that if f: F — E is a pre-geometric morphism, then we 
may factor it as 


a 


F 2s F/I tek E 
where I = FQ), gr is the pre-geometric morphism defined by gj = Ly; and 
= I*, and f is a geometric morphism: specifically, f* (A) is f* applied to the 
ae morphism A — 1, as in 1.2.9, and IA (B — I) is the pullback of 


f.(B) 


m 
taan 
where 7 is the unit of ( f* 4 f,). Thus results about pre-geometric morphisms can 
usually be deduced from the corresponding results about geometric morphisms 
and the particular case of morphisms of the form gy. 


We conclude this section with a number of unrelated results about geometric 
morphisms, which do not fit into the general development of the theory in the 
rest of this chapter, but which deserve to be mentioned at this stage. First we 
prove a simple but important result about geometric morphisms and natural 
number objects. 


Lemma 4.1.14 Let f: F —E be a geometric morphism. If E has a natural 
number object, then so has F. 


Proof If N isa natural number object in £, then it follows from Lemma 2.5.6(i) 
that f*N is a natural number object in F. QO 
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The converse of 4.1.14 is not true in general: if f: F — € is a geometric 
morphism and F has a natural number object, there is no reason why € should 
have one. For a simple counterexample, take F to be Set and E to be the topos 
obtained by glueing along the inclusion functor Set; — Set (which does not 
have a natural number object, since Sets admits a geometric morphism to it). 
However, in the particular case when f is a surjection (which we shall study in 
the next section), then it will follow from 4.2.1(v) below that we can construct 
a natural number object for € from one for F. 

We have not said very much in this section (and will not say much in the 
rest of this chapter) about the 2-categorical structure of Lop; detailed discussion 
of this aspect of topos theory is postponed until Part B. However, there is one 
simple result which might as well be proved now. In general, the hom-categories 
Top (F,€) are not well-endowed with limits or colimits, but they do have the 
very minimum of such structure: 


Lemma 4.1.15 Top is locally Cauchy complete; that is, its idempotent 2-cells 
split. 


Proof Let f: F— E be a geometric morphism and €: f > f an idempo- 
tent geometric transformation. Then for each object A of E the morphism 
ca: f*A — f*A is idempotent; since F has equalizers, we can split it as f*A > 
g* A> f*A. From the naturality of €, it is easy to see that g* becomes a functor 
E — F, and that it preserves finite limits. Now the mate €: f, — f, of € is also 
idempotent, and so we can similarly split €g : f+B — f+B as fB > g,B — f,B 
for each object B of F. Finally, morphisms A — g,B correspond to morphisms 
h: A — f,B satisfying Egh = h, and hence to morphisms h: f*A — B satisfying 
hea = h, which in turn correspond to morphisms g*A — B. So the pair (9., 9*) 
is a geometric morphism F — E, which is clearly a splitting for € in Top (F, €). 

Oo 


It is also appropriate at this point to say something about Beck—Chevalley 
conditions in Top. Given a commutative square (up to isomorphism) 


-4 6 


H 
f 
F ——-> E 


of toposes and geometric morphisms, the natural isomorphism k*g* = h*f* 
transposes to yield a canonical natural transformation @: 9* f, — k.h* between 
functors F =3 G. We say that the square satisfies the Beck-Chevalley condi- 
tion if 0 is an isomorphism; we shall also be interested in the condition that 
it is pointwise monic (i.e. 9g is monic for each object B of F), which we shall 
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call the weak Beck-Chevalley condition. (In practice, it is usual to consider the 
Beck—Chevalley condition only for pullback squares in Top; but the definition, 
and the two preliminary results to be proved here, do not require the assumption 
that the square is a pullback.) We shall have a good deal to say about Beck~ 
Chevalley conditions in Chapter C3, but there are a couple of results about them 
which do not involve any of the machinery of Part C, and so might as well be 
given now. The first is a simple stability property: 


ee 
Vay 


satisfies the (weak) Beck-Chevalley condition. Then so does the square 


Lemma 4.1.16 Suppose 


Mm <z S 


H/h*f*A iaa G/g*A 
h/f*A g/A 
f/A 


FIF A ——— > E/A 
for any object A of E. 


Proof Let p: B — f*A be an object of F/f*A. We have pullback squares 


gC ——>gfıB and D——————>k,h*B 


g*A —> of, f*A g* A ——> k,k*g* A S Kh FTA 


(the former since g* preserves pullbacks) whose left vertical morphisms are 
respectively the images of p under (g/A)*(f/A). and under (k/g* A),(h/f*A)*. 
A straightforward diagram-chase shows that if the comparison maps between the 
right-hand sides of the squares are isomorphisms (respectively, monic), then the 
same is true of the induced map between their top left corners. O 
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The second result is more substantial — and more surprising: it says (approx- 
imately) that the weak Beck-Chevalley condition is equivalent to the restriction 
of the strong one to subterminal objects. 


Proposition 4.1.17 For a commutative square 


ae 


F ——— E 
in Top, the following are equivalent: 


(i) The square satisfies the weak Beck-Chevalley condition. 
(ii) For any monomorphism B' >— B in F, the naturality square 


Ap: 
g* fB! -2> k,h*B’ 


0p 
g“ fB —>> k,h*B 


is a pullback. 
(iii) For each object A of E, the square 
k/g*A 
H/{h* f* A ——> G/g*A 
h/ fra g/A 
F/A 


Ff Ass ESA 


satisfies the Beck-Chevalley condition for subterminal objects, i.e. the nat- 
ural transformation (0/A) associated with this square has the property that 
(0/A)m is an isomorphism for each subterminal object m: B > f*A of 
F/f*A. 


Proof (i) (ii) follows from 1.6.9, since the functors g* fẹ and k,h* are carte- 
sian. 1.6.9 also shows that these conditions imply the particular case A = 1 
of (iii); but we have seen in 4.1.16 that condition (i) is stable under slicing, 
and hence it implies the general case of (iii). So it remains to prove that (iii) 
implies (ii). 


A4.1 DEFINITION AND EXAMPLES 171 


Given a monomorphism m: B’ — B in F, set A = f,B and form the pullback 


C ———> B’ 


€B 
f° A ———— B 
where e is the counit of (f* 4 f,). The triangular identities for the 
adjunction, and the fact that f, preserves pullbacks, tell us that (f/A).(n) 
is simply f,(m): f,B’ — fB; so (g/A)*(f/A).(m) is just g*f.(m). But 
(k/g*A)«(h/f*A)*(n) is by definition the left vertical morphism in the diagram 


Disa iC SS EB! 
k,h*(n) k.h*(m) 


g* A ——> k,k*g* A S k,h* f* A ——> k,h* B 


where both squares are pullbacks, and the first factor of the bottom edge is the 
unit of (k* 4 ką). Now a straightforward diagram-chase shows that the bottom 
composite is simply 0 g, and that (0/A), is the factorization through the pullback 
of 0g: g* f, B’ — k,h* B’. So the assertion that (6/A),, is an isomorphism implies 
that the naturality square for 0 at m is a pullback. o 


Finally in this section, we outline one sense (not the only one) in which 
geometric morphisms — or at least inverse image functors — can be viewed as 
the structure-preserving maps between toposes: namely the allegorical sense. If 
f: F — Eis a geometric morphism, then f*: E — F is a regular functor (though 
f+ is not, in general), and so extends to a unital morphism of allegories which we 
shall denote F: Rel(€) — Rel(F), by the remarks after 3.2.10. Further, since f* 
preserves coproducts and images, F preserves unions of morphisms to the extent 
that they exist in Rel(€); that is, it is a morphism of union allegories (and, if 
the toposes concerned have arbitrary set-indexed unions of subobjects, then it 
is a morphism of geometric allegories). We cannot hope to obtain an elementary 
converse result; but if we restrict our attention to the class of allegories studied 
in 3.4.11, then we have 


Proposition 4.1.18 Let A and B be allegories satisfying the hypotheses of 
3.4.11. Then unital morphisms of geometric allegories A — B correspond 
bijectively to inverse image functors Map(A) — Map(B). 


Proof We have seen that an inverse image functor induces a unital morphism 
of geometric allegories. Conversely, suppose given such a morphism F: A —> B, 
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and let f* denote its restriction to Map(A) (which of course takes values 
in Map(B), since F preserves adjunctions). Since F preserves arbitrary (set- 
indexed) unions, f* preserves arbitrary unions of subobjects; hence, by an 
infinitary extension of the argument after 1.4.4, it preserves arbitrary coprod- 
ucts. As a regular functor between effective regular categories, it also preserves 
coequalizers of equivalence relations; and it preserves the construction of the 
equivalence relation generated by a parallel pair u,v: A = B (that is, the small- 
est equivalence relation on B through which (u, v) factors — or equivalently the 
kernel-pair of the coequalizer of u and v), since the latter may be constructed as 
the image of a map to B x B from a countable coproduct of objects constructed 
from u and v by finite limits. So f* preserves all coequalizers, and hence all 
set-indexed colimits. 

Now if G is a strong separator for A, as defined before 3.4.10, then it is readily 
seen that the subobjects of G in Map(.A) form a separating set for Map(.A) (they 
form a set because Map(A), as a subcategory of A, is locally small, and it has a 
subobject classifier). So we are in a position to apply the Special Adjoint Functor 
Theorem (cf. C2.2.10) to obtain a right adjoint f, for f*; thus f* is the inverse 
image of a geometric morphism Map(B) — Map(.A). go 


Two cautionary remarks about the foregoing result should be made. First, the 
reader should not be deceived into thinking that 4.1.18 represents the origin of 
the term ‘geometric morphism’; rather, the term ‘geometric allegory’ was chosen 
in order to fit with it. Secondly, while 4.1.18 may be viewed as ‘explaining’ the 
notion of geometric morphism, it does not provide a similarly satisfactory expla- 
nation of the notion of geometric transformation; as we remarked after 3.2.10, 
natural transformations between regular functors do not correspond to any very 
obvious notion of ‘2-cell’ between the corresponding morphisms of allegories. 


Suggestions for further reading: Carboni et al. [225, 226], Rosebrugh & 
Wood [1041], Wraith [1236]. 


A4.2 Surjections and inclusions 


A geometric morphism f: F — E, like any other adjunction, induces a monad 
on one of the categories involved (in this case €) and a comonad on the other. 
We shall defer discussion of the monads induced by geometric morphisms until 
later; in this section our concern is with the comonads. 

We note first that the functor part of such a comonad must be cartesian, since 
it is the composite of the direct and inverse images of f. Of course, we shall use 
the term cartesian comonad for a comonad with this property. We shall see 
shortly that every cartesian comonad on a topos arises from a geometric mor- 
phism. First, we prove an omnibus theorem which indicates how much categorical 
structure is inherited by the category of coalgebras for such a comonad. 
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Theorem 4.2.1 Let E be a cartesian category, and G = (G,¢€,6) a cartesian 
comonad on E. Let Eg denote the category of G-coalgebras in E. 
(i) If E is properly cartesian closed, so is Eg. 
(ii) If € is locally cartesian closed, so is Eg. 
(iii) If € has a subobject classifier, so has Eg. 


(iv) If E is coregular and has a weak subobject classifier, then Eg has a weak 
subobject classifier. 


(v) If E has a natural number object, so has Eg. 
In particular, 
(vi) If E is a topos (resp. a quasitopos), so is Eg. 


Proof (i) & is cartesian, since the forgetful functor Eg — E creates finite 
limits. Now let (A, œ) and (B, 8) be G-coalgebras, and let E — G(B“) be the 
equalizer of 


G(B*) 


G(B4) ——_—_—_—___—_—__—_——_—_> GB’) 


Opa G(GB?) 
GG(B4) ——> G(GB°4) 


where 0: G(B4) — GBA is the exponential comparison map for G, i.e. the 
transpose of 
G(ev) 


G(B4) x GA = G(B4 x A) —> GB. 


If we equip the vertices of the above diagram with cofree coalgebra structures, 
then the edges are all coalgebra homomorphisms; so Æ acquires a coalgebra 
structure which we shall denote n: E — GE. Now let (C, y) be another coalgebra; 
then coalgebra homomorphisms (C, y) — (G(B“), 8ga) correspond bijectively to 
morphisms C — B4 in £, and hence to morphisms C x A — B. But the statement 
that f: C x A— B is a coalgebra homomorphism (C, y) x (A,a) — (B,) is 
equivalent to saying that the diagram 


f pA 


C22 B* 5s CR* 


y GB 


GF 
GC Be G(B4) Í> Gpes 
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commutes, which in turn is equivalent to saying that the corresponding coalgebra 
homomorphism (C, y) — (G(B4), 6) factors (uniquely) through the subcoalgebra 
(E,n). So (E, n) has the universal property of an exponential (B, G)(4™ in Eg. 

(ii) Let (A, a) be a G-coalgebra. To show that £g/(A, a) is cartesian closed, 
we shall show that it is isomorphic to (€/A)g, for a suitable cartesian comonad 
G’ on €/A, and then appeal to (i). To define G’ at an object f: B — A of E/A, 
we form the pullback 


GB—>»cB ; 
G'f Gf 
Q 


A ——> GA 


this is clearly functorial. Moreover, since eaa = l4, the composite 
eph: G'B — GB — Bisa morphism over A, and thus defines a natural transfor- 
mation ¢' from G’ to the identity. And since G preserves pullbacks, G’G’ B is the 
pullback of GGB along the composite Ga oa: A — GGA; but this composite 
equals 64 0 a, and so the morphism ôg induces a morphism ô}: G’B —> G'G’ B 
over A, which is also natural in f. It is straightforward to verify that «’ and 0’ 
form a comonad structure on G’. Moreover, this comonad is cartesian: G’ pre- 
serves pullbacks since G and the pullback functor a*: €/GA — €/A both do so, 
and it preserves the terminal object by construction. Finally, for any f: B > A, 
there is a bijective correspondence between morphisms f — G’f in €/A and 
morphisms 3: B > GB satisfying Gf o 8B = ao f; and it is again easy to see 
that this restricts to a correspondence between G’-coalgebra structures on f and 
G-coalgebra structures on B which make f a coalgebra homomorphism. 

(iii) Suppose € has a subobject classifier Q. Form the equalizer Og — GQ of 


a ce EN 


Ne 


where 7 is the classifying map of G(T): 1 = G1 — GQ; as in (i), we may equip 
Qg with a coalgebra structure by regarding the vertices of this diagram as cofree 
coalgebras. Now, given a G-coalgebra (A, a), the subobjects of (A, a) in Eg are 
just those subobjects A’ — A of A in € for which the composite of A’ — A with 
a admits a (necessarily unique) factorization a’ through GA’ — GA; but if such 
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a factorization exists then the square 


, a! , 
A’ — GA 


AS GA 
is necessarily a pullback, since a and a’ are split by €4 and €,4’ respectively. If 
p: A— Q is the classifying map of A’ — A, then the composite 7 o Go classifies 
GA' — GA; so, by the above remarks, A’ — A is a subcoalgebra of A iff the 
diagram 


$ 


A ———> 2 


Go 
GA ——> GN 
commutes. But this in turn is equivalent to saying that the coalgebra homomor- 
phism (A,a) — (GQ, ôn) corresponding to @ factors through the equalizer Qg 
defined above; so the latter has the universal property of a subobject classifier 
for Eg. 

(iv) The argument for weak subobject classifiers is identical, except that we 
need to show that cocovers in Eg are precisely those coalgebra homomorphisms 
(A, a) = (B, 8) which are cocovers in E. One direction is easy, since the forgetful 
functor Eg — E preserves epimorphisms and reflects isomorphisms. Conversely, 
suppose f: (A,a) — (B, G) is a cocover in Eg. Form its epic-cocover factorization 


e f 


Æ >——> B 


A 
in £; then since cocovers in € coincide with regular monomorphisms, Gf’ is a 
cocover, and hence we obtain a unique a’: A’ + GA’ making 


e f 


A A’ >——> B 


a a’ B 


G 1 
Cro 5 E 
commute. It is now clear that (A’,a’) is a G-coalgebra, and that 
e: (A, a) — (A’, a’) is an epimorphism in Eç through which f factors; so e is 
an isomorphism, and f is a cocover in E£, as required. 
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(v) Suppose (N,o, s) is a natural number object in €, and let v: N — GN 
be the unique morphism making 


fee a ee 


Gi 22s GN Ss GN 


commute. It follows easily from the uniqueness clause of Definition 2.5.1 that v 
satisfies the commutative diagrams required for a G-coalgebra structure on N. 
Moreover, if (A, a) is a G-coalgebra, and a: 1 — (A,a) and t: (A, œ) — (A, a) are 
coalgebra homomorphisms, then a similar application of the uniqueness clause 
shows that the unique f: N — A making 


[ase SN 


f J 


Awe A 


commute is a coalgebra homomorphism (N,v) — (A,a@). So ((N,v),o,s) is a 
natural number object in Eg. 

(vi) For toposes, this is immediate from (i) and (iii); for quasitoposes, it 
follows from (ii), (iv) and the fact that Eg inherits finite colimits from £. D 


Corollary 4.2.2 For any cartesian comonad G on a topos E, there is a geo- 
metric morphism f:€ — F such that G is (isomorphic to) the comonad induced 


by (f* 4 fa). 


Proof Take Fto be Eg, and let f be the geometric morphism whose direct 
and inverse images are the cofree and forgetful functors. c 


Remark 4.2.3 With the notion of pre-geometric morphism, as introduced ìn 
4.1.13, in mind, we might ask what happens if we relax the cartesianness condi- 
tion in 4.2.2 to the mere preservation of pullbacks (equivalently, by 1.2.9, of finite 
connected limits}. The answer is that the result survives essentially unchanged. 
Let G = (G,<,ĝ) be a comonad on a topos €, such that G preserves pullbacks. 
Then, applying the construction of 4.2.1 (ii) to the terminal G-coalgebra (G1, 61), 
and noting that this construction only requires G to preserve pullbacks, we may 
construct a cartesian comonad G’ on €/G1, whose category of coalgebras is iso- 
morphic to that of G; so this category is a topos, and the cofree and forgetful 
functors form a pre-geometric morphism € — Eg. 
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We have already seen a number of examples of toposes which are comonadic 
over other toposes. We list a few of them here, together with a couple of new 
examples (of which the second is of considerable importance). 


Examples 4.2.4 (a) Let G be a topological group. Then the category 
Cont(G) of Example 2.1.6 is coreflective in [G,Set] and closed under finite 
limits; so it is comonadic over [G, Set] by an idempotent cartesian comonad. 
Thus 4.2.1 provides a new proof that Cont(G) is a topos. Note that in the case 
when the comonad G is idempotent (i.e. 6 is an isomorphism), the construction 
in the proof of 4.2.1(i) may be appreciably simplified: the exponential (B, G)(4-~ 
is just G(B%), since all the morphisms in the diagram defining E are isomor- 
phisms. However, Qg is not simply G(Q), unless we make a further assumption 
about G — see 4.6.6 below. 

(b) Let M be a monoid. Then the functor (—)™ : Set — Set has a comonad 
structure M: the counit «4: AM — A sends a function f: M — A to f(e), where 
e is the identity element of M, and the comultiplication 6,4 similarly sends f to 
the function z + (y+ f(yz)). Moreover, (—)™ is cartesian, because it is right 
adjoint to (—) x M. But it is easy to see that, for a set A, a function A — A™ is 
an M-coalgebra structure on A iff its transpose A x M — A is a right action of M 
on A in the usual sense; thus Sety is isomorphic to the category [M°?, Set], and 
4.2.1 provides a new proof that this functor category is a topos. More generally, 
we shall see in B2.3.17 that for any small category C the topos (C°P, Set] may be 
identified with a category comonadic over the slice category Set/ob C. 

(c) Let F:E€— F be a cartesian functor between toposes. We may 
define a cartesian comonad (G,€,6) on the cartesian product € x F as fol- 
lows: the functor G sends (A,B) to (A,B x FA), the counit €(4,2) is 
(14,71): (A, B x FA) — (A, B), and the comultiplication ô is similarly induced 
by the diagonal map FA — FA x FA. It is easy to verify that a coalgebra struc- 
ture for this comonad, on an object (A, B) of E x F, is essentially the same thing 
as a morphism B — FA, and hence that the category (€ x F)g is isomorphic 
to the category GI( F’) of 2.1.12. Since a cartesian product of toposes is trivially 
a topos, 4.2.1 thus provides a new proof that GI(F) is a topos; and if we take 
€ and F to be quasitoposes rather than toposes, then it establishes the result, 
promised in 2.6.7, that the glueing construction works for quasitoposes too. By 
4.2.3, it also shows that GI(F) is still a topos if (£ and F are toposes and) we 
merely assume that F preserves pullbacks; but this could easily have been verified 
directly, since in this case GI(F) is isomorphic to GI(F), where F: E — F/F1 
is the cartesian functor obtained by factoring F as in 1.2.9. (Rather remarkably, 
this result is best possible: if F: E — F is a functor between toposes such that 
Gi(F) is a topos, then F preserves pullbacks. See [224] for a proof.) 

(d) Let € be a cocomplete, locally small topos with a separating set of objects 
(that is, a Grothendieck topos as defined in Section C2.2). The full subcategory 
E4 of decidable objects of £ is closed under finite limits and arbitrary coproducts 
(this was proved in 1.4.15 except for the infinite coproducts, for which the proof 
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is just like that for finite ones); from this it follows that the full subcategory 
Eqa Of quotients of decidable objects (i.e. objects A for which there exists an 
epimorphism B -» A with B decidable) is closed under finite limits and arbitrary 
colimits. Thus we may apply the Special Adjoint Functor Theorem to obtain a 
right adjoint for the inclusion functor Ega —> E: explicitly, the adjoint sends 
an object A to the union of all those subobjects of A which are quotients of 
decidable objects. Thus we have an idempotent comonad on € whose category 
of coalgebras is isomorphic to Ega, and it is cartesian because Ega is closed under 
finite limits. So we may deduce that Eza is a topos, for any such £. 

We remark that the use of non-elementary means (i.e. the Adjoint Functor 
Theorem) in this example cannot be avoided; for an arbitrary topos E, Ega 
may not be coreflective in E. An example is provided by the topos € obtained 
by glueing along the inclusion functor Sets — Set (cf. 2.1.12). It is easy to 
see that an object (A, B, f: B — A) of this topos is decidable iff f is injective 
(cf. 1.4.16), which forces B as well as A to be finite, and hence that Ega is 
simply the category (2, Sets] whose objects are maps between finite sets. This 
is clearly not coreflective in E€ (although, by 2.1.3, it is a topos). We shall see in 
Section F2.2 that if E is the effective topos Eff, then Ega is not even a topos. (On 
the other hand, if € admits a geometric morphism to a Boolean topos S which is 
bounded in the sense defined in B3.1.7, then we may use the S-indexed version of 
the Adjoint Functor Theorem, proved in B2.4.6, to deduce that Eza is coreflective 
in €, and hence a topos; cf. [521]. In fact, neither of the counterexamples just 
mentioned admits any geometric morphism to a Boolean topos; see 4.5.24 below.) 

(e) Let X be a topological space. We shall describe a cartesian comonad G 
on the topos Set /X, whose category of coalgebras is isomorphic to LH/X; this 
will prove that the latter is a topos, justifying claims made in Sections A1.5 and 
A1.6. Given an object p: E — X of Set/X and a point x € X, we define a local 
section of p at x to be a function s: U — E, where U is an open neighbourhood 
of x in X, such that ps is the inclusion U — X. By a germ of sections of p at 
x we mean an equivalence class of local sections, where two sections s: U + E 
and s’: U’ — E are considered equivalent if they agree on some open set V with 
xz € V C UNU‘; and we write GE, for the set of all germs of sections of p 
at x. Let GE denote the disjoint union of the GE,, xz € X, and Gp: GE — X 
the obvious projection; then it is straightforward to verify that the assignment 
p +» Gp defines a cartesian functor Set/X — Set/X. Moreover, we have a 
natural transformation € from G to the identity functor, such that €p sends a 
germ of sections of p at x to its value at x (defined as the value at x of any local 
section representing the given germ; this is clearly independent of the choice of 
representative). And we have a natural transformation 6: G — GG: if [s]z is a 
germ of sections of p at x (represented by a local section s: U — E, say), then 
by({s]z) is the germ at x of the local section U — GE whose value at y € U is 
[s], (again, it is easy to verify that this germ is independent of the choice of s). 
It is also straightforward to check that (G,¢,6) is a comonad on Set/X. Now 
suppose that ((p: E — X),(a: E — GE)) is a G-coalgebra. Then we may define 
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a topology on E, by specifying that V C E is open iff, whenever e € V, there 
is a representative s: U — E of the germ a(e) whose image is contained in V. 
This is clearly a topology, and it makes p a continuous map E — X; but in fact 
it makes it a local homeomorphism since, for any e € F, the restriction of p to 
the image of a representative of a(e) is a homeomorphism from an open set in Æ 
to an open set in X. Conversely, if we are given a topology on E such that p is 
a local homeomorphism, then we may define a G-coalgebra structure a on p by 
setting a(e) to be the germ at p(e) of any continuous local section of p passing 
through e. (The fact that such sections exist, and that they all have the same 
germ at p(e), follows from the assumption that p is a local homeomorphism.) It 
is now not hard to check that these two constructions are inverse to each other, 
and that a map f: (p: E —> X) — (q: F — X) between G-coalgebras in Set /X 
is a coalgebra homomorphism iff it is continuous for the corresponding topologies 
on E and F. Thus we have established an isomorphism (Set/X)g = LH/X, as 
claimed. 


Further important applications of Theorem 4.2.1 will be found in 4.6.3 and 
B2.3: 17; 

The following characterization of geometric morphisms between toposes of 
coalgebras is sometimes useful. 


Proposition 4.2.5 Let f:F —€ be a geometric morphism, and let G = 
(G,¢,6) and H = (H,n,¢) be cartesian comonads on E and F respectively. 
Then there is a bijection between (isomorphism classes of) geometric morphisms 
g: Fa — Eg making 


f 


F ——-—> E 


commute (where the vertical arrows are the geometric morphisms defined as in 
4.2.2), and natural transformations 8: f*G — Hf* which are compatible with 
the comonad structures, in the sense that the diagrams 


fG 0 > H f* 


eee ead 


f” 
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and P 
poe 
fro Cre 
0 
rea > apa 4% nap 
commute. 


Proof It is well known that natural transformations @ satisfying the above 
commutative diagrams correspond to ‘liftings’ of f* to functors g*: Eg — Fr 
such that 


* 


g 
Eg ———> Fu 


E ————— F 


commutes (where the vertical arrows are the forgetful functors): the correspon- 
dence sends 0 to the functor defined by 


g* (A, a) = (f*A, 04 (e f*a), 
and g* to the transformation whose value at A is 


B H f*ea 7 
f*GA ——> Hf*GA ——> HFA 
where @ is the structure map of the H-coalgebra g* (GA, 64). However, any such 
lifting g* will be cartesian, since the forgetful functor Fy — F creates finite 
limits; and since Eg has equalizers, it will also inherit a right adjoint gẹ, from 
f+ by (the dual of) 1.1.3(i). Thus we have the desired correspondence between 
geometric morphisms and natural transformations 0 as above. D 


In particular, taking f to be the identity morphism on €, we see that if G 
and H are two cartesian comonads on E, then geometric morphisms Eg — Ee 
under € correspond to morphisms of comonads G — H in €. 


Next, we consider the class of geometric morphisms which arise (up to 
equivalence) as in 4.2.2. 


Lemma 4.2.6 Let f: F — E be a geometric morphism. The following condi- 
tions are equivalent: 


(i) E is comonadic over F; i.e. the comparison functor from E to the category 
of coalgebras for the comonad on F induced by (f* 4 f.) is (part of) an 
equivalence. 
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(ii) f* is conservative. 

(iti) f* is faithful. 

(iv) The unit n of the adjunction (f* 4 fa) is monic. 

(v) For every monomorphism m: A! > A in E, the naturality square 


A’ m ffA 


m fa fim 


A zy EFA 


is a pullback. 


Proof (i)e (ii) follows from 1.1.2 (bearing in mind that f* always preserves 
equalizers); (ii) = (iii) holds since toposes are balanced categories; (iii) & (iv) 
is a well-known property of adjunctions, and (iv) © (v) is a particular case of 
1.6.9. D 


A geometric morphism satisfying the equivalent conditions of Lemma 4.2.6 
is called a surjection. We next list some typical examples. 


Examples 4.2.7 (a) Let f: A — B bea morphism in a topos €. The geometric 
morphism €/A — E/B induced by f (as in 4.1.2) is a surjection iff f is an 
epimorphism; for we saw in 1.3.2(iii) that, in any regular category, the pullback 
functor f* is conservative iff f is a cover. 

(b) Let f:C — D be a functor between small categories. If f is surjective 
on objects, then it is easily verified that the functor f*: [D, Set] — [C, Set] is 
conservative; for a natural transformation a between functors D — Set is an 
isomorphism iff ag is bijective for every object B of D. So the geometric mor- 
phism [C, Set] — [D, Set] induced by f as in 4.1.4 is surjective. The converse is 
false; in fact, for the surjectivity of this morphism, it is necessary and sufficient 
that every object of D should be a retract of one in the image of f. The suf- 
ficiency is clear, since if B is a retract of fA then the component at B of any 
natural transformation between functors D — Set is uniquely determined by its 
component at fA. For the converse implication, suppose B is not a retract of 
any fA; let F: D — Set be the functor D(B,—) and F’ — F the subfunctor 
defined by 

F’(C) = {u: B — C | u is not split monic} . 


Then the inclusion F’ — F is not an isomorphism, but is mapped by f* to an 
isomorphism. 

(c) Let f: X — Y bea continuous map of topological spaces. If f is surjective, 
then the geometric morphism Sh(X) — Sh(Y) induced by f as in 4.1.11 is a 
surjection. The simplest proof of this involves the identification of Sh(X) with 
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LH/X, so we shall defer giving it until C1.5.1(ii). Note, however, that once 
we have this identification the surjection of 4.2.4(e) may be identified with that 
induced by the identity map Xz, —> X (where Xq denotes the set X re-topologized 
with the discrete topology), since Set/X is isomorphic to LH/Xq. 


Now let f: F — € be an arbitrary geometric morphism. Write G for the 
comonad on F induced by (f* 4 f,), and g: F — Fg for the surjection of 4.2.2. 
The comparison functor h*: E — Fg satisfies g*h* = f*, and is therefore carte- 
sian since g* creates finite limits. Moreover, since € has equalizers, h* has a right 
adjoint h,, which sends a G-coalgebra (B, 3) to the equalizer of 


f. 
IB =s] B 
Nf, B 


(where 7 is the unit of (f* 4 f,)), and hgs = fẹ by the uniqueness of adjoints 
(or by direct calculation, if you prefer). Thus we have a factorization f = hg of 
our original geometric morphism. 


Proposition 4.2.8 With the notation established above, the counit h*h, — 1 
is an isomorphism. 


Proof Let (B,@) be a G-coalgebra. Since f* preserves equalizers, we have an 
equalizer diagram 


fie 
f*h.(B, B) >—> f* f,B ——S fr fa f* f.B 


ôB 


where ô = f*nf. is the comultiplication of G. But, by the definition of a 
G-coalgebra, we also have a (split) equalizer diagram 


B S RB 
B>—> f*f.B Z ff. fr f.B. 
ôB 


So there is a canonical isomorphism f*h,(B,@) — B; but this isomorphism is 
easily seen to be the result of applying the forgetful functor g* to the counit 
h*h,(B, 8) — (B, 8), and g* reflects isomorphisms. o 


A geometric morphism h satisfying the condition that the counit h*h, — 1 is 
an isomorphism, or the equivalent condition that h, is full and faithful, is called 
an inclusion (though some authors prefer the term embedding). We shall study 
inclusions in greater detail in the next three sections; for the present, we digress 
briefly to note an alternative characterization of them: 


Lemma 4.2.9 A geometric morphism is an inclusion iff its direct image is a 
cartesian closed functor (i.e. preserves exponentials). 
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Proof Since inverse image functors preserve finite products, this is immediate 
from 1.5.9. o 


We are now ready for the second main theorem of this section (the first being 
4.2.1). 


Theorem 4.2.10 Every geometric morphism can be factored, uniquely up to 
canonical equivalence, as a surjection followed by an inclusion. 


Proof The existence of such a factorization has been established by 
Proposition 4.2.8 and the argument preceding it; so we have only to prove the 
uniqueness. Let 


o 
>l 


G 


be any factorization of a given morphism f into a surjection followed by an 
inclusion. Since the comonad on G induced by (k* 4 h,) is trivial, the comonad 
on F induced by (g* 4 9,) is isomorphic to that (G, say) induced by (f* 4 fa). 
But g* is comonadic by 4.2.6(i); i.e. there is an equivalence G ~ Fg making 


pa SG 


Fg 


commute up to isomorphism. The fact that g* reflects isomorphisms then shows 
that 


G ———> € 


Fg 


commutes up to isomorphism. O 


Corollary 4.2.11 A geometric morphism which is both a surjection and an 
inclusion is an equivalence. 0 


184 GEOMETRIC MORPHISMS~-BASIC THEORY A4.3 


The corollary can easily be proved directly: if f is both a surjection and an 
inclusion, and 7 and e denote the unit and counit of (f* 4 f,), then f*7 is an 
isomorphism since it is a (one-sided) inverse for €f», but f* reflects isomorphisms, 
so 7) is an isomorphism. 

To conclude the present section, we revisit the three examples discussed in 
4.2.7, and consider them in the light of 4.2.10. 


Examples 4.2.12 (a) Let f: A— B be a morphism in a topos £. As we saw 
in the proof of 2.4.7, the counit of the adjunction (f* 4 II) is an isomorphism 
iff the unit of (Ly 4 f*) is an isomorphism; but the latter happens iff f*(f) is 
an isomorphism, i.e. iff f is monic. Combining this with 4.2.7(a), we see that 
the factorization of 4.2.10, applied to the morphism €/A — E/B induced by an 
arbitrary morphism f: A — B of E, corresponds to the usual image factorization 
of f arising from the fact that £ is a regular category. 

(b) Let f:C — D be a functor between small categories. If f is full and 
faithful, then the induced geometric morphism |C, Set] — [D, Set] is an inclusion; 
for the comma category (fA f) has an initial object 1¢4 for any object A of 
C, whence we deduce that (lim ,(F))(fA) = F(A) for any F:C — Set, i.e. 
f* lim s(F) = F. Thus, for an arbitrary f, the factorization of 4.2.10 applied 
to the morphism ÍC, Set) — [D, Set] induced by f yields [D’, Set] as the image, 
where D is the full image of f, ie. the full subcategory of D on all objects of 
the form fA. Alternatively, by the last part of 4.2.7(b), we could ‘complete’ this 
subcategory by closing it off under images of idempotents in D; of course, by 
1.1.9, this makes no difference to the functor category [D’, Set]. 

(c) Let f: X —Y be a continuous map of topological spaces. Then it is 
straightforward to verify that f,:Sh(X)— Sh(Y) is faithful iff it is full and 
faithful, iff f-?: O(Y) — O(X) is surjective. If X is a subspace of Y and f 
is the inclusion, then the latter condition is satisfied; the converse holds (up to 
homeomorphism) provided Y satisfies the Ty separation axiom, in which case the 
surjectivity of f~? forces f to be injective. Combining this with 4.2.7(c), we see 
that if we apply the factorization of 4.2.10 to the morphism Sh(X) —> Sh(Y) 
induced by an arbitrary continuous f: X — Y, we obtain Sh(J), where I is 
the image of f topologized as a subspace of Y (that is, we obtain the coimage 
factorization in Sp, rather than the image factorization). 


Suggestions for further reading: Carboni & Johnstone [224], Johnstone 
et al. [546]. 


A4.3 Cartesian reflectors and sheaves 


In contrast to Theorem 4.2.1, the category of algebras for a cartesian monad T = 
(T,n, 2) on a topos £ is not in general a topos. (It can be shown to be cartesian 
closed, by an argument similar to that in the proof of 4.2.1(i): if (A,@) and 
(B, 3) are T-algebras, we define the exponential (B, 3)(4‘ to be the equalizer 
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of 
BA Be Bra 


equipped with a suitable algebra structure. However, it lacks a subobject classi- 
fier in general.) But in the particular case when the monad is idempotent (i.e. p is 
an isomorphism), so that the category of algebras may be identified with a reflec- 
tive subcategory of €, we do get a topos, as we shall see in this section. (We 
recall from Section Al.1 our conventions regarding the terms ‘reflective subcate- 
gory’, ‘reflection’ and ‘reflector’. Normally we shall not distinguish notationally 
between a reflector (that is, an idempotent monad) and its functor part. We 
should also mention the tradition, deriving from module theory, whereby reflec- 
tive subcategories of a cartesian category C for which the reflector is cartesian 
are called localizations of C; but we shall not use this term.) 

We begin with a simple but important result, which is a close relative of 
Lemma 1.5.8. 


Proposition 4.3.1 Let E be a cartesian closed category, and L a reflective 
subcategory of E, corresponding to a reflector L on E. Then L preserves finite 
products iff (the class of objects of) L is an exponential ideal in €. Moreover, if 
these conditions hold then B”: B’4A — BA is an isomorphism for every object 
B of L and every object A of E, where n: ie — L is the unit of the reflection. 


Proof Suppose L preserves finite products; let B be an object of £ and A 
an object of £. For any object C of E€, we have the following string of bijective 
correspondences, which are natural in C: 


C — B4 
CxA — B 
LCxLA — B 
LCxA — B 
LC — BA 


where we have used the fact that LC x LA is the reflection in £ of both C x A and 
LC x A. So every morphism C — B^ factors uniquely through nc: C > LC; 
applying this to the identity morphism B4 — B4, we deduce that nga is an 
isomorphism, i.e. that B4 is in L. Moreover, examination of the above argument 
shows that we have a natural bijection between morphisms C — B^ and C > 
BA, so B4 and B44 must be isomorphic (the isomorphism being B”). 
Conversely, suppose £ is an exponential ideal. It is trivial that any reflector 
preserves 1, so we need only consider binary products. Let A and B be two 
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objects of £; then for any object C of £L we have natural bijections 


YANYANA 


ga 


where we have used the fact that C? and C¥4 belong to £ if C does. So 
every morphism from A x B to an object of £ factors uniquely through 
na xng: Ax B— LA x LB; but LA x LB belongs to £, since the latter is 
closed under any limits which exist in €, and so it must be (isomorphic to) the 
reflection L(A x B) of A x B. O 


Now suppose that € has pullbacks, and let L be a reflector on € which 
preserves pullbacks. Then, for any object A of €, we may define a unary operation 
cr,A (or simply cz) on subobjects of A, as follows: if A’ — A is monic, then so 
is LA’ — LA, and we define cz (A) by the pullback diagram 


ro a eee Be 


pea TÀ 


Lemma 4.3.2 The operation cz just defined is a closure operation on Sub(A); 
that is, it is order-preserving and satisfies A’ < cy,(A’) & cLeL( A’) for any A’. 
Moreover, cy, commutes (up to isomorphism) with pullback along an arbitrary 
morphism of €. 


Proof The fact that cz is order-preserving is clear from the form of the def 
inition, and the inequality A’ < c,(A’) follows immediately from the naturality 
of 7. For the idempotency, we note that the pullback above is preserved by L; 
but Lya is an isomorphism, and so Lc, (A’) = LLA’ = LA’ as subobjects of LA, 
whence cy,cz(A’) = c,(A’). Finally, given a morphism f: B —- A, let B’ — B 
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be the pullback of A’ — A along f; then in the diagram 


OB xe 2 sD! 


FO | a ee BU 


B————— | ———_> LB 


A ——— LA 


the front, back and right vertical faces are pullbacks, whence it follows that the 
left one is too. oO 


A closure operation on subobjects which commutes with pullback, as above, 
will be called a universal closure operation on €. We shall use the terms dense 
and closed with their usual meanings relative to a universal closure operation. 
For future reference, we note some elementary properties of dense and closed 
subobjects: 


Lemma 4.3.3 Let c be a universal closure operation on a cartesian category 
€. Then 


(i) Given a commutative square 


TEE 


where m is a dense subobject and n is closed, there is a unique morphism 
g: A— B’ satisfying ng = f and gm = F”. 

(ii) For any A’ — A, c(A') may be characterized as the unique subobject A” 
of A such that A’ — A” is dense and A” - A is closed. 

(iii) For subobjects A’ — A and A” — A, we have c( A'N A”) & c(A’)Nec(A”). 
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Proof (i) The commutativity of the square implies that A’ < f*(B’) in 
Sub(A). So we have 


A = eA!) < cf f*(B)) = f*(e(B')) = f*(B’), 


which establishes the existence of g. Uniqueness follows from the fact that n is 
monic. 

(ii) Clearly, c(A’) — A is closed; and A’ — c(A’) is dense, since its closure 
is the pullback of c(A’) — A along c(A’) — A. The fact that this characterizes 
c(A’) up to isomorphism in Sub(A) follows from two applications of (i) (in the 
particular case when f and f’ are also monic). 

(iii) By (ii), it suffices to show that A’ A” >=> c(A’) N c(A”) is dense and 
c(A’) N c(A”) — A is closed. But this follows at once from the facts that the 
classes of closed and dense monomorphisms are both stable under pullback and 
composition, which in turn follow easily from the definition of a universal closure 
operation. O 


Definition 4.3.4 Let c be a universal closure operation on a cartesian 
category E. 


(a) We say an object A of € is (c-)separated if, whenever we have a diagram 


f 


B' ———>A 


B 


where m is c-dense, there is at most one f: B + A with fm = f. 


(b) We say A is a (c-)}sheaf if, whenever we have a diagram as above, there is 
exactly one f with fm = F. 


The following example explains the re-use of the terms which we met before 
in Chapter A2. 


Example 4.3.5 Let (C,T) be a site, as defined in 2.1.9. Given a monomor- 
phism F” — F in [C°P Set] (where, as usual, we assume for convenience that 
each F’(A) is an actual subset of F'(A)), we define a new subfunctor d(F’) > F 
by declaring that d(#’)(A) should contain, in addition to the elements of F'(A), 
all those z € F(A) for which there exists a T-covering family (f;: A; ~ A] i € J) 
such that F'(f;)(z) € F’(A;) for each i € I (that is, those elements of F(A) which 
are ‘T-locally in F”). The stability condition (C) in the definition of a coverage 
ensures that d(F’) is indeed a functor; also, d(F’) contains F’ by definition, 
and d is order-preserving as an operation on Sub(F). Unfortunately, d is not in 
general idempotent (though it is so in many particular cases, for example that 
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which arises from the usual notion of open covering on the open-set lattice of 
a topological space); but since [C°P, Set] is well-powered we may obtain a clo- 
sure operation c by iterating d (transfinitely if necessary, taking unions at limit 
ordinals) until it converges. It is also easy to see that d commutes with pullback 
along a fixed morphism of (C°P, Set]; hence so does its transfinite iterate, i.e. the 
closure operation c just described is universal. 

We claim that the notions of T-sheaf and T-separated functor, which we 
defined in 2.1.9 and 2.6.4(d) respectively, coincide with those of c-sheaf and 
c-separated object defined in 4.3.4. In one direction, this is easy: given a 
T-covering family (fı: A; > A | i € J), we form a subobject R of C(-, A) 
by setting 


R(B) = {g: B > A | g factors through f; for some i} . 


It is easy to verify that R — C (—, A) is c-dense; in fact d(R) is already the whole 
of C (—, A). Moreover, for any F': C°P — Set, natural transformations R — F 
correspond bijectively to compatible families s; € F(A;) (¢ € J), as defined in 
2.1.9; the correspondence sends 0: R — F to the family (04,(f;) | i € J). So the 
assertion that F satisfies the sheaf axiom (resp. the separatedness axiom) for 
the family (f; | i € I) says precisely that each morphism R — F has exactly 
one (resp. at most one) extension to a morphism C (—, A) — F. Thus c-sheaves 
(resp. c-separated objects) are T-sheaves (resp. T-separated functors). 

For the converse, let F be a T-sheaf, G’ — G a c-dense monomorphism, and 
8: G’ — F a natural transformation. For any x € d(G’)(A), either z € G’(A) or 
we have a covering (fi: A; ~ A | i € J) such that each G(f;)(x) is in G’(A;); 
and then the family of elements 


(84,(G(fi)(2)) € F(A) [7 € 1) 


is clearly compatible and so defines a unique y = d(0)a(x) € F(A). In this 
way we can extend 0 uniquely to a natural transformation d(0): d(G’) > F; 
repeating this process as often as necessary, we get a unique c(0): G = c(G’) — F 
extending 0. The argument for separated objects is similar. 


The argument above does not establish a bijection between coverages on a 
small category C and universal closure operations on [C°?, Set]; several differ- 
ent coverages can give rise to the same closure operation (and consequently to 
the same category of sheaves). In Section C2.1 we shall distinguish a particular 
class of coverages (called Grothendieck coverages) which do correspond bijec- 
tively to universal closure operations on {C°?, Set] (and to cartesian reflectors on 
[CP Set]). 

In the case where a universal closure operation is derived (as in 4.3.2) from a 
cartesian reflector, it is easy to recover the reflector (or at least the corresponding 
reflective subcategory) from it. 


Lemma 4.3.6 Let L be a cartesian reflector on a cartesian category €, corre- 
sponding to a refiective subcategory L, and let cz, denote the universal closure 
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operation derived from L as in 4.3.2. Let A be an object of E. Then 
(a) The following are equivalent: 


(i) A is cy-separated. 
(ii) The unit map na: A — LA is monic. 
(iii) A is a subobject of an object of L. 
(iv) The diagonal map A >=> A x A is cy-closed. 


(b) The following are equivalent: 


(i) A is a cy -sheaf. 
(ii) The unit na: A — LA is an isomorphism. 
(iii) A is an object of L. 


Proof (a) Assume (i); let (a,b): R=3 A be the kernel-pair of n4, and 
d: A» R the diagonal map (witnessing the fact that R is reflexive), so that 
d is the equalizer of a and b. Since L is idempotent, Ln, is an isomorphism; 
so since L preserves kernel-pairs and equalizers we have La = Lb and Ld is an 
isomorphism. But this implies that d is c,-dense; since A is separated we deduce 
that a = b, ie. that 74 is monic. The implication (ii) > (iii) is trivial. Condition 
(iv) holds for all objects of £, since it is easy to see that all monomorphisms in 
L£ are cz,-closed; but it is inherited by subobjects, since the pullback of the diag- 
onal along a monomorphism is the diagonal, and so (iii) implies (iv). Finally, 
(iv) implies (i) by an application of 4.3.3(i): if fı and fz are two morphisms 
completing the diagram of 4.3.4(a), then the diagram 


B’ a ar A 
m A 
(fi, fa) 


B— > AxA 


commutes, so (f1, f2) factors through the diagonal, i.e. fi = fo. 

(b) Again, let us begin by assuming (i). By part (a), na is monic, and since 
Ina is an isomorphism it is cz-dense. So the identity morphism 14 extends 
uniquely to a morphism r: LA — A with rna = 1,4. Thus A is a retract of 
an object of £L; but £ is closed under retracts in E, since it is closed under 
equalizers. So (iii) holds; the equivalence of (ii) and (iii) is again trivial (recall 
our convention that reflective subcategories are replete). Conversely, if (iii) holds, 
then given a diagram as in 4.3.4(a) we obtain a unique factorization g': LB’ — A 
of f’ through ng’; but Lm is cz-closed (since its domain and codomain are in £), 
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and so on applying 4.3.3(i) to the square 


B’ sa LB’ 


m Lm 


TRER y; 


we obtain a unique factorization of ng through m, and hence a factorization of 
f’ = g'ng through m (the uniqueness of the latter follows from part (a)). OD 


It should be remarked that the equivalence of (i) and (iv) in 4.3.6(a) holds 
for any universal closure operation, whether or not it is induced by a cartesian 
reflector. For a direct proof of (iv) from (i), let A — A x A be the closure of 
the diagonal, and observe that the two projections A = A are equalized by the 
dense monic A — A. 

The next result is reminiscent of 2.6.9 (indeed, we shall see before long that 
2.6.9 is a special case of it). 


Lemma 4.3.7 Let c be a universal closure operation on a cartesian category E, 
and m: B — A ac-dense monomorphism in E. Then pullback along m induces a 
bijection from (isomorphism classes of) closed subobjects of A to closed subobjects 
of B. 


Proof Let B’ — B be a closed subobject. If there is any closed A’ — A such 
that 


B! >——=> A’ 


Bes ss 4 


is a pullback, then B’ — A’ must be dense, and so by 4.3.3(ii) A’ must be the 
closure of the composite monic B’ —» B — A. But if we define A’ — A to be 
the closure of this composite, then m*(A’) is the closure of B’ — B, and so is 
isomorphic to B’; that is, the square above is a pullback. QO 


Lemma 4.3.8 Let c be a universal closure operation on a cartesian category 
E, and let A be ac-sheaf. Then a subobject m: A’ — A is c-closed iff its domain 
A’ is also a c-sheaf. 
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Proof Suppose m is closed, and suppose given a diagram 


F 


B! ——> A’ 


B 


where n is dense. Since A is a sheaf, there is a unique g: B — A satisfying 
gn = mf’, and then 4.3.3(i) yields a unique factorization f : B — A’ of g through 
m. So A’ is a sheaf. 

Conversely, suppose A’ is a sheaf; let m: A” =— A be the c-closure of m. Then 
A’ > A” is dense, and so has a splitting r: A” — A’; and then the composite 
mr: A” — A equals m since the two morphisms are equalized by the dense 
monomorphism A’ — A”. Since m is monic, it follows that r is a two-sided 
inverse for A’ —+ A”; so the latter is an isomorphism, and A’ is closed in A. O 


We have now assembled all the ingredients for the main result of this section. 


Theorem 4.3.9 Let E be a topos, and L a cartesian reflector on E, corre- 
sponding to a reflective subcategory L. Then £L is a topos, and the inclusion 
L — E is the direct image of a geometric morphism, whose inverse image is (the 
factorization through £L of) L. 


Proof C is cartesian since it is closed under limits in £, and it is cartesian 
closed by 4.3.1. So it suffices to show that it has a subobject classifier. Let 
J — Q be the cz,-closure of the generic subobject T: 1 — Q, and j: Q —- Q its 
classifying map. Then since cz is stable under pullback it is easy to see that the 
entire closure operation is induced by j, in the sense that if @: A — Q classifies 
a subobject A’ — A, then the cz-closure of A’ is classified by the composite 
j@. In particular, if we define Q; — Q to be the equalizer of j and Ip, then 
Q; ‘classifies closed subobjects’; that is, a morphism ¢: A — Q factors through 
Q; iff the subobject it classifies is cz-closed. It now follows immediately from 
Lemma 4.3.7 that Q; is a cy-sheaf (equivalently, by 4.3.6(b), an object of £), 
and from 4.3.8 that it is a subobject classifier for £. The fact that the indicated 
functors form a geometric morphism is immediate. o 


Thus there is a bijection between (isomorphism classes of) cartesian reflectors 
on a topos € and (equivalence classes of) geometric inclusions, as defined in the 
previous section, with codomain €. We do not yet know that these concepts also 
correspond bijectively to universal closure operations on £, since we have not 
shown that all such closure operations arise from cartesian reflectors; but we shall 
do this in the next section. Once we have done so, then 4.3.5 and 4.3.9 together 
will provide a full proof of the result, only sketched in 2.1.10, that the category 
of sheaves on an arbitrary small site is a topos. In view of Example 4.2.12(c), 
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we commonly refer to the reflective subcategories of a topos E which correspond 
to cartesian reflectors as subtoposes of € (or geometric subtoposes, if we need 
to distinguish them from logical subtoposes which are full subcategories closed 
under finite limits and power objects, like Sets in Set; we have seen that a 
geometric subtopos is closed under finite limits and exponentials, but it cannot 
be closed under power objects (equivalently, contain (2) unless it is the whole of 
E, by the remark after 4.1.2). 


Remark 4.3.10 It is instructive to re-examine the factorization of 4.2.10 from 
the point of view of cartesian reflectors and universal closure operations. Let 
f: F — E be a geometric morphism. Adopting the notation of 4.2.8, we see that 
the reflector L = h,h*: € — E sends an object A to the equalizer of 


fa f*na 
RTAS ELA 
Df. frA 


(where 7 is the unit of (f* 4 f,)), the unit map A — LA being the factorization 
of na through this equalizer. That is, the monad on € induced by (h* 7 h,) is the 
associated idempotent monad (in the sense of Fakir [347]) of the monad induced 
by (f* 4 f). This idea can be used, in conjunction with 4.3.9, to provide an 
alternative construction of the surjection-inclusion factorization of an arbitrary 
geometric morphism, constructing the image as a subtopos of € rather than as 
a category of coalgebras over F. 
Now let A’ — A be a monomorphism in E. In the diagram 


LA >—> GPA 3 BP Pa 


LA >—»> f f*A ZR fff fA 


the two rows are equalizers, and since the vertical arrows are monic one read- 
ily deduces that the left-hand square is a pullback. Thus the universal closure 
operation cz induced by L may be described directly in terms of f: specifically 
cz(A' — A) is the pullback along n4: A —> fa f*A of f,f*A’ — faf*A. Note 
that 4.2.6(v) provides a direct proof that this closure operation is trivial if f is 
a surjection. 


It will frequently be useful, later on, to have a criterion for an arbitrary 
geometric morphism to factor through a given inclusion. 


Proposition 4.3.11 Let f: F — € be a geometric morphism, and L a carte- 
sian reflector on E. The following are equivalent: 


(i) f factors (uniquely) through the inclusion h: L — E which corresponds to 
L under 4.3.9. 
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(ii) f*(m) is an isomorphism for every c; -dense monomorphism m. 

(iii) f.(B) is a cL-sheaf for every object B of F. 
Proof  (i)= (ii): From the definition of cz, it is clear that the functor L = h,h* 
sends c,-dense monomorphisms to isomorphisms. But h, reflects isomorphisms, 


so h* also has this property; hence so does f* if there is a factorization f* = g*h*. 
(ii) = (iii) is immediate from the definition of a sheaf: given 


A’ ey f.(B) 


A 
with m c,-dense, we transpose across the adjunction (f* 4 fx) to obtain 


Py 


f(A) > B 
f*(m) 


F(A) 
But f*(m) is an isomorphism, so there is a unique g: f*(A) — B making this 
diagram commute; its transpose g: A — f,(B) is the morphism we seek. 
(iii) = (i): If (iii) holds, then by 4.3.6(b) there is a unique factorization f, = 
h«g- Moreover, the composite g* = f*h, is a cartesian left adjoint for g,, since 
h, is full and faithful. So we have a factorization f & hg in Top. o 


Corollary 4.3.12 Let f: F — E be a geometric morphism, and let L and M 
be cartesian reflectors on E and F respectively, corresponding to subtoposes L 
and M. The following are equivalent: 


(i) There erists a (unique) geometric morphism M — L making the diagram 


M ——> L 


F ——— > E 
commute up to isomorphism. 
(ii) f* sends c-dense monomorphisms to cm-dense monomorphisms. 


(ii) fẹ sends cm-sheaves to cr-sheaves. 
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Proof These are just the three equivalent conditions of 4.3.11 applied to the 
composite M — F — E. (Note that the monomorphisms in F which are sent 
to isomorphisms in M by the inverse image functor are precisely the cm-dense 
ones.) o 


Suggestions for further reading: Bell & Gebellato [98], Borceux [141, 144], 
Borceux & Kelly [151], Borceux & Van Den Bossche [164], Johnstone [538, 539], 
Kelly & Lawvere [586], Wyler [1248]. 


A4.4 Local operators 


In this section we study universal closure operations on a topos; our primary aim 
is to show that they are all induced by cartesian reflectors, and so correspond to 
subtoposes via 4.3.9. However, for the sake of completeness we shall also include 
a number of results about universal closure operators on quasitoposes. 

In the course of proving 4.3.9, we saw that a universal closure operation c 
on a topos is entirely determined by a particular endomorphism of Q, namely 
the classifying map of the closure of the generic subobject T. It is convenient to 
have a name for the endomorphisms of Q which arise in this way; we shall call 
them local operators. (The old name for them is (Lawvere-Tierney) topologies, 
but this name can lead to confusion since they have little to do with topologies 
in the classical sense.) 


Definition 4.4.1 Let E bea topos, with subobject classifier Q. A local operator 
on € (or on Q) is a morphism j: Q —> Q such that the diagrams 


1— >g ; a and axqg——s a9 
: ye : 

T J j J JX] J 

Q Q Q x Q——> 2 


commute (where A is the binary meet operation on Q defined in 1.6.3). 


Lemma 4.4.2 For any topos E, there is a bijection between universal closure 
operations on E and local operators on E. 


Proof Given j: Q — Q, let c; denote the (pullback-stable) unary operation on 
subobjects in € induced by composing classifying maps with j. We have to show 
that c; is a universal closure operation iff j is a local operator. But the first two 
diagrams of 4.4.1 are precisely equivalent to saying that A’ < c;(A’) S cjc;(A’) 
for any subobject A’ — A in £; the third diagram says that c; commutes with 
binary intersection of subobjects, which in particular implies that it is order- 
preserving. Conversely, we saw in 4.3.3(iii) that any universal closure operation 
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commutes with binary intersections, so the endomorphism of Q which induces it 
must be a local operator. O 


Given a local operator j, we adopt as standard the notation already intro- 
duced in the proof of 4.3.9; that is, we write J — Q for the subobject classified 
by j, and Q; — Q for the equalizer of j and 1g. Note that since j is idempo- 
tent, we may equivalently describe 0; as the image of j; in particular, it is a 
retract of Q, and therefore injective in E by 2.2.6. As in 4.3.9, we see that Q; 
‘classifies closed subobjects’ for the corresponding universal closure operation; 
and J similarly classifies dense subobjects — that is, a morphism ¢: A — Q fac- 
tors through J iff 7¢ factors through T, iff the subobject of A classified by ¢ is 
c;-dense. 

We may also talk about local operators on a quasitopos € (defined exactly as 
in 4.4.1); but for a quasitopos the relationship between universal closure oper- 
ations and local operators is slightly more complicated. We shall say that a 
universal closure operation on a quasitopos is proper if the closure of a cocover 
is always a cocover; we note that the closure operation cz induced by a carte- 
sian reflector L always has this property, since cocovers coincide with regular 
monomorphisms and L preserves equalizers. If c is a proper universal closure 
operation, then its action on cocovers is determined as in 4.4.2 by a local operator 
j: Q — Q. However, we cannot hope to recover the whole of c from 7: for exam- 
ple, let cı be the closure operation such that, for any A’ >> A, A’ > ¢(A’) — A 
is the unique factorization of A’ — A as a (monic) epimorphism followed by a 
cocover. (This factorization is stable under pullback, since epimorphisms and 
cocovers both are. Moreover, the c,-sheaves are exactly the coarse objects of 
E, as defined in Section A2.6; so by 2.6.12 and 4.3.6(b) this closure operation 
corresponds to a reflector on E£. We shall see shortly that this reflector is in fact 
cartesian.) Then cı sends each cocover to itself, and so the local operator which 
it induces is the identity; but c, is not the identity operation on subobjects, 
unless € is a topos. 

We shall say that c is strict if the closure of every subobject is a cocover. 
Given a proper universal closure operation c, we can construct a strict closure 
operation € having the same effect on cocovers as c, by defining G(A’) = c(c,(A’)) 
where cı is as above. There is a bijection between local operators 7: Q — Q 
and strict universal closure operations on £: given j, we define c;(A’ — A) to 
be the cocover classified by jọ, where œ is the classifying map of c,(A’ — A). 
We further note that the passage from c to € above does not affect the notion 
of separated object: since every ¢-dense monomorphism factors as an epimor- 
phism followed by a c-dense cocover, we see that every c-separated object is 
¢-separated (and the converse is trivial). However, in general ¢ will have fewer 
sheaves than c. 

Given a universal closure operation c on a cartesian category E, we shall 
write sep,(€) and sh,(€) for the full subcategories of c-separated objects and 
c-sheaves in £. 
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Lemma 4.4.3 Let c be a universal closure operation on a cartesian category E. 
Then 


(i) The subcategories sep,(E) and sh,(E€) are closed under finite limits in E. 


(ii) If E is cartesian closed, then sep,(E) and sh,(€) are exponential ideals 
in E. 


Proof (i) This is immediate from the fact that the conditions for A to be a 
sheaf or a separated object involve only morphisms with codomain A. 

(ii) Suppose A is a c-sheaf and B is any object of £. For any c-dense 
m: C’>> C, the morphism m x 1g: C’ x B> C x B is also c-dense (since it 
is the pullback of m along the product projection), and so any morphism 
C’ x B — A extends uniquely to a morphism C x B — A. But this is equivalent 
to saying that A? is a sheaf. The argument for separated objects is similar. O 


Now suppose c is a proper universal closure operation on a quasitopos €. 
Write j: Q — Q for the local operator induced by c (i.e. by the action of c on 
cocovers), and Q; — Q for the equalizer of j and lg. As before, we see that 
morphisms A — Q; correspond to monomorphisms A’ — A which are cocovers 
and c-closed, i.e. to €-closed subobjects of A. So Lemma 4.3.7 implies that Q; is 
a c-sheaf, and in particular a c-sheaf. We shall write P;(A) for the exponential 
04; by 4.4.3(ii) this is also a ¢-sheaf. 


Proposition 4.4.4 Letc be a proper universal closure operation on a quasito- 
pos E. Then the subcategories sep,(E) and sh,(€) are both reflective in E. 


Proof By definition, they are both full and replete, so it suffices to construct 
the appropriate left adjoints. Let A be an arbitrary object of E£; write A Ax A 
for the ¢-closure of the diagonal A: A — A x A (which in fact coincides with 
its c-closure, since A is regular monic), and let d: A — P(A) be the ‘name’ 
of this ¢-closed relation. We note in passing that A is an equivalence relation 
on A, as defined in 1.3.6; this follows easily from the pullback-stability of the 
closure operation. We write MA — P;(A) for the image of d (in the sense of 
the cover—-monic factorization, not the epic-cocover one!), and LA — P;(A) for 
the c-closure of MA. Since P;(A) is a c-sheaf, it follows from 4.3.8 that LA is 
a c-sheaf; and M A is c-separated, since it is easily seen that any subobject of a 
separated object is separated. 

By (the quasitopos version of) Proposition 2.4.1, the kernel-pair of d (and 
hence of the regular epimorphism A-> M A) is A 3 A; so a morphism f: A— B 
factors (uniquely) through A -> M A iff it coequalizes these two projections. But 
this is always the case if B is c-separated, since the two projections are equalized 
by the c-dense monic A — A. Thus MA is the reflection of A in sep,(€). If 
further B is a c-sheaf, then any morphism MA — B can be uniquely extended 
along the c-dense monic MA > LA, from which it follows that LA is the 
reflection of A in sh,(€). o 
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The reflection LA is commonly called the associated sheaf (or sometimes the 
sheafification) of the object A. 


Theorem 4.4.5 Let c be a proper universal closure operation on a 
quasitopos E. Then the categories sep,(E) and sh,(€) are both quasitoposes. 
Further, if c is strict, then sh, (E) is a topos. 


Proof By 4.4.3(i) both subcategories inherit finite limits from £, and by 4.4.4 
they are both cocartesian. By 4.4.3(ii), they are both cartesian closed; but we 
have to show that they are locally cartesian closed. To this end, let B be an 
object of sep,(€), and define a universal closure operation cg on E/B by setting 


cp((A’ > B) = (A > B)) = ((c(A’) > B) > (A > B)) 


where c(A’) denotes the c-closure of A’ in A. It is straightforward to verify that 
an object (A — B) of E/B is cg-separated iff A is c-separated in £; so we have 
an isomorphism 

sep,(€)/B = sep., (E/B), 


from which it follows that sep,(€)/B is cartesian closed. The argument for 
sheaves is similar. 

Finally, we must determine the cocovers in sep,(€) and sh,(&); we shall 
show that they are exactly the ¢closed monomorphisms, so that Q; will serve 
as a weak subobject classifier for both categories. Moreover, in the case when 
c is strict, so that all c-closed monomorphisms are cocovers and hence C-closed, 
Q; is actually a subobject classifier for sh,(€), since all monomorphisms in this 
category are c-closed by 4.3.8. One direction is easy: any c-dense monomorphism 
is epic in sep,(€), by the very definition of a separated object, and since any 
¢-dense monomorphism factors as an epimorphism in € (which is clearly still 
epic in sep,(€)) followed by a c-dense monomorphism, it follows that a cocover 
in sep,(€) must be ¢-closed. In sh,(€) the argument is even easier: here we 
know already that all monomorphisms are c-closed, so it suffices to prove that 
cocovers in sh,(€) must be cocovers in E — which is immediate. For the converse, 
it suffices to observe that every ¢-closed monomorphism is a pullback (in £, and 
hence in sep,(€) or sh,(€) as appropriate) of the generic ¢-closed monomorphism 
Tj: 1>+Q; (that is, the unique factorization of T through Q;), which is split 
and hence regular monic in any full subcategory of € in which it lives. 0 


We note that Theorem 4.4.5 includes as particular cases a number of results 
which we have seen before. For example, if we take c to be the particular (strict) 
closure operation cı mentioned earlier, it yields the result of 2.6.12 that Cs(E) is 
a topos for every quasitopos €. When € itself is a topos (so that every universal 
closure operation on € is strict), it reduces to the result of 4.3.9 that every 
reflective subcategory of € with cartesian reflector is a topos. (Note: when £ is a 
topos, it is customary to designate subcategories of separated objects or sheaves 
by the name of the corresponding local operator rather than that of the universal 
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closure operation: that is, we write sep; (£) and sh;(€) rather than sep,(€) and 
sh,(€).) And when £ is a topos of the form [C°?, Set], then (in conjunction with 
4.3.5) 4.4.5 yields the results of 2.6.4(d) and 2.1.10 that Sep(C, T) is a quasitopos 
and Sh(C,T) is a topos for any small site (C, T). 

Next, we investigate the limit-preservation properties of the two reflectors L 
and M constructed in the proof of 4.4.4. By 4.3.1 and 4.4.3(ii), we already know 
that they both preserve finite products. 


Lemma 4.4.6 The functor L is cartesian. 


Proof It suffices to show that it preserves equalizers. Let 
€ f 
A >—— B ——2 C 
g 


be an equalizer diagram in €, and let d: E — LB be the equalizer of Lf and Lg. 
Since L is a functor, we have a unique factorization k: LA — E of Le through d; 
we must show that k is an isomorphism. 

First we show that k is monic. Let (u,v): X = LA be a pair of morphisms 
with ku = kv. On forming the pullback 


p———_—__>X 


(Pp, 9) (u,v) 


x 
A JALA 


we obtain a pair (p,q) with 
nBep = (Le)nap = dkur = dkur = (Le)naq = ngeq - 


So the pair (ep, eq): P = B factors through the kernel-pair B = B of np; but 
by the construction of 4.4.4 the diagonal map B — B is dense, and so on pulling 
it back we get a dense P’ — P equalizing ep and eq (and hence equalizing p 
and q, since e is monic). Since LA is separated, it follows that nap = nq, i.e. 
ur = ur; but the image of r is dense in X, because the image of na x na (namely 
MA x MA) is dense in LA x LA, and hence u = v, as required. 

Now LA and E are both sheaves, so by 4.3.8 the monomorphism k: LA >= E 
is closed. We shall show that it is also dense, and hence an isomorphism. Form 


the pullback 
Q B 
| hr 
E B 


—— L 


h 
a 
d 
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then we have 
no fh = (Lf)ngh = (Lf)di = (Lg)di = (Lg)ngh = ncgh, 


so the pair (fh, gh): Q = C factors through the kernel-pair C = C of nc. As 
in the previous paragraph, this implies that there is a dense m: Q’ >> Q equal- 
izing fh and gh, so that the composite hm: Q’ — B factors through e (say by 
l: Q’ — A). Now since LA is a sheaf, the composite nal’: Q’ + LA extends 
uniquely to a morphism |: Q — LA; and since E is separated, the diagram 


Qu" 94 


commutes, for we have 
dim = nghm = ngel = (Le)nal’ = (Le)lm = dklm 


and d is monic and m is dense. So, in Sub(£), k contains the image of i; but the 
latter is dense, since it is the pullback along d of the image of ng. E 


Remark 4.4.7 For toposes, there is a much shorter (but more ‘high- 
technology’) proof of 4.4.6 using properties of injective objects. First we note 
that, by 2.2.6, the injective objects of sh,;(€) are exactly the retracts of power 
objects 24, A € ob sh;(E); but since 2; is a retract of Q in E, all such objects 
are in fact injective in E€. Again by 2.2.6, we note that a morphism m: A — B in 
a topos is monic iff every morphism from A to an injective object factors through 
m; from this, and the fact that the inclusion sh;(€) — E preserves injectives, it 
is easy to see that L preserves monomorphisms. 

We now claim that L preserves binary intersections of subobjects (and hence 
in particular equalizers of coreflexive pairs, cf. 1.2.10); given that it preserves 
finite products by 4.3.1 and 4.4.3(ii), this is enough to ensure that it preserves 
all finite limits. Suppose given subobjects A, — A and A> — A in £; then by 
1.4.3 we have a pushout diagram 


A N Ag >———> A 


Ag >——> A, U Ag 
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in E, and hence the square 


L(A, N Ag) >——> LA, 


LA} >——> L(A, U A2) 


is a pushout in sh; (£) since L is a left adjoint. By 2.4.3 (or by 1.4.8), this square 
is also a pullback; but since L(A; U Az) — LA is monic, the square 


L(A, N Ag) >_> LA, 


LA >———~> LA 


is also a pullback, i.e. L(A; Ag) is the intersection of LA, and LAg in Sub( LA). 

The first part of the above argument works for quasitoposes as well as 
for toposes: note that powers of Q in a quasitopos are injective for arbitrary 
monomorphisms (not just for cocovers), because they are coarse objects by 
2.6.9. It can therefore be used to give a proof that the reflector M: E — sep,(€) 
preserves monomorphisms, for any proper universal closure operation c on a 
quasitopos £; however, the same result could alternatively be deduced from the 
fact that M is a subfunctor of L and L preserves monomorphisms. On the other 
hand, the second part of the argument above definitely requires the reflective 
subcategory of £ to be a (pre-)topos, since it requires the result of 2.4.3 to hold 
for pushouts of arbitrary monomorphisms, and not just for cocovers (cf. 2.6.2). 


Theorem 4.4.8 For a quasitopos E, there is a bijection between reflective sub- 
categories of E with cartesian reflector, and proper universal closure operations 
on €. In particular, if E is a topos, there is a bijection between subtoposes of E 
and local operators on €. 


Proof We saw in 4.3.2 how every cartesian reflector gives rise to a universal 
closure operation (and we observed before 4.4.3 that this closure operation is 
proper); conversely, the combination of 4.4.4 and 4.4.6 shows how every proper 
universal closure operation gives rise to a cartesian reflector. If we start from 
a cartesian reflector and apply these two constructions in succession, we arrive 
back at the same reflective subcategory, by 4.3.6(b). So it remains to show that, 
if we start from a proper universal closure operation c and perform the two 
constructions in the other order, we recover c; that is, for any monomorphism 
m: A’ > A in E, the pullback A” — A of Im along na is isomorphic to c(A’) 
in Sub(A). But A” > A is closed, since Lm is closed by 4.3.8, so by 4.3.3(ii) 
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it suffices to show that A’ — A” is dense. But the latter monomorphism is 
mapped by L to an isomorphism; so the induced morphism MA’ — MA” is 
dense monic, since the composite MA’ — MA” — LA” = LA’ is dense monic 
by the construction of 4.4.4. Now there is a pullback square 


A > A" 


A’ x MAN A" — A” x MAM A" 


in which the right vertical arrow (the diagonal) is dense monic by the construction 
of 4.4.4, so the left vertical arrow is dense monic. Moreover, the projection 
A’ xman A” — A” factors as a cover followed by a dense monomorphism, since 
it is a pullback of the composite A’ M A’ —+ M A"; and a monomorphism which 
factors as a dense monic followed by a cover must be dense, since pullback along 
a cover, being conservative, reflects as well as preserving dense monomorphisms. 
Thus we have expressed the monomorphism A’ — A” as a composite of two 
dense monomorphisms; so it is dense, as claimed. 

The second assertion of the theorem follows immediately from the first 
and 4.4.2. O 


Thus we see that a proper universal closure operation c on a quasitopos € is 
uniquely determined by the subcategory sh,(€). It is not uniquely determined by 
sep.(E), since we have already observed that sep,(€) = sep;(€) for any c; but 
if we add the requirement that c should be strict, then sep,(€) does determine 
it, since it is easily seen that in this case we have sh,(€) = Cs(sep,(€)). In 
particular, if € is a topos, then sep, (E ) determines the local operator 7. 

We saw in 4.4.7 that the separated reflector M preserves monomorphisms, 
but it is not cartesian in general. In fact if M is cartesian, we see from 4.4.8 
that sep,(€) must coincide with shy (E) for some c’; moreover, since arbitrary 
subobjects of c-separated objects are separated, it follows from 4.3.6(a) that 
every separated object for c’ must be a sheaf for it (and so, if E is a topos, 
c’ must coincide with c). Nontrivial examples of universal closure operations c 
with sep,(€) = sh,(€) can exist, even when £ is a topos; but they are not very 
common. Here is a simple one: 


Example 4.4.9 Let M be the two-element monoid {1,e} with e? = e, and 
let E be the topos of M-sets (that is, sets equipped with an idempotent endo- 
morphism). Let f,: Set — E be the functor which equips a set with its identity 
endomorphism, and f*: E — Set the functor which sends (A,e) to the set of 
fixed points of e. It is well known that f* and f, are adjoint to each other both 
ways round, and so they form a geometric morphism in either direction between 
E and Set. In the direction indicated by the notation, this morphism is an inclu- 
sion, since f, is full and faithful; but if we use it to identify Set with a subtopos 
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of E, we see that the latter is closed under arbitrary subobjects in £, so that 
every separated object for the corresponding local operator must be a sheaf. 


We conclude this section by briefly mentioning an alternative construction of 
the reflector € — sh; (E) for a local operator j on a topos E, which is described in 
greater detail in [501]. As previously, let J — Q denote the subobject classified 
by j, and let d: 1 — J be the factorization of T: 1 — Q through J (the generic 
dense subobject). Let 6: A — J denote Ia( A); equivalently, it is the pullback of 
T: A— Qalong J (cf. 2.4.7). Just as in the proof of 2.4.7, we may show that 
A classifies partial maps into A with dense domain. The alternative construction 
involves making A — J into a (contravariant) internal diagram on the internal 
poset J = (J; 3 J) (where Jı is the intersection of J x J with the ordering 
Qı — QxQ defined in the proof of 1.6.3), and forming its colimit A‘; it can then 
be shown that, for any object A, A** is the reflection of A in sh,(€). Since we 
have not yet developed the theory of internal categories and diagrams, we shall 
not give any further details of this construction; but we note that its ‘external’ 
version, for sheaves on a site, may be found in C2.2.6. For future reference, we 
also note the following characterization of sheaves and separated objects, which 
is used in the proof: 


Lemma 4.4.10 Let j be a local operator on a topos E, A an object of E. Let 
pa: Ax J — A classify the partial map 


Pee 


1xd 


AxJd 


Then A is separated (resp. a sheaf) iff a is monic (resp. an isomorphism). 


Proof For any object B, morphisms B — A x J and B — A correspond 
respectively to diagrams 


B’ A ad B' oe ae A 


B B 


with m j-dense; and composition with Ya corresponds to the operation which 
sends (m, f) to (m, fm). So this is immediate from the definitions. 0 
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Suggestions for further reading: Betti & Carboni [115], Bucalo & 
Rosolini [178], Dupont & Loiseau [317], Freyd [371], Goldblatt [412], 
Johnstone [501], Veit [1197]. 


A4.5 Examples of local operators 


In this section we discuss some particular classes of local operators on a topos, 
and the subtoposes which correspond to them; we shall also give some results on 
the structure of the ordered set of all local operators on a topos. Much of this 
discussion, too, could be generalized to quasitoposes; but we shall not go into 
the details here. 

To begin with, let U be a subterminal object in a topos E. By 4.2.12(a), 
the geometric morphism €/U — €/1 & E induced by U — 1 is an inclusion, 
so we may identify the slice category €/U with a subtopos of €. Note that the 
identification must be made, not via the forgetful functor Sy, but via the direct 
image Ily of the geometric morphism. (However, since UY = 1, as follows easily 
from 1.5.10, the description of IIy given in 1.5.2(i) may be simplified in this case; 
Ily(f: A — U) is simply the exponential AY.) In the case where € = Sh(X), 
and U is (identified with) an open subset of X, then €/U may be identified with 
Sh(U) (and the geometric morphism above is identified with that induced by 
the inclusion map U — X); for this reason, subtoposes of this form are called 
open subtoposes. 

The reflector on € corresponding to this subtopos is simply the composite 
IlyU* & (—)”, and the unit map A — AY is the transpose of the projection map 
A x U — A. It is thus easy to verify that the corresponding universal closure 
operation sends a subobject A’ — A to the implication (A x U) = A’ in the 
Heyting algebra Sub(A); equivalently, the local operator corresponding to the 
subtopos is the composite 


1 
o21xasaxa sa, 


where u: 1 — Q is the classifying map of U — 1. It is also easy to verify directly 
from the axioms for a Heyting algebra that this composite satisfies the conditions 
of 4.4.1; we shall denote it by o(U). 

Open subtoposes may be characterized by a number of equivalent conditions; 
the following proposition should be compared with 2.3.8. 


Proposition 4.5.1 Let 7 be a local operator on a topos E, and let L denote 
the topos of j-sheaves. The following conditions are equivalent: 
(i) £ is an open subtopos of €. 
(ii) The reflector L: E — L£ is a logical functor. 
(iii) The reflector L: E — L is a cartesian closed functor. 
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(iv) Ifm: A’ > A is j-dense, so is m? : A'B — AP for any B. 
(v) If d:1— J denotes the unique factorization of T:1->Q through the 
subobject classified by j, then Ilj (d) is a j-dense subobject of 1. 


(vi) There exists a morphism u:1— J such that the morphism 


uxl1 
J =&1 xJ —> JxJ 


factors through the order-relation Jı — J x J (i.e. the restriction of the 
order-relation on Q, as defined in 1.6.3). 


Proof (i)> (ii) is immediate from 2.3.2, and (ii) > (iii) follows from 2.3.7(iii). 
For (iii) > (iv), recall that the j-dense monomorphisms are precisely those which 
are mapped by L to isomorphisms. (iv) implies that d: 17 — J J is j-dense, 
since d is dense by the definition of J; hence II;(d) — 1 is j-dense by the 
construction of II; given in 1.5.2, i.e. (v) holds. For (v) = (vi), let u be (the 
factorization through J of) the classifying map of II; (d) — 1; since the counit 
of (J* + IL;) yields an inequality J*II;(d) < d in Sub(J), we deduce that the 
displayed morphism factors through the order-relation on J. Finally, if (vi) holds, 
let U ~ 1 be the subobject classified by u; then we see that, for any object A 
of E, Ax U is the unique smallest j-dense subobject of A. So an object B is a 
j-sheaf iff, for any A, each morphism Ax U — B extends uniquely to a morphism 
A — B; but this is equivalent to saying that every morphism A — BY factors 
through the transpose B — BY of the projection, and hence to saying that the 
latter is an isomorphism. So the j-sheaves are exactly the objects of the open 
subtopos corresponding to U. 0 


Although, as we saw in the course of the above proof, an open local operator 
has the property that every object of the topos has a smallest dense subobject, 
the latter condition does not suffice to characterize open subtoposes; we need the 
extra information that the smallest dense subobject of A is of the form A x U 
for a fixed subterminal object U. 


Example 4.5.2 Let C be a small category, and D a full subcategory of C. Then 
the geometric morphism [D, Set] — [C, Set] induced by the inclusion D — C is 
an inclusion by 4.2.12(b); so it corresponds to a local operator on [C, Set]. It is 
not hard to see that a monomorphism F” > F in [C, Set] is dense for this local 
operator iff F’(b) — F(b) is bijective for all objects b of D; hence any F has a 
smallest dense subobject F’, given by 


F'(a) = {x € F(a) | If: b— a with b € ob D and y € F(b) with F(f)(y) = z}. 


But the subterminal objects in [C, Set] correspond to full subcategories of C 
which are cosieves, i.e. have the property that b € ob D and f:b — a in C imply 
a € ob D. So if D is a full subcategory which is not a cosieve, we obtain an 
example of a local operator which is not open, but for which every object has a 
smallest dense subobject. 
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The next class of subtoposes we wish to consider is also indexed by the sub- 
terminal objects of a given topos; they play the rôle of the ‘closed complements’ 
of the open subtoposes we have just investigated, and we naturally call them 
closed subtoposes. Given U > 1 in a topos E, with classifying map u: 1 > Q, 
consider the composite 


1 
EE 3 T AEE, 


where V is the binary join map on Q, as defined in 1.6.3(ii). Since a Heyting 
algebra is distributive as a lattice, this composite satisfies the third commutative 
diagram of 4.4.1; the fact that it satisfies the first two is immediate, and so it isa 
local operator, which we shall denote by c(U). We shall see below that the local 
operators on a topos form a distributive lattice (in fact a Heyting algebra), and 
that the open and closed local operators corresponding to a given subterminal 
object are indeed complementary to each other in this lattice. For the present, 
we note that if U is a complemented subobject of 1, with complement V say, 
then c(U) coincides with o(V), by an easy calculation in the Heyting algebra Q; 
thus, for a Boolean topos, the notions of open subtopos and closed subtopos 
coincide. (Again, we shall see later that if £ is a Boolean topos then its only 
subtoposes are open ones.) 


Proposition 4.5.3 LetU be a subterminal object in a topos. Then 


(i) A monomorphism m: A’ — A is c(U)-dense iff A is the union of the 
subobjects A’ and A x U; and it is c(U)-closed iff it contains A x U. 
(ii) A is a c(U)-sheaf iff Ax U =U. 
(iii) The reflection of an arbitrary object A in the subtopos of c(U)-sheaves is 
the pushout of 


TI 
A x U ——> A 
T2 


U 


Proof (i) is immediate from the definition of c(U). 

(ii) Suppose A is a sheaf. If B — U is any object of €/U, then the projection 
Bx U — B is an isomorphism, and so by (i) any subobject of B (in particular, 
0 — B) is c(U)-dense. Thus the unique morphism 0 — A extends to a unique 
morphism B — A; hence there is also a unique morphism B — A x U. But this 
says that Ax U is a terminal object of €/U, so it is isomorphic to U. Conversely, 
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if A x U SU and we are given 


B 


where m is c(U)-dense, then the unique morphism B x U —> U Œ A x U agrees 
with f on the intersection B’ x U of B’ and B x U, and so can be combined with 
f to yield a unique extension of the latter to a morphism B — A. 

(iii) Since the functor (—) x U preserves pushouts (having a right adjoint), it 
is clear from (ii) that the pushout of the given diagram is a sheaf. On the other 
hand, for any sheaf B there is a unique morphism U — B (since 0 — U is c(U)- 
dense), and so any morphism A — B factors uniquely through the pushout. O 


Corollary 4.5.4 For a subterminal object U of E, the inclusion functor 
sh.y)(E) — E creates all connected colimits which exist in E. 


Proof This is immediate from 4.5.3(ii), since the functor (—) x U preserves 
colimits and the colimit of a connected diagram whose vertices are all U is 
(isomorphic to) U. O 


Example 4.5.5 Reverting to the case of a functor category [C, Set], we noted 
in 4.5.2 that subterminal objects of such a category correspond to cosieves in C. 
If R is such a cosieve, then it is easy to see that a functor F': C — Set satisfies 
the condition that F(a) = 1 for all a € ob R iff it is isomorphic to the right Kan 
extension of its restriction to the full subcategory S on all objects not in R. So 
by 4.5.3(ii) we may identify the closed subtopos of |C, Set] complementary to 
[R, Set] with [S, Set]. Note also that the condition that R should be a cosieve 
is equivalent to saying that S is a sieve, i.e. satisfies the dual condition that it 
contains any morphism whose codomain lies in S. Thus we have a natural (order- 
preserving) bijection between closed subtoposes of [C, Set] and open subtoposes 
of [C°P, Set], for any C. (However, this identification depends on the Booleanness 
of Set, as is clear from the use of complementation which we made in proving 
it: the analogous result for internal categories in a general topos is false.) 


There is a close relationship between the notions of open and closed subtopos 
and the glueing construction of 2.1.12. If F: € — F is a cartesian functor between 
toposes, we observed in 4.1.12 that we have geometric morphisms 


6 SG F 


whose inverse images are the two projections; and from the descriptions of the 
direct images which we gave there, it is clear that they are both inclusions. 
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Further, p is an open inclusion, since we observed after 2.1.12 that its inverse 
image functor is logical; the corresponding subterminal object of GIF) is easily 
seen to be (1,0,0 — 1). And if we form the closed local operator corresponding 
to this subterminal object, it follows at once from 4.5.3(ii) that the sheaves for 
it are exactly those (A, B, f) for which A = 1, i.e. (up to isomorphism) exactly 
the objects in the image of q.. So our original toposes € and F sit inside Gl(F) 
as complementary open and closed subtoposes; moreover, as we saw in 4.1.12, 
the original cartesian functor F may be recovered as the composite q*p,. (The 
composite p*q,, on the other hand, is simply the constant functor with value 1.) 
In the converse direction, we have 


Proposition 4.5.6 Let E be a topos, and U a subterminal object of E. Let 
L and M be the open and closed subtoposes corresponding to U, and write 
p:£L-€E, qa: M—E for the inclusions. Then E is equivalent to the topos 
obtained by glueing along the composite q*p,: L — M. 


Proof Itis clear that p and q may be combined into a single geometric mor- 
phism f: £ x M — E, whose inverse image sends an object A to (p* A, q* A), and 
whose direct image sends (B,C) to p, B x q.C. Moreover, from 4.5.3(i) we know 
that the monomorphisms in € which are dense for o(U) are precisely those which 
are closed for c(U), and in particular that any monomorphism which is dense for 
both local operators is an isomorphism. Hence A +> (p* A, q* A) preserves proper- 
ness of subobjects, and so by 1.2.4 it is conservative, i.e. f is a surjection. So it 
suffices to show that the comonad on £ x M induced by (f* 4 fa) is isomorphic 
to that induced by q*p,, as described in 4.2.4(c). But, for any object (B,C) of 
L x M, we have p*p,B = B and q*q,.C = C, since p and q are inclusions; and 
since C is a sheaf for the closed local operator, we have CU Œ (CxU)Y 2 UY = 1 
by 4.5.3(ii), i.e. p*q„C & 1. So we see that 


f* f.(B,C) = (p*(p.B x qC), "(Ds B x q.C)) 
(p*p.B x parC, pB x q*°qC) 
(B,q*p.B x C); 


IR IR 


it is now easy to verify that this isomorphism of functors is an isomorphism of 
comonads, i.e. that it commutes with the counits and comultiplications. O 


It will be observed that we have already encountered a particular instance of 
(the quasitopos version of) Proposition 4.5.6, in Proposition 2.6.7. 

We have seen in 4.5.1 that the associated sheaf functor for a local operator 
is cartesian closed iff it is logical. However, it can preserve Q without being 
logical; we next investigate the local operators for which this happens. (We shall 
discuss geometric morphisms whose direct images preserve © in the next section.) 
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First we need 


Lemma 4.5.7 For any local operator j on a topos E, Q; is an ‘internal 
exponential ideal’ in Q, i.e. the composite 


xQ > xn >A 
factors through Qj — Q. 
Proof By definition, the composite classifies the equalizer E — Q x Qj of 


TI 
Qx Q > Qx Z3 Q; 
A 


so we have to show that E is a closed subobject. But given a morphism 
(ġ, Y): A— Q x Qj, corresponding to a pair of subobjects A’ — A, A” >= A of 
which the second is closed, we see that (¢, y) factors through E iff A’ < A” 
in Sub(A), iff c;(A’) < A” in the lattice of closed subobjects of A. Since 
binary intersections of closed subobjects are closed by 4.3.3(iii), we deduce that 
E > Q x Qj is also the equalizer of 


3x1 m 
Q x 0; 1m 0; x Q; Eaa A; 
A 


where 7: Q —> Q; is the epic part of the image factorization of 7; but Q; is a 
j-sheaf by 4.3.7, so this equalizer is closed by 4.3.6(a) (iv). O 


Proposition 4.5.8 Let j be a local operator on a topos E€. The following 
conditions are equivalent: 


(i) The associated sheaf functor L:E — shj(E€) preserves the subobject 
classifier. 
(ii) The canonical monomorphism Q; — Q is j-dense. 
(iii) For any 6: A —> Q, the equalizer of @ and jọ is a j-dense subobject of A. 
(iv) Every monomorphism in E may be factored (not necessarily uniquely) as 
a j-closed monomorphism followed by a j-dense one. 


(v) j commutes with implication, i.e. the diagram 


QaxQ — 50 


commutes. 
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(vi) The diagram 


H<—-—_ 5 
4 
Q 


commutes. 


Proof (i) © (ii): Let 7: L(Q) —> Q; be the classifying map of L(T); then we 
have a diagram 
J ———> L(1) ———> 1 


L(T) 


eo) So: 
in which the right-hand square is a pullback by definition, and the left-hand 
square is a pullback by the remarks preceding 4.3.2. It follows that the composite 
Tno coincides with J, hence if we identify Q; with L(Q;) 7 becomes identified 
with L(7). In particular, 7 is an isomorphism iff Z(7) is an isomorphism; but 
since 7 is split epic, this holds iff L(Q, > Q) is an isomorphism, i.e. iff Q, — Q 
is j-dense. 

(ii) > (iii): The equalizer of jọ and ¢ is the pullback of 0; — Q along ¢; so 
it is dense if Q; — Q is. 

(iii) > (iv): If ¢ is the classifying map of A’ — A, then the equalizer A” — A 
of @ and jọ is the largest subobject which contains A’ as a j-closed subobject. 
So this is immediate. 

(iv) = (ii): Applying (iv) to the generic subobject T:1>— Q, we obtain a 
j-dense subobject B — Q containing T as a j-closed subobject. But then B — Q 
must factor through Q; — Q, so the latter is also dense. 

(iii) >(v): Since both ways round the given diagram factor through 0; — Q 
(the lower one by an application of Lemma 4.5.7), it suffices (since Q; is 
j-separated) to find a j-dense subobject E — Q x Q equalizing them. But (iii) 
implies that we may take Ẹ to be the intersection of the equalizers of the pairs 
(371,71), (jm, 72) and (j >>). 

(v)= (vi) follows from composing the two ways round the diagram of (v) with 
(7,1): Q — Q x Q, since the composites A: Q — Q x Q — Q factors through T. 

(vi) = (iii): Since any subobject is contained in its closure, we have 
A(jo, 6) = ¢ for any ¢: A — Q, and hence the composite 


m 
r en EE Ga 2G 
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classifies the equalizer of jọ and ¢. Condition (vi) says immediately that this 
equalizer is j-dense. oO 


Example 4.5.9 Let ~: Q — Q be the Heyting negation map, i.e. the classi- 
fying map of L: 1> Q. It is straightforward to verify that the composite ~~n 
is a local operator, i.e. that it satisfies the conditions of 4.4.1. Moreover, it 
satisfies the conditions of 4.5.8: to see this, observe that for any element x of 
a Heyting algebra H, we have x < (~~nt = x) and nar < (~nr = x) (the 
latter since (>x A vz) = L < 2), and so (~nz => x) > (x V ~nz); hence 
aA(-744 > T) > a(x Var) = T. But this is just the statement that the 
diagram in (vi) of 4.5.8 commutes. Alternatively, we could use condition (iv): 
given a subobject A’ — A, if we set A” = A’ U-—A’, then A’ — A” is ——-closed 
(since it is complemented) and A” — A is —7-dense (cf. the proof of 1.4.14). 

We note that the subtopos sh__(€) is Boolean; for if A is any ——-sheaf, 
its subobjects in sh__(€) are its ——-closed subobjects in £, and these form a 
Boolean algebra. It is easy to see that it is not an open subtopos in general; for 
example, if X is a To-space (such as R) in which no nonempty open subspace is 
discrete, then sh__(Sh(X)) cannot be open. We shall have more to say about 
Boolean subtoposes in 4.5.21 below. 


We write Lop(€) for the class of all local operators on a topos € (note that 
it is a set if € is locally small). Lop(€) carries a natural partial order, defined 
by jı < j2 iff A(ji, j2) = ji; this is equivalent to saying that J; < Jp in Sub(Q), 
or that Q; < Qj, or that sh; (£) C sh;, (E£) as subcategories of € (the more 
dense monomorphisms we have, the more conditions an object has to satisfy to 
be a sheaf). We shall see eventually that Lop(€) is a Heyting algebra; for the 


moment, we note 


Lemma 4.5.10 The partial ordering Lop(E) has greatest and least elements, 
and binary meets. 


Proof The composite Q — 1 +, Q is a local operator (it is the open local 
operator associated with the subterminal object 0), and is clearly the greatest 
element of Lop(€). Similarly, the least element is the local operator 1p. The 
binary meets are constructed ‘pointwise’ — that is, if j and k are local operators, 
so is the composite 


F 
E og LETN 


It is straightforward to verify that this composite satisfies the commutative dia- 
grams of 4.4.1; alternatively, one may verify that if c and d are universal closure 
operations on a cartesian category, so is the mapping A’+> c(A’)Nd(A’). O 
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We further note that if U is a subterminal object, then (o(V) A c(U)) = 1g, 
as claimed earlier; for in any Heyting algebra we have 


(usar) A (u V x) = ((u=2) Au) V ((u=>2) Az) 
= (u AZ) Ve Se 


by the identities of 1.5.11 and the distributive law. 

A problem which frequently arises is that of constructing the smallest local 
operator (in the ordering defined above) for which all the monomorphisms in 
a given class are dense. We next present a particularly elegant solution to this 
problem, due to A. Joyal. First we need a lemma characterizing the class of dense 
monomorphisms for a local operator. We introduce the notation =(D) for the 
class of monomorphisms whose classifying maps factor through a given subobject 
D of Q; thus if 7 is a local operator, =(J) is the class of j-dense monomorphisms, 
and =(Q) is the class of j-closed monomorphisms. 


Lemma 4.5.11 Let D — Q be a subobject of N in a topos E. Then 


(i) E(D) is stable under pullbacks. 
(ii) If T: 1+ Q factors through D (equivalently, if E(D) contains all isomor- 
phisms), then =(D) is stable under pushouts. 
(iii) The classifying map of D — Q is a local operator iff =(D) contains all iso- 
morphisms and satisfies the condition (mn € E(D) m € S(D) and n € 
=(D)) for all composable pairs of monomorphisms m, n. 


Proof (i) is immediate since if m is classified by ¢ then f*(m) is classified 


f' 


A’ —— P' 


A ————> B 


be a pushout diagram with m € 3(D). By 2.4.3, the square is also a pullback; 
so if @: B— Q is the classifying map of n, then @f classifies m, and so factors 
through D. But ọn classifies n*(n) = 1p, so it too factors through D; the result 
follows from the universal property of pushouts. 

(iii) If the classifying map d: Q — 2 of D — Q is a local operator, then =(D) 
is the class of d-dense monomorphisms, and it satisfies the given conditions by 
4.3.3. Conversely, suppose the conditions are satisfied. It suffices by 4.4.2 to 
show that the unary operation (c, say) on subobjects which sends the subobject 
classified by ¢ to that classified by dọ is a universal closure operation. But since 
=(D) contains isomorphisms, it is clear that we have c(A’) > A’; and in fact 
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c(A’) is the largest A” in Sub(A) for which A’ — A” is in =(D). Now the 
second condition on =(D) implies that the composite A’ — ¢(A’) > c(c(A’)) 
is in =(D), so c(c(A’)) < c(A’), i.e. c is idempotent. To show that c is order- 
preserving, consider two subobjects A’ — A and A” > A with A’ < A”. Form 
the diagram 

c( A’) eA) > eA") 


c(A') >——> ¢(A’) Uc(A”) 


Now since A’ < A” < c(A”), the monomorphism A’ > c(A’) factors through r, 
som € =(D). And the square is a pushout by 1.4.3, so by (ii) we have n € E(D). 
But now the composite A” — e( A”) — c(A’) U ce( A”) is in S(D), so n must be 
an isomorphism; that is, c(A’) < c(A”). Finally, it is clear from the form of the 
definition that c is stable under pullback. o 


Next we define a binary relation Ô on Sub(A), for each object A of £, by 
saying that 0(A’, A”) holds iff (A’ => A”) & A”. We note that this relation always 
holds if A’ is dense for some local operator j and A” is closed for the same local 
operator; for we have A'N (A’= A”) < A” < (A' = A”), and ÆA N (A = A”) is 
j-dense in (A’=> A”). Note also that 0(A’, A’) holds iff A’ is the whole of A, since 
(A = A’) & A. It is clear that, if we define © — Q x Q to be the equalizer of > 
and 72, then a morphism (¢ġ, Y): A — Q x Q factors through © iff the subobjects 
classified by ¢ and ~ satisfy the relation 8. Hence in particular if j is a local 
operator, then J x Q; < © in Sub(Q x Q). 

We now use O to define an ‘internal Galois connection’ on Sube (Q), as follows: 
if D — Q, we define D” — Q to be 


Vang ((7](D) 38) 3 2x Q) 


where m, and 72, as usual, denote the product projections Q x Q — Q. (Equiv- 
alently, D” is the unique largest subobject of Q such that D x D” < © in 
Sub(Q x N).) And we define D! — Q by the same formula with 7, and 72 
interchanged. It is straightforward to verify that a monomorphism m: A’ — A 
belongs to =(D") iff, for every f: B — A and every n: B’ ~ B in (D), the 
relation 0(n, f*(m)) holds. 


Proposition 4.5.12 (i) The mappings D |> D" and D+ D! form a Galois 
connection from Sub(Q) to itself; that is, they are order-reversing and adjoint to 
each other on the right. 

(ii) If j is a local operator on E, then J” = Qj and (Q;)' & J. 

(iii) A subobject D — Q satisfies D"! ~ D iff its classifying map is a local 
operator on E. 
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Proof (i) is immediate from the description of D” given above, and the 
corresponding description of D}. 

(ii) We have already observed that if j is a local operator then J x Q; < 9, 
so we have Q; < J" and J < (Q,). 

To show that we have equality in the first case, suppose (A’ — A) € S(J"). 
Let A” — A be the j-closure of A’; then A’ — A” is the pullback of A’ — A 
along A’ — A”; so it too belongs to =(J"). But it also belongs to S(J), since it is 
4-dense; so it stands in the relation ô to itself, and hence must be an isomorphism, 
i.e. A’ ~ A is closed. Applying this argument to the subobject of J” classified by 
the monomorphism J” >> Q, we deduce that the latter must factor through 2. 

Similarly in the second case, suppose that (B’ — B) € =((Q;)'). Let B” — B 
be the j-closure of 8’; then since it belongs to (Q;) we have 6(B’, B”); that 
is, B” S (B’ => B”) = B since B’ < B". So B’ > B is j-dense. Once again, 
applying this in the generic case B = (Q,)! yields (Q;)! < J. 

(iii) If the classifying map of D > Q is a local operator, then D S D 
follows from (ii). Conversely, suppose Dt = D; we shall show that the class 
=(D) satisfies the conditions of 4.5.11(iii). 

If m: A’ > A is an isomorphism, then it is clear that 0(A’, A”) holds for any 
A"; hence m € =(E') for any E, and in particular m € =(D") = =(D). Now 
suppose we have a composable pair n: A” — A’, m: A’ ~ A of monomorphisms 
in =(D). Then for any p: A” — A in E(D"), we have O(m, p) and 0(n,m*(p)). 
from which it is easy to deduce that #(mn,p) holds. And this remains true 
after pullback along any f: B — A, so we deduce that mn € =(D™) = E(D). 
Conversely if mn € =(D), then it is clear that n € S(D), since n = m*(mn); 
and for any p: A” >=> A in E(D"), we can deduce 0(m,p) from 0(mn, p). Once 
again, this remains true after pullback, so we have m € =(D™) = E(D). O 


Corollary 4.5.13 Let D—Q be a subobject of N in a topos. Then 


(i) There is a unique smallest local operator j such that all the morphisms in 
E(D) are j-dense. 

(ii) There is a unique largest local operator k such that all the morphisms in 
=(D) arè k-closed. 


Proof (i) We take j to be the classifying map of D"! >=> Q; the result is 
immediate from 4.5.12(i) and (iii). 
(ii) Similarly, we take k to be the classifying map of D! — Q. o 


Examples 4.5.14 (a) We may now verify that Lop(€) has binary joins, as 
well as the binary meets constructed earlier; we simply define 71 V jz to be the 
smallest local operator for which all monomorphisms in =(J; U J2) are dense. 
(Note in passing that the closed monomorphisms for jı V j2 are exactly those 
which are closed for both jı and j2, since 2;,vj, S (J1 U Jo)” & (JI N JG). It 
follows that if 7; and jz are the open and closed local operators associated with 
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a subterminal object U, then 7; V jz is the top element of Lop(€); for we saw in 
4.5.3(i) that every j2-closed monomorphism is j,-dense, and so a monomorphism 
which is closed for both local operators must be an isomorphism.) 

(b) Let m: A’ + A be a single monomorphism in a topos E. If we apply 
4.5.13(i) to the image 7 — Q of the classifying map of m, we obtain the smallest 
local operator on € for which m is dense. (Dually, we may obtain the largest 
local operator for which m is closed, using 4.5.13(ii).) For example, if U is a 
subterminal object of £, it may easily be verified that o(U) is the smallest local 
operator for which U — 1 is dense, and c(U) is the smallest for which 0 — U is 
dense. 

(c) Let f: A — B be a morphism in a topos. By applying (b) to the image 
of f (resp. the diagonal map A ~ A xg A), we may obtain the smallest local 
operator for which L(f) is epic (resp. monic), where L is the associated sheaf 
functor, as usual. And by taking the join of these two, we may obtain the smallest 
local operator for which L(f) is an isomorphism. 

(d) Similarly, by applying the dual of (b) to the diagonal map A — A x A of 
an object A of E, we may obtain the largest local operator on E for which A is 
separated, by 4.3.6(iv). 

(e) Let f: F — E be a geometric morphism, j a local operator on €. Then 
there is a unique smallest local operator k on F such that the composite 
sh,(F) — F — E factors through sh;(€) — E. For such a factorization exists, 
by 4.3.12, iff f*(m) is k-dense for every j-dense monomorphism m; so we simply 
have to apply 4.5.13(i) to the image of the composite 


Ls fa Se 


where 7 is the classifying map of f*(T). The resulting local operator k is some- 
times called the pullback local operator, since it is easy to see using 4.3.11 that 


the square 
sh (F) ——> sh,(€) 


f 


F ———> E 


is a pullback in Yop. For, given a topos G, a cone over the right and bottom edges 
of this square with vertex G corresponds to a geometric morphism g: G — F such 
that g* f* maps j-dense monomorphisms to isomorphisms; but if it does so, then 
the local operator k’ on F which corresponds to the image of g (in the sense of 
4.2.10) must satisfy k’ > k, and so g factors through shy(F) > F. 

(f) We have already seen that Lop(€) is a lattice; we now show that it is a 
Heyting algebra (and hence in particular a distributive lattice). To do this, let 
ji and jz be local operators, and define 73 to be the largest local operator such 
that every monomorphism which is both 7;-dense and j2-closed is 73-closed, i.e. 
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set J3 = (Jı N Q; )!. We shall show that j3 = (jı => jz) in Lop(€). Clearly, 
if k is any local operator such that k A jı < je, then any A’ >> A which is 
both j;-dense and j2-closed must be k-closed, for the inclusion A’ — A” of 
A’ in its k-closure would be both k-dense and jı-dense (and hence j2-dense), 
and also j2-closed. Hence any such k satisfies k < j3; we thus have to prove 
that j3 itself satisfies 73 A jı < j2. Suppose m: A’ — A is both j3-dense and 
ji-dense. If n: A” — A is any 72-closed monomorphism, let p: A” — A be its 
ji-closure, and q: A” >= A’ the factorization of n through p. Then we have 
O(m, p) since m is jı-dense and p is j,-closed, and 0(p*(m), q) since p*(m) is 
js-dense and q = p*(n) is both ĵjı-dense and j2-closed. From these it follows that 
we have 0(m,pq); and the same argument works if we first pull back m along 
some f: B + A, so m € ((9;,)') = (J2). 


We saw in 4.5.14(d) that there is a largest local operator for which a given 
object A is separated. The same result holds true if we replace ‘separated’ by ‘a 
sheaf’, but it does not seem possible to derive it directly from 4.5.13 (unless A 
happens to be subterminal, in which case we may take the largest local operator 
for which A > 1 is closed - cf. 4.5.21 below). 


Proposition 4.5.15 For any object A of a topos E, there is a largest local 
operator on E for which A is a sheaf. 


Proof Let y: Ax Q — A represent the partial map 


AxQ 


and let r: A — Q classify A ~ A. Note that 4 is the case J = Q of the morphism 
considered in 4:4.10; so we want to find the largest subobject J of Q such that 
the pullback of Y: 2* A — r along J — Q is an isomorphism. To do this, form 
the pullback 


Vv, w 
E o E 


u {} 
Q 


APA P(AxQ) 
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Now a map f: B — A factors through u iff there exists g: B — A such that 


1 
Pe 


(1B,9,TÎ) (1s, f) 


peasan T BxA 
is a pullback, since the composite Py)o{}o f names the pullback of the graph of f 
along 1g x w (cf. 2.2.2(i)). But this holds iff, for every h: C — B, the composite 
gh is the unique morphism C — A extending the partial map C — A represented 
by fh. Now if we define J ~ Q to be V;(W — A), then a morphism x: B —> Q 
factors through J iff, for all h: C — B, the subobject C” — C classified by xh 
has the property that each morphism C’ — A extends uniquely to a morphism 
C — A. It follows easily that the class E(J) satisfies the conditions of 4.5.11 (iii), 
so that the classifying map of J — Q is a local operator; and it is clearly the 
largest local operator for which A is a sheaf. O 


Corollary 4.5.16 If 7, and jz are two local operators on E, then shi; vja (E) = 
sh;, (£) N sh;, (E) and seP; vj (E) = sep,, (E) N sep, (E). 


Proof An object A of € is a sheaf for jı V jz iff the local operator j constructed 
in 4.5.15 satisfies 7 > (j1 V j2), iff A is a sheaf for both 7; and jg. The second 
assertion follows similarly from 4.5.14(d). o 


In general, there is no simple formula for the join of two local operators, 
comparable to that for their meet which we gave in 4.5.10. However, the following 
is sometimes useful. 


Lemma 4.5.17 Letj and m be two local operators on a topos E. The following 
conditions are equivalent: 
(i) The join of j and m in Lop(E) is the composite mj. 

(ii) mj is a local operator. 

(iii) mj is idempotent. 

(iv) mj > jm in the canonical partial ordering on E (Q, Q). 
Proof Clearly (i) implies (ii), but (ii) implies (i) since we have lg < j, lo < m 
and hence m < mj, j < mj in E€ (Q,Q), and 

mj < (j V m)(J V m) = (j V m). 


(ii) > (iii) is again immediate, and the converse holds since it easily seen that mj 
always satisfies the first and third commutative diagrams of 4.4.1. (iii) implies 
(iv) since we have mj = mjmj > 19jm1g; and (iv) implies (iii) since we have 
mjmj <mmjj = mj, the reverse inequality being trivial. 
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The conditions of 4.5.17 are easily seen to hold if either m is an open local 
operator, or 7 is closed. 

Although calculating the join of two local operators 7 and m as a local 
operator on € is.in general a hard problem, it turns out that calculating the 
local operator k on sh,,(€) which corresponds to the subtopos shjyvm(E) = 
sh; (£) N shm(E) is often easy. The latter is of course the pullback of j along 
the inclusion f: sh,,(€) — E, in the sense of 4.5.14(e), so we know that it is the 
smallest local operator for which the corresponding K >> Qm contains the image 
of f*J = f*Q — Nm. But in many cases it turns out that the classifying map 
of this image is already a local operator. 


Lemma 4.5.18 Let j and m be local operators on a topos E, and assume one 
of the following: 


(i) The pair (j,m) satisfies the conditions of 4.5.17; 
(ii) m is closed; 
(iii) m satisfies the conditions of 4.5.8. 


Then the classifying map of the m-closure of the image of the composite 


Pei yet ae 


is a local operator k in sh,,(E€), and it is the pullback of j (in the sense of 
4.5.14(e)) along the inclusion f:sh,,(E€) —> E. 


Proof The second assertion is immediate from the first, since the closure of 
the image of J — Nm is exactly the image of the composite f*J = fN — Nm 
in shm (€E). Let us write K — Qm for this subobject: then it is easy to see that a 
(necessarily m-closed) monomorphism B’ — B in shm(£) belongs to =(K) iff it 
can be factored in £ as B’ ~ B” >=> B where B’ — B” is j-dense and B” — B 
is m-dense. (Specifically, we take B” — B to be the pullback along B — K of 
the m-dense monomorphism 7 >> K, where I >> Qm is the image of J — Qn.) 
We have to show that =(K) satisfies the conditions of 4.5.11 (iii); but they are 
all trivial except for stability under composition, for which we argue differently 
in the three cases, as follows. 

(i) The description just given says that =(K) consists of those monomor- 
phisms B’ — B whose classifying maps ¢: B > Q in E satisfy mjọ = TB. If 
B’ — B is a composite of two such monomorphisms, then its classifying map 
satisfies mjmj@ = TB; but the conditions of 4.5.17 say that this is equivalent 
to mjd = TB. 

(ii) If m is a closed local operator, then it follows from 4.5.3(i) that any 
subobject containing an m-closed one is m-closed. So the description of =(k) 
simplifies still further: it consists of those monomorphisms in sh,,(€) which are 
j-dense. As such, it is clearly closed under composition. 
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(iii) Here we argue slightly differently: in this case the inverse image functor 
f*: E — sh,,(E€) preserves Q, by 4.5.8(i), and the isomorphism f*Q = 2, iden- 
tifies K — Qm with f*J — f*Q. Since f* preserves pullbacks, it also identifies 
the classifying map k of K — Q,, with f*(j); but the latter clearly inherits the 
conditions of 4.4.1 from j, so it is a local operator. oO 


Next, we investigate how the open and closed local operators sit inside the 
lattice Lop(€). 


Lemma 4.5.19 (i) The mapping c: Sube(1) — Lop(E€) is a lattice homomor- 
phism, and has a right adjoint ext. 

(ii) The mapping 0: Subg(1)°? — Lop(€) is a lattice homomorphism, and 
has a left adjoint int. 


Proof (i) The fact that c preserves finite meets follows easily from 4.5.10 and 
the distributive law in Q; it preserves finite joins by the remark after 4.5.17. 
Given j, we define ext(/) (the exterior of j) to be the j-closure of 0 > 1 (i.e. the 
subobject classified by jL). It is clear that ext(c(U)) S U; and for any 7 we have 
c(ext(j)) < j, since for any subobject A’ — A we have both A’ < c,(A’) and 
c;(0 — A) = (A x ext(j)) < c;(A’), where c; is the universal closure operator 
corresponding to 7. So the adjunction is established. 

(ii) The fact that o is a lattice homomorphism may be verified directly as 
in (i), or alternatively deduced from (i) and the fact (which we have already 
observed) that c(U) and o(U) are complementary elements of Lop(€). For the 
adjunction, we take the interior int(j) of j to be I1;(d), where d: 1 — J is the 
generic j-dense subobject (cf. 4.5.1). Now we have U < int(7) iff J*(U) < din 
Sub(J); but this is equivalent to saying that every j-dense subobject is o(U)- 
dense, i.e. that 7 < o(U). m) 


The local operator c(ext(j)) is of course called the closure of 7. We note that 
the inclusion sh; (E) — sh,,ex¢(;)) (E) corresponding to the inequality c(ext(j)) < 
j has the property that its direct image preserves the initial object (equivalently, 
the local operator on sh,(ex¢(;))(E) which corresponds to it has exterior 0), since 
the initial object of sh;(€) is simply ext(j). Of course, we call a subtopos or 
local operator dense if it has this property; we may thus conclude 


Corollary 4.5.20 Any geometric inclusion E’ — E has a unique factorization 
E’ — E" — E, where E' — E” is dense and E" — E is closed. 


Proof The existence has been discussed above. For the uniqueness, let j and 
7’ be the local operators on € corresponding to €’ and E”; then the denseness 
of the first inclusion is equivalent to the equality ext(j) = ext(j’), and so the 
closedness of the second implies that 7’ = c(ext(j)). o 


We note that the local operator ~~ of 4.5.9 is dense. Indeed, it is the unique 
largest dense local operator on E (i.e. sh__(€) is the smallest dense subtopos): 
for if 7 is a dense local operator, then c; preserves disjointness of subobjects by 
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4.3.3(iii), and so c;(A’) NAA’ < c;(A’) Ne;j(7A’) = 0, whence c;(A’) < ~ans for 
any A’ >= A. 


We conclude this section by discussing the relation between local operators 
and Booleanness. For this purpose, we introduce a third family of local operators 
indexed by the subterminal objects of €: given a subterminal object U, we define 
the guasi-closed local operator g(U) associated with U to be the composite 


(u, u) 


1x =>xl => 
Q SQ x 1 — 2x N x 2 — Q x N ——S 2 


where u, as before, is the classifying map of U >» 1. (In particular, we 
note that g(0) is the local operator ~~ considered in 4.5.9.) It is again a 
straightforward exercise in the algebraic theory of Heyting algebras to verify 
that this map satisfies the commutative diagrams of 4.4.1. Further, we have 
ext(q(U)) = U = ext(c(U)), so we have a composite inclusion 


shau) (E) => shv) (£) —— > E 


as in 4.5.20; it is easily seen that this factorization identifies shgiy)(€) with 
sh__(sh,(y7)(E)), and hence in particular shy) (E) is Boolean. 


Lemma 4.5.21 Let j be a local operator on a topos E. Then sh;(€) is Boolean 
uf 7 = q(U) for some subterminal object U of E. 


Proof One direction follows from the remarks above. Conversely, suppose 
sh;(€) is Boolean. By factoring the inclusion sh;(€) — E as in 4.5.20, we may 
reduce to the case when j is dense; in this case, we have to prove that j = ~>. 
Now it is easy to see that a composite of two inclusions is dense iff each of the 
two factors is dense; so, for any dense local operator 7, the smallest dense subto- 
pos sh__(sh;(€)) of sh; (E) coincides with the smallest dense subtopos sh__(€) 
of E. But if sh;(€) is Boolean, its double-negation operator is the identity; so it 
coincides with sh__(sh,;(€)) and hence with sh_.(€). Hence j = =~. Oo 


We note that, since q(U) is the largest local operator for which the inclusion 
shay) (E) > sh,is)(E) is dense, it may also be characterized as the largest local 
operator on E for which U > 1 is closed (equivalently, for which U is a sheaf). 


Proposition 4.5.22 For a topos E, the following are equivalent: 
(i) E is Boolean. 

(ii) Every subtopos of E is Boolean. 

(iii) Every subtopos of E is quasi-closed. 

(iv) Every subtopos of E is closed. 

(v) Every subtopos of E is open. 


Proof (i)=> (ii): Suppose £ is Boolean; let j be a local operator on E, and Aa 
j-sheaf. We have to show that the lattice of j-closed subobjects of A is Boolean. 
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But if A’ > A is such a subobject, let A” — A be its complement in Subg (A); 
then A’ c;(A”) = c;(A’) Nc;(A”) = cA N A”) = c;(0) is the least element of 
Subsp,(e)(A), and A’ Uc;(A") > AUA” = A. 

(ii) > (iii) follows immediately from 4.5.21. 

(iii) = (iv): If (ii) holds, then every closed subtopos is quasi-closed; but this 
implies that c(U) = q(U) for every subterminal object U. So every subtopos is 
closed. 

(iv) = (v): Similarly, if (iv) holds, then every open subtopos is closed; but 
if o(U) = c(V) then U and V must be complementary elements of Sub(1), and 
so c(U) = o(V). Hence every closed subtopos is open, so (iv) implies that every 
subtopos is open. The converse is similar. 

(iv) > (i): If (iv) holds, then the double negation operator ~~ is closed; but 
its exterior is 0, so it must be c(0) = lg. So E is Boolean by 1.4.14. a 


We note that, in a Boolean topos €, a subobject of Q is classified by a local 
operator iff it contains T; this may be used to provide an alternative proof of 
the isomorphism Lop(€) = Sube (1). (It also shows that the conclusion of 4.5.18 
holds whenever the subtopos shm (E) is Boolean; but, in view of 4.5.21 and 4.5.9, 
this fact may also be deduced from parts (ii) and (iii) of 4.5.18.) 

As an application of quasi-closed local operators, we prove an important 
result, due (in this form) to P. Freyd [371]: 


Proposition 4.5.23 For any topos E, there erists a surjection f: F — E where 
F is Boolean. 


Proof We consider the generic subobject T: 1 > Q as a subterminal object 
of €/Q, and form the topos F = shg+)(€/). By 4.5.21, F is Boolean, and it 
clearly admits a composite geometric morphism to £E: 


where the second factor is induced by the unique morphism 2 — 1 in E. We 
shall show that this morphism is a surjection. 
Suppose given a proper monomorphism m: A’ > A in E. We may similarly 
form the composite geometric morphism 
fm: Shg¢m)(E/A) ——> EJA ———> E. 
Now in €/A we have a pullback square 


m ———> A*A’ 
A*m 


1, ———> A*A 
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where the bottom edge is the diagonal map A — Ax A. Since both m and 14 are 
q(m)-sheaves, the left-hand edge of this square remains a proper monomorphism 
when we apply the associated q(m)-sheaf functor to it; and since this functor 
preserves pullbacks, the same applies to the right-hand edge, i.e. f} (m) is not 
an isomorphism. 

Now let ¢: €/A — €/Q be the geometric morphism induced as in 4.1.2 by 
the classifying map of m. We claim that there is a commutative diagram 


Shg(m)(E/A) -_> shyt) (E/Q) 


EJA —— En 


To prove this, it suffices by 4.3.12(iii) to show that ġą = Ilẹ sends m to a 
q(T )-sheaf, since g(m) is the largest local operator for which m is a sheaf. But 
IIg(m) S Wg¢*(T) = T? in E/Q, so this follows from the fact that sheaves form 
an exponential ideal (4.4.3). Hence fm factors through fr = f; so the fact that 
f%,(m) is not an isomorphism implies that f*(m) is not an isomorphism. But 
this is true for any m; so by 1.2.4 f* is conservative, i.e. f is a surjection. O 


We remark that if € itself is Boolean, then the geometric morphism of 4.5.23 
is an equivalence, since in this case we have Q = 11f1 and q(T) =c(T) = o( 1) 
(cf. 4.5.22). 

Having considered geometric morphisms from Boolean toposes to a given 
topos, it is natural to ask what can be said ‘in the opposite direction’: given 
a topos €, does there exist a ‘nice’ geometric morphism € — F where F is 
Boolean? Clearly, there can be no such inclusion unless € itself is Boolean, by 
4.5.22(ii); but in general, there may be no such morphism at all. The following 
argument is due to R. Paré [929]: 


Example 4.5.24 Let € be the topos obtained by glueing along the inclu- 
sion functor Set; — Set (cf. 2.1.12). Suppose we have a geometric morphism 
f:E— F with F Boolean. By 4.2.10 and 4.5.22(ii), we may assume without 
loss of generality that f is surjective. Since the objects of F are all decidable 
(cf. 1.4.15), every object of the form f*B is decidable in E£; hence, as we observed 
in 4.2.4(d), it must be an injection between finite sets. It follows that, for any 
two objects B, B’ of F, the set E(f*B, f*B’) must be finite, and since f* is 
faithful it follows that F (B, B’) is also finite. But, for any set A, let A denote 
the object (1, A, A — 1) of £; then we have 


F (1, fA) S E(f*1, A) S A, 


so we obtain a contradiction by taking A to be infinite. 
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A very similar argument (but with the word ‘finite’, as applied to the size 
of hom-sets, replaced by ‘of cardinality at most that of the continuum’) may be 
used to show that the effective topos Eff studied in Chapter F2 also does not 
admit any geometric morphism to a Boolean topos. 


Suggestions for further reading: Barr [71], Borceux & Kelly [151], 
Freyd [371], Tierney [1169], Wraith [1236]. 


A4.6 The hyperconnected-—localic factorization 


In Theorem 4.2.10 we saw that every geometric morphism factors (essentially 
uniquely) as a surjection followed by an inclusion. There are several other fac- 
torization theorems for geometric morphisms, some of which will be discussed in 
Chapter C3, but one in particular deserves to be mentioned now. 

If A and B are objects of a topos, we say A is a subquotient of B if there is 
a diagram of the form 


C A ; 


B 


i.e. Ais a quotient of a subobject of B. Since monomorphisms and epimorphisms 
are stable under pullback and pushout in a topos, this is equivalent to saying 
that A is a subobject of a quotient of B, i.e. that there is a diagram of the form 


A 


B D 


It also follows that the relation of being a subquotient is transitive. Yet another 
way of expressing the condition is to say that there exists a relation ¢: B% A 
(namely, that tabulated by the first displayed diagram above) such that ¢°¢ 
is the identity relation ¿4, in the notation of Section A3.1. (Such a relation is 
commonly called a partial surjection.) 


Definition 4.6.1 We say a geometric morphism f: F — E is localic if every 
object of F is a subquotient of one of the form f*A, where A is an object of E. 


The definition is a special case of the notion of a bounded geometric mor- 
phism, which we shall meet in B3.1.7; it is equivalent to saying that 1 is a bound 
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(in the sense defined there) for F over €. The reason for the name ‘localic’ will 
become apparent in C1.4.7. 


Examples 4.6.2 (a) Every inclusion is localic, for if f is an inclusion then 
every object of its domain is isomorphic to one of the form f* A. More generally, 
if f, is merely faithful, then the counit f* f,B — B is epic for all B, and so f is 
localic. 

(b) Let f: A — B be a morphism in a topos E. The direct image Il; of the 
induced geometric morphism €/A — €/B is not in general faithful; but the left 
adjoint LU, of f* is, and so the unit g — f*Xyg is monic for all objects g of E/A. 
Thus all such morphisms are localic. 

(c) Let f:C + D be a functor between small categories. If f is faithful, 
then the induced geometric morphism {C,Set] — [D, Set] of 4.1.4 is localic. 
For every functor C — Set is a quotient of a coproduct of representable func- 
tors; if f is faithful then the representable functor C(A,—) is a subfunctor of 
f*(D(f(A), —)); and f* preserves coproducts. The converse is also true: if 
C(A,—) appears as a subquotient of some f*(F), then (being projective) it 
actually occurs as a subobject of f*(F), and this can only happen if there exists 
x E€ F(f(A)) such that F(fa)(x) #4 F(f8)(x) whenever a, 8: A = B are distinct 
morphisms of C — which in particular forces fa # ff. 

(d) In particular, if C is a preorder (so that the unique functor from C to the 
terminal category 1 is faithful), then the unique geometric morphism |C, Set] — 
Set of 4.1.9 is localic. 

(e) It is easy to verify that a composite of localic morphisms is localic, 
since the subquotient relation is transitive and inverse image functors preserve 
monomorphisms and epimorphisms. So, combining (a) and (d), we see that if 
(C,T) is a small site whose underlying category is a preorder, then the unique 
geometric morphism Sh(C, T) — Set is localic. (We shall prove a converse to 
this result in B3.3.5.) In particular, for any topological space X, Sh(X) — Set 
is localic. Similarly, combining (a) and (b), we note that the surjection with 
Boolean domain constructed in the proof of 4.5.23 is localic. 

(f) It is even easier to verify that, if 


9 f 


G ————> F —— oF 


is a composable pair of geometric morphisms and the composite fg is localic, 
then g is localic. Hence if F and G both admit localic morphisms to Set, then 
any geometric morphism between them is localic. For example, the geometric 
morphism Sh(X) — Sh(Y) induced by a continuous map of spaces X — Y, as 
in 4.1.11, is always localic. 


Now let f: F — E be an arbitrary geometric morphism, and let G be the full 
subcategory of F whose objects are all subquotients of objects of the form f*A. 


Lemma 4.6.3 (i) G is coreflective in F. 
(ii) G ts a topos. 
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Proof (i) If an object B of F belongs to G, there is a canonical way of choosing 
an object A such that B is a subquotient of f*A, namely A = f,B, where B is 
the partial-map representer for B (cf. 2.4.7). For if we are given a diagram 


€e 


C B 


fA 


there is a unique morphism a: f*A — B making the appropriate square a pull- 
back; and then a factors through the counit map f* f,B — B, which induces a 
factorization of e through the pullback 


P——>B 


f* f.B ——> B 
and hence forces the top edge of the above square to be epic. 

Now, given an arbitrary object B of F, let us form the pullback P as above, 
and consider the image factorization P +» GB — B. G is clearly a functor 
F — F, since all the constructions involved in its definition are functorial, 
and it takes values in G since GB is a subquotient of f*f,B. Moreover, the 
monomorphism GB — B defines a natural transformation from G to the identity, 
which is an isomorphism precisely when B is in G. It follows immediately from 
this that GB is the coreflection of B in G. 

(ii) It suffices to show that G is closed under finite limits in F (equiva- 
lently, that the inclusion G — F preserves finite limits), since this implies that 
the coreflector on F corresponding to G is cartesian, and allows us to apply 
Theorem 4.2.1. But G is clearly closed under equalizers, since it is closed under 
arbitrary subobjects, and it contains 1 since f* preserves 1. Finally, if By and By 
are subquotients of f*A; and f*A2 respectively, then Bı x Bz is a subquotient 
of f*(A, x Ag) S f* Ai x f* Ae. 0 


We define a geometric morphism f: F — E to be hyperconnected if it restricts 
to an equivalence between E and the topos G constructed above; equivalently, if 
f* is full and faithful, and its image in F is closed under arbitrary subobjects 
and quotients. (The reason for the name ‘hyperconnected’ will become apparent 
in C1.5.7, when we introduce the class of connected morphisms.) For example, 
if G is a topological group, then the morphism [G, Set] — Cont(G) of 4.1.6 is 
hyperconnected, since every subobject and every quotient of a continuous G-set 
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is continuous. We shall give further examples of hyperconnected morphisms in 
4.6.9 below. 


Lemma 4.6.4 Suppose given a commutative square (up to isomorphism) 


es aay 
h 9 
f 


F ——> E 


in Top, where g is localic and h is hyperconnected. Then there exists (uniquely 
up to unique isomorphism) a morphism l: F —G making the two triangles 
commute. 


Proof Let B be an object of G. Then B is a subquotient of g*A for some 
A, whence k*B is a subquotient of k*g*A = h*f*A. Since the image of h* is 
closed under subquotients, it follows that k*B lies in it, and so (since h* is 
full and faithful) we get a unique factorization k* = h*l*, where [* is a functor 
G — F. As in the proof of 4.3.11, we now see that l, = k,Ah* is a right adjoint 
for [*; moreover, [* preserves finite limits (because h* creates them), and the 
isomorphism h*l*g* = k*g* = h* f* yields an isomorphism [*g* = f* because 
h* is full and faithful. This establishes the existence of l; the uniqueness again 
follows from the fullness and faithfulness of h*. g 


Theorem 4.6.5 Any geometric morphism can be factored, uniquely up to 
equivalence, as a hyperconnected morphism followed by a localic one. 


Proof The existence of the factorization was mostly established in 4.6.3: given 
a morphism f: F — E, we let G be the topos defined there, h: F — G the geo- 
metric morphism whose inverse image is the inclusion G — F, and g: G — E the 
morphism whose inverse image is the factorization of f* through this inclusion 
(its direct image, ‘once again, being the composite f.h*). Then it is easily seen 
that h is hyperconnected, that g is localic and that f = gh. The uniqueness 
follows straightforwardly from 4.6.4. O 


There are a number of alternative characterizations of hyperconnected 
morphisms which are of interest; we now list them. 


Proposition 4.6.6 Let f: F — E be a geometric morphism. The following are 
equivalent: 
(i) f is hyperconnected. 
(ii) f* is full and faithful, and its image is closed under subobjects in F. 
(iii) f* is full and faithful, and its image is closed under quotients in F. 
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(iv) The unit and counit of (f* 4 f.) are both monic. 


(v) fa preserves 2, i.e. the comparison map T: f.(QF) > Ne (the classifying 
map of fa(TF)) is an isomorphism. 
(vi) For each object A of E, f* induces an equivalence Subg(A) ~ Sub¢(f* A). 


Proof (i) = (ii) and (i) = (iii) are trivial. 

(iii) > (iv): If f* is full and faithful, then the unit of (f* 4 f,) is an isomor- 
phism, and hence monic. Now consider the image factorization f* f,B— I = B 
of the counit cg: by assumption I is in the image of f*, so by the universal 
property of eg the epic part of this factorization must be an isomorphism, i.e. 
€g is monic. 

(iv) > (iii): Let 7 and e be the unit and counit of (f* 4 f,). From one of 
the triangular identities and the fact that € is monic, we deduce that f*7 is 
an isomorphism; but since 7 is monic f* is conservative (cf. 4.2.6), and so 7 
is an isomorphism, i.e. f* is full and faithful. Now we may identify € (up to 
equivalence) with a coreflective subcategory of F; if B is a quotient of an object 
in this subcategory, then we see that the counit €g: f* f,B — B must be epic as 
well as monic, and so B is in the subcategory. 

(iii) => (v): Let us identify € with a coreflective subcategory of F, and write 
G for the coreflector f* f,. Then the square 


Cl eae 


1 
G(T) T 


G 
EQ 
E =50 


commutes by naturality of €, and since €g is monic by (iv) (and G preserves 1) 
it must be a pullback. Now if A’ — A is a subobject in €, the classifying map 
ġ: A-+Q of A’ in F factors uniquely through cg; and in the diagram 


AC) aed 


G(T) T 


p ED 
Ae SCO) so 


the outer and right-hand squares are pullbacks, whence the left-hand square is 
a pullback. But the requirement that this square be a pullback determines eo@, 
and hence ¢, uniquely; thus G(Q) is a subobject classifier for E. 
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(ii) > (v): Again, we identify E with a coreflective subcategory of F, and 
write G for the coreflector. This time we form the pullback 


P 1 
G(Q) ——> 2 

and observe that P, being a subobject of G(Q), belongs to €. As in the previous 
part, we see that every subobject in € is uniquely expressible as a pullback of 
P > G(Q), ie. the latter is a generic subobject in E. So by 1.6.1 P is a terminal 
object of E, and hence of F; thus P > G(Q) is isomorphic to G(T) in Sub(G(Q)) 
(since the latter factors through the former, and both have domain 1). So G(T) 
is a generic subobject in £, i.e. G preserves 2. 

(v) > (vi): It is immediate from (v) (and the adjunction (f* 4 f,)) that 
we have a bijection between (isomorphism classes of) subobjects of A and of 
f*A, for any object A of €. However, as in 2.4.8, it requires proof that this 
bijection is induced by applying f* to monomorphisms in €. (Unfortunately, we 
cannot employ the argument in the proof of 2.4.8, since we do not know that 
f. preserves partial-map representers.) Let 7’: f*Q¢ — Nr be the comparison 
map for f* (i.e. the classifying map of f*(T¢)); then the bijection induced by 
the fact that f, preserves Q sends the subobject classified by ¢: A — Qg to the 
subobject classified by the composite 


——_> 


fo pe 
FAS Spa Se SG 


whereas its image under f* is the subobject classified by 


Fay ee RN eer 


So we need to show that the diagram 


FFF Dp *Qe 
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commutes. Since the square 
€l 
a 
FFT qT 


* EQ 
F Rar — AF 


commutes (though we don’t yet know it is a pullback), the subobject of f* f,Q¢ 
classified by eg contains that classified by 7’(f*7), and hence the subobject 
of f*A corresponding under the bijection to A’ — A contains f*A’. Thus 
f* preserves properness of subobjects; since it also preserves equalizers, it is 
conservative by 1.2.4, and so the unit 7 of (f* 4 f) is monic. Now, in the 
diagram 


m 
ipa GE lp le eee 


Te ff” Ve fT F Te 


Na fer T: 
Qe >——> ff“ Ne — fF ——> Re 
all three squares are pullbacks, so the bottom composite is the identity, and 
since 7 is an isomorphism the composite (f.7’)no7T is the identity on faQ r. 
Transposing this composite across the adjunction (f* 4 f,), we obtain 7/(f*r), 
so the latter must equal eg, as required. 

(vi) > (i): First we note that (vi) implies that f* preserves proper- 
ness of subobjects, so it is conservative by 1.2.4. Next, let a: f*A— f*B 
be a morphism in F between objects in the image of f*. Now the graph 
(1,a): f*A>> f*A x f*B = f*(A x B) is isomorphic to the image under f* of 
some subobject (p,q): A’ — A x B; but then f*p is an isomorphism and so p 
is an isomorphism, i.e. (p,q) is the graph of a morphism gp~': A — B, whose 
image under f* is clearly a. So f* is full as well as faithful. It is immediate 
from (vi) that the image of f* is closed under subobjects in F, so it remains 
to verify that it is closed under quotients. Given an epimorphism f*A —> B, let 
S >> f*Ax f*A S f*(Ax A) be its kernel-pair; then S must be of the form f*R 
for some R — A x A. Let A > A’ be the coequalizer of R 3 A; then since f* 
preserves coequalizers we have f* A’ = B. | 


Remark 4.6.7 Combining 4.6.6(v) with 4.2.9, we obtain another proof of the 
result (already established in 2.3.9) that a geometric morphism whose direct 
image is logical must be an equivalence. 


We digress briefly to note a connection between hyperconnected morphisms 
and the weak Beck—Chevalley condition introduced in 4.1.16. 
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Lemma 4.6.8 Suppose given a commutative square 


J 


E 


eee 


in Top, such that either f and k are both hyperconnected, or g and h are both 
hyperconnected. Then the square satisfies the weak Beck—Chevalley condition, i.e. 
the canonical natural transformation 0: g* fs — k,h* is pointwise monic. 


Proof First suppose f and k are hyperconnected. As we saw in the proof of 
4.6.6, this implies that f* and f, restrict to an equivalence between the categories 
of subterminal objects in € and in F; and similarly for k* and k,. Hence, for any 
subterminal object B of F, wehaveg*f, B S k,k* g* fa B S k,h* f* fB = k,h* B 
so 9g must be an isomorphism. Since the property of hyperconnectedness is 
‘stable under slicing’ in the obvious sense, it follows that the square satisfies 
condition (iii) of 4.1.17. The case when g and h are hyperconnected is proved 
similarly, using the natural isomorphism fhs = g.k.. g 


Next, we re-examine Example 4.6.2(c) in the light of the results established 
in this section. 


Example 4.6.9 Let f:C — D be a functor between small categories. Suppose 
f is full and bijective on objects; then D may be identified with the quotient 
category C/R, where R is the congruence induced by f (i.e. the equivalence 
relation on morphisms such that (a, 8) € R iff fa = f8), and [D, Set] is iden- 
tified via f* with the full subcategory of [C, Set] consisting of functors that 
respect this congruence (equivalently, factor through f). Since this subcategory 
is clearly closed under subobjects and quotients, it follows that the geometric 
morphism [C, Set] — [D, Set] induced by f is hyperconnected. Combining this 
with 4.6.2(c), we see that for an arbitrary f the factorization of 4.6.5, applied to 
the geometric morphism induced by f, corresponds to the unique factorization 
C — C/R — D of f as a full functor bijective on objects followed by a faithful 
functor. (This result should be compared with 4.2.12(b).) 


Restricting this example to monoids (small categories with one object), we 
see that in this context the factorization of 4.6.5 corresponds to the usual 
cover-monic factorization in the (regular) category of monoids. In contrast, the 
factorization of 4.2.10 yields nothing at all here, since the geometric morphism 
induced by a monoid homomorphism is always surjective. (On the other hand, 
for the geometric morphisms between sheaf toposes induced by continuous maps 
of spaces, it is the factorization of 4.6.5 that yields nothing, since all such mor- 
phisms are localic.) Note also that it follows from 4.6.9 that [C,Set] — Set is 
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hyperconnected iff C is strongly connected, i.e., for any two objects A and B of 
C, there exist morphisms A — B and B —> A. 

We have observed that every inclusion is localic, and similarly every hyper- 
connected morphism is surjective. It follows easily that the two factorizations 
of 4.2.10 and 4.6.5 coincide (up to equivalence), for a given f, precisely when f 
can be factored as a hyperconnected morphism followed by an inclusion. In this 
connection, we have 


Proposition 4.6.10 For a geometric morphism f: F — E, the following are 
equivalent: 


(i) The surjection-inclusion and hyperconnected-localic factorizations of f 
coincide. 


(ii) f can be factored as a composite of morphisms which are either hypercon- 
nected or inclusions. 


(iii) The counit of the adjunction (f* 4 fa) is monic. 


Proof (i) > (ii) is trivial. 

(ii) = (iii) because both hyperconnected morphisms and inclusions satisfy 
(iii), and it is easily verified that this property is stable under composition. 

(iii) > (i): Let f S gh be the surjection—inclusion factorization of f. By the 
construction of 4.2.8, the counit of (h* 4 h,) coincides with that of (f* 4 fa) 
and is therefore monic; but the unit of (h* 4 h.) is also monic by 4.2.6(iv), and 
so h is hyperconnected by 4.6.6(iv). 


For a functor f: C — D between small categories, it is easy to see that the 
induced geometric morphism |C, Set] — [D, Set] satisfies the conditions of 4.6.10 
iff f is full. 

A further result relating hyperconnected morphisms and inclusions, which 
will be of importance later on, is the following. 


Proposition 4.6.11 Let f: F — E be a hyperconnected morphism, let j be a 
local operator on E, and let k be the smallest local operator on F for which the 
composite sh,(F) — F — E factors through sh,(€) — E (ef. 4.5.14(e)). Then 
the resulting factorization f’: shy(F) —> sh;(€) is hyperconnected. 


Proof First we observe that since f* induces an isomorphism Subg(A) — 
Sub ;(f*A) for any A, it preserves the Heyting implication; hence if A’ — A is 
a j-dense subobject and A” — A is j-closed, then the pair (f*A’, f* A”) must 
satisfy the relation 6 defined before 4.5.12. In particular, the composite 


TXT 
fJ x Fa >> f* Qe x f Qe > Qr x F 


factors through OF ~ Nr x Nr (note, incidentally, that the comparison map 
T’ is monic, by the proof of (v) > (vi) in 4.6.6). Since K — Qr is defined to be 
(f*J = f*Qe — Xz)", it follows that we have f*Qj < (f*J)" = K" = Qy in 


’ 
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Sub(Qz), and hence that f* maps j-closed monomorphisms to k-closed ones. In 
particular, if A is a j-sheaf in €, then f*A is k-separated by 4.3.6(a)(iv); hence 
the unit map A — f| f’*A is monic, since it is obtained by applying f, to the 
unit map from f*A to its associated k-sheaf. 

Thus we have shown that f’ is a surjection; hence sh;(€) is the image (in the 
surjection-inclusion sense) of the composite sh, (F) — F — E. The result is now 
immediate from 4.6.10, since the composite just mentioned satisies condition (ii) 
of 4.6.10. g 


The ‘orthogonality’ result 4.6.4 tells us that, for a fixed topos E, the 
hyperconnected-localic factorization applied to geometric morphisms with 
codomain E yields a left adjoint to the inclusion CLop/E — Top/E, where 
Lop is the 2-category of toposes, localic geometric morphisms and geometric 
transformations between them. (Note that CTop/E is a full sub-2-category of 
Zop/E, by 4.6.2(f).) We refer to this left adjoint as the localic reflection for 
toposes over £E. 


Proposition 4.6.12 The localic reflection functor Top/E —> LVop/E pre- 
serves surjections and inclusions. 


Proof Let fi: Fj — E (i = 1,2) be two objects of Top/E, and let g: Fi > Fe 
be a geometric morphism over €. Form the diagram 


a 


hy 


where (h;,1;) is the hyperconnected-localic factorization of f; and k is the mor- 
phism obtained by applying the orthogonality result 4.6.4 to hy and lz. It is clear 
that if g is surjective, then so is the composite hag = kh, and hence k is surjec- 
tive. Suppose g is an inclusion; by 4.6.2(f), we know that k is localic, and hence 
the left and bottom edges of the square form the hyperconnected—localic factor- 
ization of its diagonal. But since g is an inclusion and ho is hyperconnected, it 
follows from 4.6.10 that this must also be the surjection—inclusion factorization 
of the diagonal; so k is an inclusion. g 


Further preservation properties of the localic reflection will be encountered 
in C2.4.13 and C2.5.12. 


Suggestions for further reading: Johnstone [518], Joyal & Tierney [560]. 


PART B 


2-CATEGORICAL ASPECTS OF 
TOPOS THEORY 


Bl 
INDEXED CATEGORIES AND FIBRATIONS 


B1.1 Review of 2-categories 


For many of the topics we wish to study in topos theory, it is impossible to 
ignore the fact that toposes and geometric morphisms form not just a cate- 
gory but a 2-category; that is, we have to take account also of the geometric 
transformations between geometric morphisms, as introduced in A4.1.1(b). Even 
amongst experienced category-theorists, there is often a good deal of reluctance 
to take 2-categorical (or, still worse, bicategorical) notions seriously; so the pur- 
pose of this section is to review as much of the basic theory as we shall need for 
subsequent developments. 

The neatest definition of a 2-category is as a Cat-enriched category, where 
Cat denotes the category of small categories. In other words, a 2-category R 
consists of a class of objects A, B,... together with, for each pair of objects 
(A, B), a category & (A, B), and for each triple (A, B, C) a composition operation 


R(B,C) x (A,B) —> R(A,C) 


which is bifunctorial and associative, and equipped with identities. However, 
since this definition begs the question of local smallness (and because we have 
not yet reviewed the theory of enriched categories — see Section B2.1 below), we 
shall spell out in full what this means in elementary terms. 


Definition 1.1.1 A 2-category R consists of three types of entities: objects or 
0-cells (usually denoted A, B,C,...), morphisms or 1-cells (denoted f,g,h,...), 
and transformations or 2-cells (denoted a, 3,7,...), equipped with the following 
structure: 


(a) There are two operations dom, cod from morphisms to objects. 


(b) There are two operations sce, tgt from transformations to morphisms, sat- 
isfying dom(sce a) = dom(tgt a) and cod(sce a) = cod(tgt a) for all a. 
(We shall abbreviate dom(sce a) to dom a and cod(sce a) to cod œ when 
this is unlikely to cause confusion.) 


(c) There is an operation A ++ 1,4 from objects to morphisms, which is a 
simultaneous splitting for dom and cod. 
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(d) There is an operation f +> iş from morphisms to transformations, which is 
a simultaneous splitting for sce and tgt. 


(e) There is a partial binary operation (f,g) +> gf on morphisms, such that 
gf is defined iff cod f = dom g, and the equations dom gf = dom f, 
cod gf = cod g, h(gf) = (hg)f, fla = f and lpf = f hold whenever they 
make sense. 


(f) There are two partial binary operations on transformations: vertical com- 
position (denoted (a, 3) ++ Ba) and horizontal composition (denoted 
(a, 8) > Boa), such that 


(i) Ba is defined iff tgt œ = sce 8 and G0 a is defined iff cod a = dom ĝ; 
(ii) sce Ba = sce a and tet Ba = tgt 8; 
(iii) sce 8 o a = (sce )(sce a) and tgt Boa = (tgt 8)(tgt a); 
(iv) the associative laws y(8a) = (y@)a and yo (Goa) = (yo 8) oa hold; 
(v) transformations of the form ef act as units for vertical composition, 
and those of the form 4, act as units for horizontal composition, and 


(vi) the interchange law (6y)0(@a) = (60)(you) and the identity tgous = 
tf hold, 


it being understood that all the above identities are to be read in the sense 
‘if both sides are defined, then they are equal’. 


The names ‘horizontal composition’ and ‘vertical composition’ are suggested 
by the pictures 
f h 
—> > 
A la B 43 C 
ee 
g k 
and 


f 


|B 


— 

h 
which we frequently draw to provide a visual description of what is going on inside 
a 2-category. We shall not go into the detailed theory of such ‘pasting diagrams’ 
here, but we shall feel free to use it at least in an informal way. In passing, we 
remark that if one of the 2-cells in a horizontal composite is a (vertical) identity 
Lf, then we normally write a o f instead of a o ts. 

Of course, if A and B are two objects of a 2-category £, then the morphisms 
and transformations of R with domain A and codomain B form a category, 
which we denote by & (A,B). Composition with a fixed morphism of £ induces 
functors between these categories, and horizontal composition with a fixed trans- 
formation induces natural transformations between such functors. It should be 
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noted that we have not imposed any smallness condition (not even local small- 
ness) on the hom-categories of R, and in many of the examples we shall have 
to consider this lack of restriction will be essential. However, in introducing 
2-categorical concepts we shall often ‘cheat’ by pretending that our 2-categories 
do have ‘category-valued hom-functors’, and the like, simply in order to avoid 
the tedium of writing out in full the elementary version of what the definitions 
mean. (But the reader should be aware that it is always possible to do so, if one 
is sufficiently determined!) In this connection, we shall use the notation CAT 
for the ‘meta-2-category’ of categories (without size restriction), functors and 
natural transformations, as opposed to the (legitimate) 2-category Cat of small 
categories. 

There are two different duality principles that we can apply to an arbitrary 
2-category A, one involving the reversal of 1-cells and the other the reversal of 
2-cells. It is convenient to distinguish notationally between these; accordingly, if 
K is a 2-category, we shall write R°? for the 2-category obtained by reversing the 
1-cells but not the 2-cells of R (so that R°P (A, B) = R(B, A)), and R° for that 
obtained by reversing 2-cells but not 1-cells (so that °° (A, B) = (R (A, B))°?). 
If we need to reverse both the I-cells and the 2-cells of R, we denote the result 
by KOOP | 

If P is a property of categories, we shall say that a 2-category is locally P if its 
hom-categories all have property P; thus a locally small 2-category is precisely 
a Cat-enriched category. This usage conflicts with our normal use of the word 
‘locally’ for ordinary categories, as introduced in Section Al.1 (though not with 
our use of ‘locally smal!’ for ordinary categories!), but it seems unlikely to lead 
to confusion. In particular, if C is an ordinary category, we shall identify it 
when necessary with the locally discrete 2-category having the same objects and 
morphisms as C, but only (vertical) identity transformations. 

A guiding principle of our approach to 2-category theory (which, inciden- 
tally, is part of the reason for not regarding 2-categories simply as Cat-enriched 
categories) is that it is in most cases unreasonable to expect diagrams in a given 
2-category to commute ‘on the nose’; usually we are much more interested in 
diagrams which commute up to 2-isomorphism (that is, up to invertible 2-cells 
placed in every cell of the diagram). These 2-isomorphisms must of course satisfy 
‘coherence conditions’ which ensure that any two composite 2-cells constructed 
by pasting them together, and having the same source and target, are equal. In 
the interests of maintaining clarity, we shall normally not spell out such con- 
ditions in their fullest detail; indeed, we shall often suppress the names of the 
‘canonical’ 2-isomorphisms up to which our diagrams commute, and simply write 
f = (where f and g are the names of composite 1-cells) where in an ordinary 
category we would write f = g, to indicate that we are considering the canon- 
ical 2-isomorphism between them. We shall also occasionally use the notation 
a = 3, where @ and are 2-cells whose sources and targets are not equal but 
only canonically isomorphic, to indicate that they become equal on composi- 
tion with the canonical isomorphisms. But it is important to remember that the 
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canonical 2-isomorphisms really are there, and must be carried along as part of 
the structure we are talking about. 

In particular, when we speak of a functor between 2-categories, we generally 
mean what is officially called a pseudofunctor. (We shall also sometimes call it 
a 2-functor, to emphasize the fact that it acts on 2-cells as well as objects and 
1-cells of the domain category.) 


Definition 1.1.2 Let R and £ be 2-categories. A (pseudo) functor F: R > £ 
consists of the following data: 
(a) a mapping At FA from objects of & to objects of £; 
(b) a mapping f + Ff from morphisms of R to morphisms of £; 
(c) a mapping a+ Fa from transformations of R to transformations of £; 
(d) for each object A of R, a 2-isomorphism $4: lea —> F (1,4), and 
(e) for each composable pair (f,g) of morphisms of &, a 2-isomorphism 


$39: (F9)(F Ff) > F(gf), 


such that dom Ff = F(dom f), cod Ff = F(cod f), sce Fa = F(sce a), 
tgt Fa = F(tgt a), F(t) = ters, F(Ba) = (FG)(Fa), and the @’s satisfy the 
coherence conditions that 


lra 
———— > Ff 
FA l|l¢a_ FA and FA ————> FB 
F(1a) 
Spa 
1 
bong T Ff Me F(1p) Pal Lee 
FB FB 
Ff 
are both equal to urs, that the two composites 
Fg Fg 
FB ——\—> FC and FB-——— >FC 
Fh l Ph,g 
F(gh) 
l Pgh, f 
FA —— FD 
F(fgh) 


are equal, and that 
bg k(FB o Fa) = F(Boa)dbsn 
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for any horizontally composable pair of 2-cells 


A la B l8 C. 
—— — 


g k 


In practice, the pseudofunctors we meet are generally normalized (or strictly 
unital); that is, they satisfy F(14) = lr, and the ġa are identity morphisms. 
We shall often assume this condition without commenting on it — after all, if we 
are given a pseudofunctor which happens not to be strictly unital, we can easily 
‘normalize’ it by redefining F'(1,4) to be 1, for all A (and suitably modifying the 
definition of $+, whenever either f or g is an identity morphism). However, there 
is no similar way of modifying a given F so that the ¢7,, all become identities, 
and so we shall not assume the latter condition without saying so explicitly. (We 
say so by calling F a strict functor.) 

Occasionally we shall wish to consider laz functors, which differ from 
pseudofunctors in that the pa and $+, are not even required to be isomor- 
phisms; and oplaz functors, which are similar except that the ¢4 and df, all 
point in the opposite direction. (In other words, an oplax functor R — £ is the 
same thing as a lax functor R — L°°.) 

There is, perhaps, a certain lack of consistency in the fact that we have 
standardized on pseudofunctors rather than strict 2-functors as the basic notion, 
but also on ‘strict’ 2-categories rather than bicategories, in which the associative 
and unit laws for composition of morphisms hold only up to (specified, coherent) 
2-isomorphisms. Our justification for this is basically pragmatic: virtually all 
the 2-dimensional categories we shall meet (with one important exception, the 
bicategory of profunctors which we shall encounter in Section B2.7) are in fact 
strict 2-categories (usually this is because their morphisms are in fact functions 
of some kind, and their composition is the usual composition of functions), and it 
saves notational complications if we treat them as such. Nevertheless, nearly all 
the results about 2-categories which we review here are also true for bicategories; 
we refer the reader to [99] for further details. 

Between (pseudo)functors, we have (pseudo)natural transformations, such a 
transformation between functors (F, ¢) and (G, y): R — £ assigns to each object 
A of R a morphism e4: FA — GA of £, and to each morphism f: A— B of R 
a 2-isomorphism npr fitting into the diagram 


Ff 
FA ——> FB 
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subject to the evident compatibility conditions with the ¢’s and w’s. As with 
functors, we shall occasionally want to consider laz natural transformations, 
which are defined in exactly the same way but without the requirement that the 
ny should be isomorphisms. 

There is also a fourth level of structure, that of modifications between nat- 
ural transformations: a modification between natural transformations (e,7) and 
(e’,n’) assigns to each object A of £ a transformation e4 — e’, in £, subject to 
compatibility conditions with the ns and n}. 

In any 2-category, we say a morphism f: A — B is left adjoint tog: BA 
(and write (f 4 g)) if there are transformations 7:14 — gf and €: fg —> ig 
(the unit and counit of the adjunction) satisfying the usual triangular identi- 
ties. The right adjoint of a given morphism, if it exists, is unique up to unique 
2-isomorphism. An equivalence in a 2-category may conveniently be defined as an 
adjunction whose unit and counit are 2-isomorphisms; we shall also use the terms 
reflection and coreflection for adjunctions where just one of these transformations 
is invertible. 

Similarly, given functors F and G:R— £, a natural adjunction between 
F and G consists of a pair of natural transformations d: F — G, e: G —> F, 
equipped with modifications 7: 1 — ed and €: de — 1 satisfying the triangular 
identities; what this means is that we have, for each object A of R, an adjunction 
(da 4 e,) in £, and that these adjunctions ‘fit together’ — that is, they satisfy 
appropriate compatibility conditions with the coherence 2-isomorphisms (whose 
names we have already suppressed) forming part of the specification of d and e. 
We shall be mainly concerned with natural equivalences, i.e. natural adjunctions 
which are ‘pointwise’ equivalences. 

We shall also speak of equivalences between 2-categories. By an equiva- 
lence between 2-categories R and £, we mean a pair of functors F: R — £ and 
G: £ — R, together with (specified) natural equivalences between the compos- 
ites FG and GF and the respective identity functors. Occasionally it happens 
that we have not just equivalences but isomorphisms between these composites 
and the identities; if we wish to draw attention to the fact that this happens, we 
shall call (F, G) a strong equivalence. 

When we speak of representability of a €at-valued functor on a 2-category, we 
shall (in keeping with the general principle that diagrams in a 2-category cannot 
be expected to commute ‘on the nose’) interpret this notion in the ‘up to natural 
equivalence’ rather than the ‘up to natural isomorphism’ sense. Thus a repre- 
sentation of F: R — Cat consists of an object A of A together with a natural 
equivalence between F and the (strict) functor R (A, —) (here we are — temporar- 
ily — assuming that & is locally small). Of course, the representing object A is 
unique up to equivalence in R, the equivalence between two representations A 
and A’ being itself unique up to unique 2-isomorphism. 

The idea of a representation can be formulated in elementary terms, even 
when & is not locally small. For example, a limit for a diagram D: 3 — & 
consists of an object A of & together with a (pseudo)cone over D with vertex A 
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(that is, a natural transformation from the constant diagram with value A to 
D) and an operation assigning to each cone over D with vertex B a morphism 
B — A and an invertible modification from the given cone to the composite of 
the limit cone with this morphism, and to each modification between cones with 
vertex B a transformation between the corresponding morphisms B — A such 
that an appropriate diagram commutes, .... We shall generally not spell out 
the details of such definitions; it is always possible to work them out for oneself, 
by asking what it would mean in elementary terms, in the locally small case, 
to have a natural equivalence between the appropriate Cat-valued functors, and 
then throwing away the requirement that R be locally small. 

It is by now well known that the ordinary ‘conical’ limits described in the last 
paragraph are not sufficient to capture the notion of completeness required for 
2-categories. Instead, we need to consider weighted limits, which are representa- 
tions of Cat-valued functors sending an object to the category of weighted cones 
over a given diagram (with a given weighting). (When they were first introduced 
by R. Street [1130], weighted limits were known as ‘indexed limits’; but that 
term has been abandoned because of the clash with indexed categories, which 
we shall introduce in the next section.) 


Definition 1.1.3 
(a) A weighted diagram of type J in a 2-category & consists of a functor 
D:3— K together with a functor W: J — Cat (called the weighting of 
the diagram). 
(b) A weighted cone over a weighted diagram (D,W) consists of an object A 
of R (the verter of the cone) together with 


(i) for each object J of J, a functor L(J): W(J) > R(A, D(J)), and 


(ii) for each morphism f: J — J’ of J, a natural isomorphism L(f) fitting 
into the diagram 


L(J) 
W (J) ——> RA, D(J)) 


W(P) LEC) D(f) o (-) 


L(J'’) 
W(J') ——> R(A, D(J')) 


such that the L(f) are compatible in a suitable sense with composition 
in J, and such that for each 2-cell of J an appropriate diagram of natural 
transformations commutes. 

If L and L’ are weighted cones over the same weighted diagram (D, W) 
with the same vertex A, then we may define a morphism L — L’ to bea 
family of transformations L(J) — L’(J), J € ob J, which are compatible 
with the L(f) and L’(f). Thus we have a category Cone(A,(D,W)) of 
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cones over (D, W) with vertex A; and it is easy to see that the assignment 
A > Cone(A, (D, W)) is contravariantly functorial in A. 


A weighted limit for the weighted diagram (D, W) is a representation of the 
functor just described; in other words, it consists of an object A equipped 
with a weighted cone Z such that, for each object B of R, composition with 
L is part of an equivalence of categories R(B, A) ~ Cone(B,(D,W)). (We 
call a weighted limit strong if this equivalence is in fact an isomorphism for 
all B.) 


Dually, given a functor W: J°P — Cat, we may define the notions of 
W-weighted cone under a diagram D: J — K, and of a weighted colimit for 
the pair (D,W). (Thus weighted colimits in R are the same thing as weighted 
limits in RP.) 


(e 


— 


Examples 1.1.4 (a) Ordinary diagrams in & can be regarded as weighted dia- 
grams for which W is the constant functor with value 1. Thus weighted limits 
include ordinary ‘conical’ limits as a special case. More generally, if the weight- 
ing functor W : J — Cat takes discrete categories as values, then a W-weighted 
diagram is equivalent to an ordinary diagram over a ‘larger’ 2-category, in which 
the Jth vertex of the original diagram is repeated as many times as there are 
objects of W(J). (This is the reason why the difference between ordinary and 
weighted limits does not become apparent in the theory of 1-categories.) 

(b) Suppose 3 is the terminal category 1; then the weighting W simply picks 
out a particular (small) category C, and a W-weighted limit for (the functor 
picking out) an object A of R is what is commonly called a cotensor of A with C; 
that is, an object C ħ A together with a diagram of type C in R(C h A, A), 
which is universal in the sense that composition with it induces an equivalence 
between (B,C h A) and the functor category [C, 8 (B, A)| for any B. (The 
corresponding weighted colimits are called tensors with C, and denoted C & A.) 
If R itself is Cat, then cotensors are just functor categories (and tensors are 
ordinary products), since we have [C, [B, Al] = [B x C, A] = [B, (C, Al]. 

(c) Let J be the finite category 


e ——— > è €— o 
and let W : J — Cat be the diagram 


ee, E ee 


where, as usual, n denotes an n-element totally ordered set, and 0 and 1 denote 
distinct objects of 2. Then a W-weighted limit for a diagram 


f g 


A ———-> C «e B 


in Ñ is what is commonly called the comma object (f | g) (also sometimes called a 
Jax pullback - though this name properly belongs to a slightly different concept, 
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cf. 1.1.6 below); that is, it is a universal solution to the problem of completing 
the above diagram to one of the form 


o 
B 
with a 2-cell as indicated. If & has pullbacks and cotensors with 2, then it also 


has comma objects: (f |g) may be constructed by forming the (iterated) pullback 
of the diagram 


——>aA 
l Í 
g 
— C 
A 2hc B 


where dp and d; are the domain and codomain of the universal diagram of type 2 
in R (2 h C, C). Conversely, if £ has comma objects then it has cotensors with 2, 
since we may take 2 rh C to be (Ic llo). 

(d) Let 3 be the finite category 


‘Se 


and give it the weighting represented by the diagram 
0 
Poe. 2 
1 
in Cat. Then a weighted limit for a diagram 


A= SB 


g 


in R is what is commonly called an inserter for the pair (f, g); it is a universal 
solution to the problem of finding a morphism h: C — A together with a 2-cell 
fh — gh. Inserters may be constructed from pullbacks, products and cotensors 
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with 2: specifically, if we form the pullback 
I(f,g) ———> 2h B 


h (do, di) 


a (f,9) ee 


then J(f,g) is an inserter for the pair (f, g). 
(e) To construct what are known as inverters as weighted limits, we need to 
take J to be a ‘real’ 2-category (that is, one with non-identity 2-cells), namely 


equipped with the weighting 


where I is the indiscrete category with two objects 0 and 1. An inverter for a 
diagram 
f 


——_-> 
A la B 
——-> 
g 


in & is a universal solution to the problem of finding k: C — A such that a o k 
is invertible. 

If we take the same Jf as above, but replace W by the constant functor with 
value 1, we obtain the notion sometimes called an identifier; for a diagram 


f 
A la B 
> 
g 


in Ñ, it is the universal solution to the problem of finding h: C — A such that 
aoh ‘is’ an identity 2-cell. At least, that is what we would get if we were working 
with strict limits; that is, if we replaced pseudo-cones by strict cones (ones in 
which the 2-cells L(f) of Definition 1.1.3(b)(ii) are all identities). However, in 
the context of pseudo-cones which we have adopted as standard, the notion of 
identifier cannot be distinguished from that of inverter. More generally, if W 
and W”: J — Cat are naturally equivalent functors, then W -weighted limits for 
diagrams D: J — & (whenever they exist) are also W’-weighted limits. 
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Yet again, we can construct inverters from pullbacks and cotensors with 2: 
the inverter of a fits into the pullback square 


e ——— B 


4 


at 
A——> 2hB 


where ‘a! denotes the 1-cell into 2 hh B corresponding to the 2-cell a with 
codomain B. 

(f) An equifier is the ‘2-dimensional analogue’ of an equalizer: specifically, 
an equifier for a diagram 


f 
Aal 16 B 


g 


is a universal solution to the problem of finding h: C — A such that aoh = Boh. 
We can describe equifiers as weighted limits: we take J to be the 2-category 
indicated by the shape of the above diagram, equipped with the weighting 


E LS 
SS 


where the two 2-cells are equal. And we can construct them from products, 
pullbacks and cotensors with 2: the equifier of a and 8 fits into the pullback 
square 

e ————— 2AB 


A 


(Ta), TB.) 
A ———> 2h Bx2nhB 
where A is the diagonal map. 

Note also that inverters may be constructed from inserters and equifiers: to 
form the inverter of a 2-cell a: f — g, we may first ‘insert’ a universal 2-cell 
3: gh — fh, and then successively form two equifiers to force the vertical com- 
posites (ach)@ and (aoh) to equal the appropriate identity 2-cells. We remark in 
passing that the class of weighted limits which can be constructed from products, 
inserters and equifiers has received some attention [987]. 


We saw in Examples (c-f) above that many weighted limits can be con- 
structed from cotensors with 2 and ordinary limits. This is typical: in the same 
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way that all finite ordinary limits can be constructed from finite products and 
equalizers (A1.2.1), we have 


Theorem 1.1.5 If a 2-category R has finite (resp. small) products, equalizers 
and cotensors with 2, then it has all finite (resp. small) weighted limits. (Here 
‘finite weighted limit’ means one such that the index 2-category J is finite, and 
the values of the weighting functor W : 3 — Cat are finite categories.) o 


We omit the proof, which may be found in [1130] or [583]. 

We shall also have occasion to consider lax and oplax limits: these too can 
be considered as a special case of weighted limits. By a laz diagram of shape J 
in R, we of course mean a lax functor J — K, as we defined the concept after 
1.1.2. A laz cone over a lax diagram D is an oplax natural transformation from 
a constant diagram to D; that is, it assigns to each object J of J a morphism 
L(J): A — D(J), and to each morphism f: J > J’ a 2-cell L(f) as in 


L(J) 


A Se SS) 


LLCS) 
LUI’) D(f) 


D(J') 


subject to the usual conditions of coherence and compatibility with the 2-cells 
of J, but without the requirement that the L(f) be invertible. An oplaz cone is 
similar, except that the L(f) point in the opposite direction. A laz (resp. oplar) 
limit for a lax diagram is of course a universal lax (resp. oplax) cone over it. 

The notion of (op)lax limit is subsumed in that of a weighted limit, by the 
following result: 


Lemma 1.1.6 Any small 2-category 3 has a ‘relaxation’ J such that lax func- 
tors D: J > K correspond to ordinary functors D: 3 — R. Moreover, 3 has a 
canonical weighting W such that W-weighted cones over D correspond to lax 
cones over D. O 


We omit the proof of 1.1.6, which is rather messy to write out in full general- 
ity, though the basic idea is straightforward enough. The underlying 1-category 
of J is the free category generated by the underlying directed graph of the under- 
lying i-category of J (that is, it has the same objects as J, and its morphisms 
are finite composable strings of morphisms of J), and its 2-cells are generated by 
the 2-cells of J together with those whose images under a pseudofunctor defined 
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on 3 will form the coherence 2-cells of the corresponding lax functor on 3. The 
construction of the weighting W is similar. 


Example 1.1.7 A particular case of lax diagrams and limits is of special inter- 
est: if J is the terminal category 1, then a lax diagram of shape J is simply a 
monad in R, i.e. an object A equipped with a morphism t: A — A and 2-cells 
n: 14 >t, p: tt — t satisfying the usual monad identities. A lax cone over such 
a diagram consists of a morphism f: B — A together with a 2-cell 6: tf — f 
satisfying identities that look like those satisfied by an algebra for a monad; a 
universal lax cone (if it exists — in Cat it may be taken to be the Eilenberg— 
Moore category of algebras for the monad, equipped with its forgetful functor 
to A) is called an Filenberg-Moore object for the monad. Thus we see that 
Eilenberg-Moore objects are a particular case of weighted limits. (Note also 
that in this case the relaxation 1 may be identified with the strict monoidal 
category (= 2-category with one object) Ord;, whose underlying 1-category is 
the additive monoid of natural numbers, and whose unique ‘hom-category’ is the 
category Ord; of finite ordinals and order-preserving maps between them.) 


In addition to adjunctions in a 2-category, we shall also wish to consider 
weak adjunctions: we say f: A — B is weak left adjoint to g: B — A if there 
exist 2-cells 7: 14 — gf and €: fg — 1g such that 


pastes to the identity 2-cell 1, (though the other triangular identity need not 
hold). Note that the assertions ‘f is weak left adjoint to g’ and ‘g is weak right 
adjoint to f’ are not equivalent: the latter would imply that the other triangular 
identity holds, though the one above need not. 


Lemma 1.1.8 Suppose R is locally Cauchy-complete (i.e. idempotent 2-cells 
split in R). Then a morphism of R has a weak left (right) adjoint iff it has a left 
(right) adjoint. 
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Proof One direction is trivial. Conversely, suppose g: B — A has a weak left 
adjoint f as above. Then by considering the diagram 


A 
1 
| ln 4 
Po SA 


le |f la 


lg 


we see that (€ o f)(f on) is an idempotent 2-cell f > f. Let 


B 
fp SS SF 
be a splitting of this idempotent; then it is easy to verify that the 2-cells 


0 = (g°)n: 14 — gh and ¢ = efa o g): hg — 1g satisfy both triangular identi- 
ties, and so make h left adjoint to g. CJ 


Virtually all the 2-categories that we shall consider have the property that 
their idempotent 2-cells split. (It will be recalled that we verified this condition, 
for the particular 2-category Top, in A4.1.15.) 

Weak adjunctions arise naturally in contexts where retracts are under con- 
sideration. Actually, we have to be slightly careful in defining the notions of 
‘idempotent’ and ‘retract’ in a 2-category: we state the following definitions for 
future reference. 


Definition 1.1.9 (a) By an idempotent 1-cell in a 2-category R, we mean a 
1-cell t: A — A equipped with a 2-isomorphism ju: tt — t such that pot and top 
are equal as 2-cells ttt — tt. (Equivalently, an idempotent may be regarded as 
a normalized pseudofunctor M — R, where M is the two-element monoid {1, e} 
with e? = e.) 

(b) By a splitting of an idempotent (t, u) in a 2-category R, we mean a 
pair of l-cells 1: B — A, r: A — B together with 2-isomorphisms a: ri — ig 
and §:t — ir such that u equals the vertical composite B~} (i o a o r)(8 o 8). 
We note that a splitting of an idempotent (t, p) may be obtained as either a 
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limit or a colimit of the corresponding pseudo-functor M — Ñ; in particular, if 
K has either equalizers or coequalizers (in the 2-categorical sense) then all its 
idempotents split. We say B is a retract of A if it occurs as a splitting of some 
idempotent endomorphism of A. 

(c) By an idempotent monad in Ñ, we of course mean a monad (A,t, 7, p) 
(as defined in 1.1.7) for which yz is an isomorphism. We note that (t, u) is then 
necessarily an idempotent 1-cell as defined in (a) above, since the identity pot = 
toy follows from the associative law plus the invertibility of u. We note also that 
if (i, r,a, 8) is a splitting of such an idempotent (t, j.), as defined in (b), then r is 
left adjoint to i: the unit of the adjunction is the composite 67, and the counit is 
the isomorphism a. (And in this case the splitting may alternatively be obtained 
as an Kilenberg-Moore object for the monad (A,t,7, 4).) We say an object B 
is an adjoint retract of A if it occurs as a splitting of an idempotent monad on 
A; dually, B is a coadjoint retract if it occurs as a splitting of an idempotent 
comonad. 


We note that the equivalences (resp. the cores) in an allegory, as defined in 
A3.3.1, are exactly those symmetric idempotents which are idempotent monads 
(resp. comonads). (Of course, in a locally ordered 2-category, any monad is 
idempotent, since a 2-cell has a 2-sided inverse iff it has a 1-sided inverse.) We 
saw in A3.3.12 that, in order to split all the symmetric idempotents in an allegory, 
it is enough to split those which are either monads or comonads; in a general 
2-category the corresponding result is of course false, but in Chapter C4 we 
shall encounter a number of important 2-categories where it suffices to split the 
idempotent comonads in order to split all idempotents. (Cf. also 1.1.15 below.) 

We now return to the relationship between weak adjoints and retracts. The 
following result is typical: 


Lemma 1.1.10 Suppose given a commutative diagram 
r ance E 


f g f 


d t 


Ae cee SO EE, 


(up to isomorphism) in a 2-category R, such that ri = 14 and ri’ Œ la by 
isomorphisms which are compatible in the obvious sense with the isomorphisms 
fr = r'g and gi S i'f. Suppose further that g has a weak left adjoint. Then f has 
a weak left adjoint. In particular, if g has a left adjowunt | and K is locally Cauchy- 
complete, then f has a left adjoint m. Moreover, if the adjunction (I 4 g) is a 
coreflection (i.e. its unit is an isomorphism), then so is the adjunction (m4 f). 
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Proof Let l denote the weak left adjoint of g, and consider the composite rli’. 


We have 2-cells 
rionoi 
a: ly & ri) ———> r'gli' S fri 


and 
B: rli! f S rlgi ———% ri ~ La, 

where ņ and € are the ‘weak unit’ and ‘weak counit’ of the weak adjunction 
between g and l. A straightforward diagram-chase shows that the composite 
(fo8)(aof) is the identity 2-cell on f; so rli’ is weak left adjoint to f. The second 
assertion follows from the first and 1.1.8. For the third, we note that if the unit 
1 — gl is an isomorphism, then so is the weak unit 1 > frli’; but the left adjoint 
m of f is obtained by splitting an idempotent endomorphism of rli’, and the unit 
1 — fm is obtained by factoring 1 — frlz’ through the split monomorphism 
fm — frli'. It follows easily that both halves of this factorization must be 
isomorphisms. 0 


A particular context where weak adjoints occur is in the study of KZ-monads, 
which are a special class of (2-)monads on 2-categories playing a rôle some- 
what analogous to that of idempotent monads in 1-category theory. By a monad 
D = (D,n, u) on a 2-category R, we of course mean a pseudo-monad; that is, D 
is a pseudofunctor R — R, 7 and p are pseudo-natural transformations, and the 
diagrams 


ND Dn Dp 
D ———> DD <—-——- D and DDD ———> DD 


H HD H 


D DD ahes D 
commute not ‘on the nose’ but up to (specified, but nameless) invertible modifi- 
cations, which are in turn required to satisfy appropriate coherence conditions. 
Similarly, when we speak of a D-algebra, we mean a pseudo-algebra. (Monads on 
2-categories are often called ‘doctrines’, but there seems no compelling reason to 
abandon the name familiar from 1-dimensional category theory.) 


Definition 1.1.11 A monad D = (D,n,p) on a 2-category Ñ is called a KZ- 
monad ìf u is naturally left adjoint to 7p: D —> DD, the counit of the adjunction 
being the invertible modification in the leftmost cell of the diagrams above. 


KZ-monads were introduced independently by A. Kock [605] and 
V. Zoberlein [1262]; the name is apparently due to R. Street [1132]. Although 
the notion of KZ-monad has a ‘handedness’ (that is, it is not self-dual at the 
level of 2-cells), the apparent asymmetry in the definition, in that it refers to np 
but not to Dn, is only apparent: 
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Lemma 1.1.12 (D,n, 1) is a KZ-monad iff p is naturally right adjoint to Dn, 
the unit of the adjunction being the natural isomorphism appearing in the 
definition of a pseudo-monad. 


Proof Given the unit map ¢4:lppa—7pala, we define the counit 
wa: (Dna)ua > lppa by pasting the diagram 


DDA i ey DA 
DDna 
IpDA DDDA 


Dua IppDA 
a, ov Ne Dna 


DDA ———————> DDDA 


Dnpa 
HDA 
Ippa DDA 


where the cells without explicit 2-arrows in them are those appearing in the 
definition of a pseudo-monad, plus an instance of the naturality of js. It is clear 
from the construction that %4 is natural in A; the verification that it is indeed the 
counit of an adjunction (pa 4 Dna) is tedious but straightforward. Conversely, 
given Ya, we obtain ġa by pasting a similar diagram. D 


KZ-monads may also be defined more symmetrically in terms of a modifica- 
tion Dn —> np satisfying certain axioms; see [605]. However, the importance of 
the notion resides in the existence of the adjunctions of 1.1.11 and 1.1.12. 


Proposition 1.1.13 LetD be a KZ-monad on a 2-category R, and A an object 
of R. Then 


(i) Any retraction (up to 2-isomorphism) forna: A — DA is weak left adjoint 
to na. 
(ii) Any D-algebra structure on A is left adjoint to na. 
(iii) Conversely, if na has a left adjoint such that the counit of the adjunction 
is an isomorphism, then this adjoint is a D-algebra structure on A. 
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Proof (i) Suppose given a morphism f: DA—A and a 2-isomorphism 
a: fna —> la. We obtain a 2-cell 6: ipa — naf by pasting the diagram 


ipa 


| Da™! 


DA Ta DDA 


İDDA 
Loa 
DA ———> DDA Pe es DA 
NDA 
NA 


where ¢ is the unit of the adjunction between p4 and npa, and the bottom cell 
commutes up to isomorphism by naturality of n. If we paste this to a, we obtain 


A Ipa 


D 
la DA——> DDA > DA 


of which the two central cells collapse by one of the triangular identities for 
(ua ~ npa), and the rest then collapses to the identity 2-cell on 74 by naturality 
of 7. 
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(ii) If we paste 8 to a the other way, we obtain 


Ipa 


D 
DA — > pA -SDA 
NDA 


NA 
al j 


la 
A——— >A 


which does not collapse to the identity in general; but if f is a D-algebra 
structure, then we may modify the lower half of this diagram to yield 


Ipa 


Dna 


DA ——> DDA | Dam! 
lppa 
ipa ST, -Iya 
Df 
DA ——> DDA ——> DA 
NDA 
ipa HA f 
f 


DA ——> A 


of which two cells collapse by one of the triangular identities for (ua 4 npa), 
and the rest collapses to the identity 2-cell on f by the coherence conditions for 
a D-algebra. 

(iii) Conversely, suppose 74: A — DA has a left adjoint f, such that the 
counit of the adjunction is an isomorphism a: fna — 14. Then f- Df is left 
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adjoint to Dna -na, whereas f - ua is left adjoint to np, - na; but the two right 
adjoints are isomorphic by the naturality of 7, so we obtain an isomorphism 
between the two left adjoints. This latter, together with a, may be checked to 
satisfy the coherence conditions for a D-algebra structure on A. C 


Corollary 1.1.14 IfD is a KZ-monad on a 2-category R, then a given object A 
of R admits (up to canonical isomorphism) at most one D-algebra structure. If R 
is locally Cauchy-complete, then such a structure evists iff the unit na: A— DA 
is split monic. o 


A further result on KZ-monads, which we shall exploit in Chapter C4, is the 
following: 


Corollary 1.1.15 Let D be a KZ-monad on a 2-category R, and (A, f) a 
D-algebra in R. If f has a left adjoint in R, then (A, f) is expressible as a 
retract (in the 2-category R of D-algebras) of a free algebra. The converse holds 
if R is locally Cauchy-complete. Moreover, in the latter case, every retract of a 
free D-algebra is a coadjoint retract of such an algebra. 


Proof If f has a left adjoint g, the latter is necessarily an algebra homomor- 
phism (A, f) — (DA, pa); for on taking right adjoints of all the 1-cells in the 
diagram 


Dg 
DA ——> DDA 


f HA 


Ae pA 


we obtain an instance of the naturality of 7. But f is also a morphism of 
D-algebras, by the ‘associativity axiom’ for an algebra; and the unit of the 
adjunction (g 4 f) is an isomorphism, since the counit of (f 4 na) is an iso- 
morphism. So (A, f) is a retract of (DA, ua), and indeed a coadjoint retract 
since f is still right adjoint to g in RP. 

Conversely, it follows from 1.1.12 that the algebra structure on a free 
D-algebra has a left adjoint in R; and 1.1.8 ensures that this property is inherited 
by retracts. The last assertion follows from the other two. B 


In Section B4.5 (and again in Section C4.3), we shall need to consider an 
‘intrinsic’ notion of (co)completeness for objects of a 2-category. Once again, 
we take our motivation from the 2-category CAL. Recall that a category C is 
cocomplete iff left Kan extensions of C-valued functors exist along any functor 
G: D — E between small categories, or more generally along any functor such 
that the comma categories (G | A), A € ob €, are all small. Moreover, these Kan 
extensions are what are generally called ‘pointwise Kan extensions’: this means 
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that computation of the left Kan extension at an object A of E depends only on 
the comma category (G | A). These ideas motivate the following definitions: 


Definition 1.1.16 Let A be a class of morphisms in a 2-category K. 


(a) We say an object D of & is A-cocomplete if, for every f: A — B in A, 
the functor f#: &(A, D) — £ (B, D) induced by composition with f has a left 
adjoint f+. 

(b) If & has comma objects and, for any comma square 


Gis 


C 
P ē > ; 
Í 
A ———> B 


in R, the condition f € A implies q € A, then we say an object D is point- 
wise A-cocomplete if it is A-cocomplete and, for any comma square as above, 
the Beck-Chevalley condition holds, i.e. the canonical natural transformation 
qp — g* f, is an isomorphism. 

(c) If R has pullbacks and A is stable under pullback, then we say D is 
linearly A-cocomplete if it satisfies a condition similar to (b), but with comma 
squares replaced by pullback squares. 

(d) If M is a subclass of A (to be thought of as the class of monomorphisms 
in A), then we say D is an A-cocomplete M -injective if it is A-cocomplete and, 
for every f: A— B in M, the unit map 1 > f*# f}. is an isomorphism. 

(e) We say that a morphism g: C — D between A-cocomplete objects of R 
is A-cocontinuous if, for any f: A— B in A and any h: A — C, the canonical 
2-cell fi(gh) — g- f+(h) (where the f} on the left refers to the cocompleteness 
of D, and that on the right to C) is an isomorphism. 


We note that this intrinsic notion of cocompleteness has at least one familiar 
property: 


Lemma 1.1.17 Suppose g:C — D has a right adjoint in R. Then it is 
A-cocontinuous for any A such that C and D are A-cocomplete. 


Proof A-cocontinuity of g says that the square 
9# 
R(A,C) > A (4, D) 
f4 Ji 


R(B,C) ~*s A(B,D) 
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commutes up to isomorphism, where gy is the functor ‘compose with g’. But if 
g has a right adjoint k in &, then (gz 4 ky); and, on taking right adjoints of all 
the functors in the above square, we obtain one which commutes by associativity 
of composition in K. T 


The connection between pointwise and linear A-cocompleteness is given by 
the following result. 


Lemma 1.1.18 Suppose R has pullbacks and cotensors with 2, and let A be a 
class of morphisms of R which is ‘stable under transfer across comma squares’ 
in the sense of 1.1.16(b). Let B be the class of all morphisms q of R which occur 
as the ‘upper edge’ of a comma square 


(flg) ——> c 
f 
A ———_-> B 


whose ‘lower edge’ f is in A. Then B is stable under pullback, and an object D 
of R is pointwise A-cocomplete iff it is linearly B-cocomplete. 


Proof The first assertion is immediate from the construction of comma objects 
from cotensors and pullbacks (cf. 1.1.4(c)): if we have a diagram 


P 


| 


(flg) 
A 


— 


| 


| 
| 


where the upper square is a pullback and the lower one is a comma. square, then 
the whole is a comma square. 
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Suppose now that D is linearly B-cocomplete; let f: A — B be a morphism 
in A, and form the comma square 


(flis) S 


B 
A 


We claim first that p has a left adjoint in &: the morphisms 14: A — A 
and f: A — B, together with the identity 2-cell f — f, induce a morphism 
g: A—(fl1p) such that pg ~ 1,4 and qg © f, and the universal 2-cell 
qgp = fp — q, together with the isomorphism pgp ~ p, induce a 2-cell gp — 1 
which is the counit of an adjunction (g 4 p) (the unit being the isomorphism 
= pg). Thus the functor p#: (A, D) — £ ((f | 15), D) is left adjoint to g*, 
and so we can define the left adjoint of f* = g#q# to be the composite q,p*. 
Thus D is A-cocomplete; to see that it is pointwise A-cocomplete, we simply 
factor a comma square whose bottom edge is in A as a pullback square whose 
top and bottom edges are in B, followed by a comma square as above whose 
right-hand edge is an identity morphism. 
Conversely, suppose D is pointwise A-cocomplete. Then it is certainly 
B-cocomplete, since B is a subclass of A; the only problem is to show linearity. 
Suppose given a pullback square 


- 


Po Sc 
f 
A-————> B 


with f € B; then by definition we have a comma square 


a—t sp 
a a b 
J- 
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with f’ € A’, so we may regard P as the comma object (f’ | bg). We may also 
construct the comma object 


now we have a comparison map u: P — (f |g) defined by setting pu = h, qu =k 
and taking the 2-cell fpu — gqu to be the canonical isomorphism fh = gk. 
But we also have a comparison map v: (f |g) — (f’ | bg) = P defined by setting 
ahv = ap, kv & q and by taking the 2-cell y: f'ahv S f’ap — bgq S bgkv to be 
that obtained by pasting the 2-cells a and £8 in the two comma squares above. 
Moreover, we have kvu = qu = k, ahvu = apu & ah and these isomorphisms 
identify y o u with a o h, so vu = 1p; and 8: fp > gq = gku = fhv S fpuv and 
the isomorphism ap = ahv = apuv induce a 2-cell p > puv , which in turn can 
be combined with the isomorphism q = kv = quv to yield a 2-cell 1; fy) > uv. 
It is again easy to see that this 2-cell, and the isomorphism vu = 1p, define an 
adjunction (v 4 u), so that (u# 4 v*). Now we have k,h* = kizu*p# = q4p*, 
since k,u* is left adjoint to v#k* & q#; so the Beck—Chevalley condition for 
the pullback square follows from that for the comma square. o 


Concerning the relation between cocompleteness and injectivity, we note the 
following. We say a morphism f: A — B in a 2-category K is fully monic if, 
for any object C of R, composition with f induces a full and faithful functor 
&(C,A) — &(C,B). For example, inclusions are fully monic in Yop, since if 
f» is full and faithful then any natural transformation f.g, — fh. must be 
of the form fx o a for a unique a: gẹ, — h+. (Similarly, geometric morphisms 
with full and faithful inverse image functors — which we shall christen connected 
morphisms in C1.5.7 — are fully epic in Yop.) 


Lemma 1.1.19 Suppose A is a class of morphisms of R such that D is either 
pointwise or linearly A-cocomplete. Then D is M-injective, where M is the class 
of full monomorphisms in A. 


Proof If f: A — B is fully monic, then it is easy to see that the square 


la 
— A 


A 
h f 
a f 


=> B 
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is a pullback in Ñ; it is not a comma square in general, but the comma. object 
(f Lf) coincides with (14 ]14). So the isomorphism f# f+ Œ 1 follows in either 
case from the isomorphisms 1 4, = 1% =S E 


Suggestions for further reading: Bénabou [99], Bird et al. [120], Bunge & 
Funk [199], Kelly [583], Kelly et al. [585], Kock [605], Power & Robinson [987], 
Street (1130, 1132]. 


B1.2 Indexed categories 


In the non-elementary parts of the theory of categories, we frequently find it 
necessary to talk about set-indexed families of objects or morphisms of a given 
category. Bearing in mind the viewpoint that a topos is a category of sets, the 
study of category theory relative to a topos S is going to demand that we should 
be able to talk about families of objects indexed by an object of S. The theory of 
indexed categories, and the (essentially equivalent) theory of fibrations which we 
shall expound in the next section, were created to provide a way of doing this. 

The important structure which indexed families possess is the possibility of 
re-indexing along a map: if x: I — J is a morphism in Set and (A; |j € J) isa 
J-indexed family of objects of a category C, we should be able to form the 
I-indexed family (A,(;) | ¿ € I). Thus the definition of an S-indexed category 
requires not only the specification of a category C7 of I-indexed families for each 
object I of S, but also of a re-indexing functor z*: CY — C7 for each x: I — J 
in S. (For the rest of Part B, we shall as far as possible reserve the letters I, J, K 
and x, y, z respectively for objects and morphisms of the base category S.) Our 
experience of the case S = Set might lead us to expect that the assignment 
x ++ x* should be (contravariantly) functorial on S; but the principles enunciated 
in the last section encourage us to make it merely pseudofunctorial, and this is 
supported by the examples we shall have occasion to consider. 

In order to effect a conceptual simplification in the definitions which follow, 
we shall use our informal notation CAF for the meta-2-category of categories, 
as explained in the last section; and we shall identify our base category S with 
the corresponding locally discrete 2-category. For the purposes of the definition, 
S is not required to have any structure beyond that of a category, but in all the 
examples of interest to us it will be at least cartesian. 


Definition 1.2.1 (a) An S-indezed category C is a pseudofunctor SP — CAT. 
By convention, we write C7 for the category C(I) (I € ob S) and z*: C7 — C! 
for the functor C(x) (x: I — J in S). If necessary, we shall write 0,,, for the 
coherence isomorphism z*y* — (yx)* (where (x,y) is a composable pair of mor- 
phisms of S), but in general we shall suppress all mention of these isomorphisms 
whenever it is safe to do so. If S has a terminal object 1, we shall call C? the 
underlying ordinary category of the S-indexed category C. 
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(b) If C and D are S-indexed categories, an S-indered functor F: C > Disa 
pseudo-natural transformation; that is, it assigns to each object I of S a functor 
F!:¢! — D! and to each morphism x: J + J a natural isomorphism ¢, as in 


ce) —2 sc! 


FY | bs Fi 


a 
D! > Di 
(again, we shall suppress the name of ¢, whenever possible), subject to the 
appropriate compatibility conditions with the 6; y. 

(c) If F and G: C — D are both S-indexed functors, an S-indexed natural 
transformation a: F — G is a modification; that is, it assigns to each I € ob S 
a natural transformation a? : F? — G’, subject to compatibility conditions with 
the coherence isomorphisms ¢, and yz of F and G. 


For a given S, the S-indexed categories, functors and natural transformations 
clearly form a (meta-) 2-category, which we shall denote by CATs. 


Examples 1.2.2 (a) Any ordinary category C may be made the underlying 
ordinary category of a Set-indexed category C, by setting C7 to be the product 
of I copies of C (i.e. the category of J-indexed families of objects of C, as classically 
understood). We shall refer to this as the naive indexing of C over Set. Similarly, 
ordinary functors and natural transformations extend to Set-indexed functors 
and natural transformations between the naive indexings of the corresponding 
categories. 

(b) However, even over Set an indexed category is not determined by its 
underlying ordinary category; there may be ‘non-naive’ ways of indexing the 
latter over Set. For example, if C = Sets we could define C7 to consist of those 
I-indexed families (A; | i € J) of finite sets such that the cardinality of A; is 
bounded (independently of i), and all /-indexed families of functions between 
them. More radically, for any C we could define C? to consist only of constant 
I-indexed families of objects of C. In Section B1.5 we shall investigate ‘stack 
conditions’ which may be used to distinguish the naive indexing from other 
possible indexings of a given category. 

(c) An example which will be of fundamental importance in all that follows: 
let S be a cartesian category, and define an S-indexed category S by S? = 
S/I, with z* taken to be the functor ‘pullback along z’ already denoted thus in 
A1.2.8. (Recall our convention that a category with pullbacks comes equipped 
with a choice of ‘canonical’ pullbacks (cf. A1.2.1), so we do indeed have such a 
functor x* for each morphism z of S. However, we do not expect the composite 
of two canonical pullback squares to be canonical, so the assignment x +> g* 
will not be strictly functorial — as it was in examples (a) and (b).) We note that 


B1.2 INDEXED CATEGORIES 261 


the underlying ordinary category of S is (isomorphic to) S, and we call S the 
canonical indexing of S over itself. In the case S = Set, A1.1.6 assures us that 
the canonical indexing is equivalent (in CATset) to the naive indexing. 

(d) More generally than (c), if we are given a functor F: S — C and C is carte- 
sian, then we may form an indexed category C over S by setting C? = C/F(J) 
and z* = pullback along F(z). If S has and F preserves 1, then the underlying 
ordinary category of C is isomorphic to C; and if F preserves pullbacks, then it 
extends to an S-indexed functor S — C in an obvious way. In fact, if S is carte- 
sian and CAAT denotes the meta-2-category of cartesian categories, cartesian 
functors and natural transformations, then this construction defines a 2-functor 
from the co-slice category S\EART to CATs. 

(e) Example (d) is itself a special case of a more general construction, namely 
change of base. If F': 7 — S is a functor and C is an S-indexed category, we 
obtain a 7-indexed category F*C by setting (F*C)! = CP and z* = (F(x))*. 
This construction also works on S-indexed functors and natural transforma- 
tions; thus the assignment S + €2% 5 itself becomes (contravariantly) functorial 
on CAT. (It is even 2-functorial, although it is also contravariant at the 
level of 2-cells: a natural transformation a: F — G yields an indexed natural 
transformation G*C — F*C whose I-component is (a;)*: CC® — CF.) 

(f) A special case of (e) which will be of particular importance: if S has 
finite products and F is the functor (—) x I: S — S, then we shall write C/ 
for F*C, where C is an S-indexed category. That is, (C’)’? = C’*/, and if 
x: J — K then 2*: (C/)* — (C?)Y is (x x 1,)*: C**! — C/*!. The assignment 
I+ C! is itself contravariantly functorial in J (as well as covariantly 2-functorial 
in C). In particular, we have an S-indexed functor A: C —> C? defined by 
Aa net) Cr, 

(g) Once again, suppose S is cartesian. Since monomorphisms in S are stable 
under pullback, the assignment I ++ Subs(J) defines an indexed category, in 
fact an indexed subcategory of the category S of example (c), which we shall 
denote by Sub (or Subs, if it is necessary to specify S). As we shall see, many 
of the categorical properties which we studied in Chapter Al can be interpreted 
as properties of this indexed category. 

(h) The following example will not be of major importance for us (except as a 
source of counterexamples), but it lies at the heart of the extensive subject known 
as categorical topology. Given a set I, let T(J) denote the set of all topologies 
on J, regarded as a preorder (in fact a complete lattice) with the opposite of 
the inclusion ordering — that is, there is a morphism U ~ W in T(J) iff U is 
finer than U’. If x: I — J is a function and Y € T(J), there is a unique coarsest 
topology z*V on I for which z is continuous, namely 


z*V = {xr} (V)| V EV}; 


this defines an order-preserving map z*: T(J) — T(J), and so makes the assign- 
ment I ++ T(J) into a Set-indexed category T. Note that, for each x: I — J, x* 
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has a left adjoint z4, which sends a topology U on I to the finest topology on J 
making x continuous, i.e. 


tU={VCJl|2'(V) eu}. 


We shall return to the discussion of such adjoints in Section B1.4 below. Other 
types of ‘topological’ structures that can be imposed on sets — uniformities, 
proximities and the like - may also be collected into Set-indexed preorders of 
this kind; it is a slight exaggeration, but not a vast one, to say that categorical 
topology is precisely the study of these Set-indexed preorders. 


A question which will frequently be of interest to us is whether a given 
ordinary functor between the underlying ordinary categories of two indexed cat- 
egories can be extended to an indexed functor. In this connection the following 
result, though seemingly rather special, is often useful: 


Lemma 1.2.3 Let F: S —C be a functor between cartesian categories, having 
a right adjoint R. Then R has a canonical extension to an S-indexred functor 
C — S, where S is the canonical indexing of S over itself, as defined in 1.2.2(c), 
and C is defined as in 1.2.2(d). 


Proof For each object I of S, let R! denote the composite 


R/FI nt 
C/FI ——> 8/RFI ——> S/I 


where ņ is the unit of (F 4 R). To verify that this is an indexed functor, let 
x: I — J be a morphism of S, and consider the diagram 


R/FJ n3 
C/ FI ——=> S/RFJ —— S/J 
(Fr)* (RFr)* a 
R/FI nT 


Giri S/ RFI SST 


Here the left-hand square commutes up to isomorphism because R, being a right 
adjoint, preserves pullbacks, and the right-hand one commutes by naturality of 7. 
o 


In the case where F preserves pullbacks, and so extends (as we saw in 1.2.2(d)) 
to an S-indexed functor S — C, it is easy to verify that R is an indexed right 
adjoint to F -— that is, for each object I we have an adjunction (F7 4 R7), and the 
units and counits of these adjunctions form S-indexed natural transformations. 


Remark 1.2.4 It is also easy to verify that if F has a left adjoint L, then 
F’ has a left adjoint L? for each J; explicitly, L7 sends an object (A — FT) of 
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C/FI to its transpose (LA —> I) in S/I. However, the L/ do not in general form 
an S-indexed functor C — S; note in particular that the commutativity up to 
isomorphism of the square 


(FI)* 
C ——->C/FI , 


L L! 


* 


SSS 


which would be required for the L/ to form an indexed functor, is just the 
‘Frobenius reciprocity’ condition L(A x FI) = LA x I of A1.5.8. We may thus 
deduce that if S and C are locally cartesian closed categories and F: S — C is a 
functor having a left adjoint L, then the S-indered functor S — C of 1.2.2(d) has 
an indexed left adjoint iff F is locally cartesian closed (i.e. F/I: S/I + C/FI is 
a cartesian closed functor for each I). 


When dealing with S-indexed categories, we naturally wish to be able to han- 
dle not just individual indexed functors and natural transformations between 
them, but also J-indexed families of such things, where I is any object of S. 
Provided S has finite products, the construction of 1.2.2(f) equips us with the 
means of doing so: we define an J-indexed family of functors C — D to bea 
single indexed functor C — D/, and similarly for natural transformations. If 
xz: I — J in S, we have an indexed functor z*: DY — D7, whose K-component 
is (1x x x)": D**/ — DE XI, these functors are themselves pseudo-functorial in 
xz, and so composition with them can be used to define ‘re-indexing’ for families of 
functors: that is, if we write [C, D] for the ordinary category of indexed functors 
C — D and indexed natural transformations between them, then the functors 
(F + 2* o F): [C, D7] — [C, D7] form the transition maps of an S-indexed cat- 
egory which we denote [C, D], the indexed functor category of indexed functors 
C — D. 

We shall be mainly concerned with indexed functor categories [C,D] when 
C is an internal category, a concept that we shall meet in Section B2.3 below. 
However, the following simple example is instructive: 


Example 1.2.5 Let J be an object of S, and let I denote the S-indexed cat- 
egory such that Z7 is the discrete category whose objects are all morphisms 
J — I in S, and such that if z: K — J in S then z*: T7 — T* is the operation 
of composition with x. Given any S-indexed category D, each object A of D! 
defines an indexed functor I — D sending y: J — I to y*A € ob DY; but every 
indexed functor F: I > D is (canonically) isomorphic to the one obtained in this 
way from the object F7(1,) of D!. Thus we have an equivalence between D/ and 
the functor category |I, D}; more generally, [I, D7] is equivalent to D/*/ (and 
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hence to DY*!). So we may conclude that the indexed functor category [I, D] is 
equivalent to D’, as defined in 1.2.2(f). 


We may also define indexed families of functors C — D when C is an ordinary 
category and D is an S-indexed category: in this case, an J-indexed family of 
functors means simply an ordinary functor C > D/. Once again, these may be 
made into (the J-indexed families of objects of) an S-indexed category, which 
we denote by {C, D]. It is straightforward to verify that if C and D are the naive 
Set-indexings of ordinary categories C and D, then both [C,D] and [C,D] are 
equivalent to the naive indexing of the ordinary functor category [C, D]. 


Suggestion for further reading: Paré & Schumacher [933]. 


B1.3 Fibred categories 


The notion of indexed category, though conceptually clear and attractive, is 
not exactly an elementary one: it is a collection of (potentially) ‘large’ entities 
indexed by a ‘large’ entity, and such things are hard to handle in set-theoretic 
metatheories, unless we make powerful assumptions such as the existence of 
‘Grothendieck universes’. However, there is a way of combining all the data 
in an indexed category into a single entity (itself due to Grothendieck, and 
commonly called the Grothendieck construction), which leads to an elementary 
theory which is equivalent (in all essential aspects) to that of indexed categories. 
We devote the present section to studying some aspects of this theory. 


Definition 1.3.1 Let S be a category, and C an indexed category over S. We 
define the Grothendieck category G(C) of C as follows: its objects are pairs (I, A) 
where I is an object of S and A is an object of C7, and morphisms (J, A) — (J, B) 
are pairs (x, f) where z: J — J in S and g: A > x*(B) in C7. The composite of 
(x, f) and (y, g): (J, B) — (K,C) is (yx, h), where h is the composite 


foo. #9) y  Ox,y(C) A 
A ——> 2"(B) ——> 2" y"(C) ———> (yz)"(C); 
the coherence conditions for the 0, ensure that this composition is associative, 
and so G(C) is a category. There is an obvious functor II: G(C) — S sending 
(I, A) to I and (z, f) to z. 


We should remark that a particular case of the Grothendieck construction 
(albeit for a covariant rather than a contravariant functor) has been seen much 
earlier on in this work: in fact in the proof of A1.1.7. If we are given a functor 
F: S — Set, and we regard it as a Cat-valued (pseudo)functor taking discrete 
categories as values, then the Grothendieck construction applied to it yields the 
category F which we considered there. We shall have more to say about this 
special case later on. 

The fibre of IT over the object J (i.e. the non-full subcategory of G(C) con- 
sisting of the objects mapping to J and the morphisms mapping to 17) is just 
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(an isomorphic copy of) the category C7, and we shall identify it therewith. It 
is convenient to picture the base category S as spread out horizontally and the 
fibres C7 as running vertically above it, as in the following picture: 


c! cI 
A 
z2*(B) ———> B 
(z,1) 
I ELENEN J S 


With this picture in mind, we shall call a morphism (x, f) of G(C) vertical if 
it lies in a fibre of II (i.e. if z is an identity morphism), and horizontal if f is 
an identity morphism. As the above diagram indicates, every morphism of G(C) 
can be factored uniquely into a vertical morphism followed by a horizontal one. 

Up to isomorphism, the horizontal morphisms can be described purely in 
terms of the functor II: G(C) — S: 


Lemma 1.3.2 Let h:U — V be a morphism of G(C). Then the vertical part 
of the factorization of h is an isomorphism iff, given any morphism k: W > V 
with the same codomain and any factorization i(k) = U(h) o x of (k) through 
Il(h), there is a unique l: W 4 U in G(C) with I(l) = z and hl = k. 


Proof Suppose h = (y,f) where f is an isomorphism, and let k = (yz, 9). 
Then the required factorization of k through A is easily seen to be (z, 2*(f~!) og) 
(modulo composition with a coherence isomorphism whose name we have sup- 
pressed). Conversely, if kh = (y, f) satisfies the given condition, take k = (y, 1) 
and z = 1; the second component of the unique vertical factorization of k through 
h must be a two-sided inverse for f. E 


Now let II: C — S be an arbitrary functor. Inspired by the foregoing result, 
we shall call a morphism of C Il-prone (or simply prone, if there is no need 
to specify II) if it satisfies the condition in the statement of Lemma 1.3.2, 
and I-vertical if its image under II is an identity morphism. Note that this 
condition is not self-dual, which is our reason for abandoning the word ‘hori- 
zontal’; a morphism which satisfies the dual condition (i.e. one which is prone 
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with respect to IT°P: COP — S°P) will be called I-supine. (Prone morphisms are 
more commonly called ‘cartesian’, but this word has been rather overworked by 
category-theorists, and deserves a rest. The terminology we have adopted, which 
conveniently combines the idea of lying horizontally with a ‘handedness’, was 
suggested by P. Taylor [1161].) We shall need the following properties of prone 
morphisms: 


Lemma 1.3.3 
(i) Let f: A— B andg: B — C be morphisms of C such that g is prone. Then 
the composite gf is prone iff f is. 


(ii) Given a commutative square 


| EE 
k 
C ———> D 


in C such that k is prone and g and h are vertical, then f is prone iff the 
square is a pullback. 


Proof Straightforward verification. 0 


By 1.3.3(i), the prone morphisms of C form a subcategory Cp (it is clear that 
they include all identity morphisms, and indeed all isomorphisms of C); and for 
each object A of C the slice category Cp/A is a full subcategory of C/A. We shall 
also write C? for the fibre of C over an object J of S, i.e. the subcategory of 
objects mapping to J and vertical morphisms between them. 


Definition 1.3.4 A functor II: C > S is called a fibration if, for any object 
A of C and any morphism z: I — H(A) in S, there exists a I-prone morphism 
f: B —AinC with I(f) =z. 


We do not, in general, demand that f should be unique; but it will automat- 
ically be unique up to vertical isomorphism; that is, if two prone morphisms f 
and f’ have the same codomain and the same image under II, there is a unique 
vertical isomorphism v such that f’ = fv. 

If If: C — S is a fibration, a cleavage for II is a function o assigning to each 
pair (A,x) with (A) = cod z a prone morphism f = o(A,2) with cod f = A 
and II(f) = z. We shall usually assume that ø is normalized in the sense that 
o(A,1nay) = la; if not, it is easy to modify ø to achieve this. We say ø is a 
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splitting if it satisfies the compatibility condition 
o(A, yz) = o( A, y) 0 o(dom o(A, y), £) 


for every composable pair of morphisms (x,y) in S; in general, it is not possible 
to modify a given o to achieve this. By a cloven (resp. split) fibration we mean 
a fibration equipped with a choice of a cleavage (resp. a splitting). 

The notion of cloven fibration may be defined in an essentially algebraic way — 
that is, it may be specified by partial operations whose domains are defined 
by equations in the previously specified operations, and satisfying equations 
(cf. [371]) - and so it may be interpreted in the internal logic of any cartesian 
category. Specifically, the structure involved is that of a pair of categories C and 
S, a functor Il: C — S, the operation o assigning to pairs (A, x) with II(A) = 
cod z a morphism o(A, x) of C with Ti(o(A,z)) = z and cod o(A, x) = A, and 
an operation 7 assigning to a triple (x,y, f) with (x,y) a composable pair in S 
and II(f) = yx the unique morphism r(z, y, f) satisfying U(r(z, y, f)) = x and 
o(cod f,y)°7(z,y, f) = f. We leave to the reader the task of formulating the 
other equations which o and r must satisfy. 

For the fibration IT: G(C) — S arising from an S-indexed category C, we 
have a canonical choice of cleavage given by o( A,r) = (z£, 12+ a); this cleavage is 
a splitting iff C is a strict functor and not just a pseudofunctor. In the converse 
direction, we have 


Theorem 1.3.5 Let II:C — S be a fibration, and o a cleavage for Ti. Then 
there is a pseudofunctor C: SP — CAT sending an object I of S to C', and 
such that the Grothendieck construction applied to this indexed category yields a 
fibration isomorphic (in CAV/S) to I. Moreover, C is a strict functor iff o is a 
splitting. 


Proof More straightforward verification. Given a morphism z: J — J in S, 
the induced functor 2*: C7 — C! sends an object A of C7 to dom o(A, 2), and a 
morphism f: A — B to the unique vertical factorization of f o o( A,x) through 
o(B,x). The functoriality of x* follows from the uniqueness of such vertical 
factorizations. Given y: J — K and an object C of C*, we have a canonical 
vertical isomorphism z*y*C — (yx)*C arising from the fact that both objects 
are the domains of prone morphisms to C; this defines the required natural 
isomorphism z*y* — (yx)*, and it is an identity (for all pairs (z,y)) iff o is 
a splitting. Finally, there is an obvious isomorphism G(C) — C which sends 
an object (J, A) to A and a morphism (z, f): (I, A) —> (J, B) to the composite 
o(B,x) of. E 


Thus S-indexed categories are ‘the same thing as’ cloven fibrations over S, 
and we shall henceforth use 1.3.5 as a licence to pass back and forth between the 
two kinds of structure, using whichever is more convenient at any given moment. 
In order to do this, we need to make 1.3.5 functorial; that is, to investigate 
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the concepts for fibrations which correspond to S-indexed functors and natural 
transformations. 
Rather against our general philosophy, a morphism of fibrations from 
I: C +S to 6: D—S will (in particular) be a functor F: C — D making the 
diagram 
F 
C ——__> D 


| 
Il 
S 


commute strictly, and not just up to isomorphism. (This is not really as arbitrary 
as it seems; if we had an F making the above diagram commute up to a natural 
isomorphism a: H — F, we could use a cleavage for © to ‘lift’ œ to a natural 
isomorphism F” — F, where F’ is a functor making the diagram commute 
strictly.) For F to be a morphism of fibrations, we further require that it should 
send II-prone morphisms to ®-prone ones (the fact that the diagram commutes 
ensures that it sends II-vertical morphisms to -vertical ones); however, if H and 
® are equipped with cleavages, we do not demand that F should commute with 
them in any sense. A (vertical) transformation between morphisms of fibrations 
F and G is of course a natural transformation a: F — G which is compatible 
with II and © (i.e. such that each a, is ®-vertical). We write Fibs (resp. cFibs, 
sibs) for the 2-category whose objects are fibrations (resp. cloven fibrations, 
split fibrations) over S, with the above morphisms and transformations. 


Theorem 1.3.6 The Grothendieck construction yields an equivalence of 
2-categories between CATs and ibs. 


Proof Straightforward verification again. E 


Examples 1.3.7 (a) Let S be a cartesian category. The fibration which corre- 
sponds to the S-indexed category S of Example 1.2.2(c) is simply cod: [2, S] — S, 
where 2 is the finite category represented diagrammatically by 


e——-—_> e 


(so that functors 2 > S may be identified with morphisms of S, and natural 
transformations between them with commutative squares in S) and cod is the 
functor sending a morphism of S to its codomain. It is clear that the fibre of 
this functor over I is exactly the slice category S/J; and a commutative square 
in S, regarded as a morphism of [2, S], is prone relative to cod iff it is a pullback 
square, from which it follows that the transition functors between the fibres 
are indeed the pullback functors defined in A1.2.8. Indeed, it is not hard to 
see that, for an arbitrary S, the functor cod: [2,S] — S is a fibration iff S has 
pullbacks — and that the difference between the two possible interpretations 
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of the statement ‘S has pullbacks’, on which we touched after Lemma A1.2.1, 
is precisely the difference between merely asserting that cod is a fibration and 
asserting that it is a cloven fibration. We remark that cod is also an opfibration 
(that is, cod®?: [2, S]°P — S° is a fibration), and indeed a split one: supine 
morphisms of [2,S] are those squares whose ‘top edge’ is an isomorphism, and 
the splitting sends (zx, y) to the square 


po SS 


B, 


We shall see in 1.4.5 below that this property of cod corresponds to the fact 
that the transition functors z* of the corresponding indexed category have left 
adjoints ©, (and that the latter are strictly functorial in z). 

(b) More generally, if F: S — C is a functor between cartesian categories, then 
the fibration corresponding to the S-indexed category C of 1.2.2(d) is simply the 
projection functor P2: Gl(F) — S, where GI(F) is the category obtained by 
glueing along F, as defined in A2.1.12. 

(c) Again, the fibration corresponding to the indexed category Sub of 1.2.2(g) 
is (the restriction of cod to) the full subcategory Mono(S) of [2, S] whose objects 
are monomorphisms in S. 

(d) The fibration corresponding to the indexed category T of 1.2.2(h) is none 
other than the forgetful functor Sp — Set; for the objects of its domain are 
simply sets equipped with a topology, and morphisms (1,4) — (J, V) are func- 
tions x: J + J such that U D x*V, i.e. such that z is continuous for the given 
topologies. Note that this functor is also an opfibration; again, this corresponds 
to the fact that the z* have left adjoints, as we noted in 1.2.2(h). 


Change of base for indexed categories, as defined in 1.2.2(e), corresponds to 
pullback of fibrations, in the following sense: 


Lemma 1.3.8 Let F:S —T be a functor, and C a T-indezed category. Then 
there is a pullback square 


G(F*C) ——> gG(C) 


} Il 


Se eT 


where the vertical arrows are fibrations. 
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Proof By definition, the objects of the pullback are pairs (J,(J,A)) where 
I € ob S, (J,A) € ob G(C) and FI = I(J, A) = J; but these correspond 
exactly to the objects (I, A) of G(F*C). It is straightforward to verify that this 
correspondence defines an isomorphism of categories. E 


Once again, it will be observed that the notion of pullback used in the above 
proof is the ‘strict’ one, and not the pseudo-limit as described in Section B1.1. 
However, for fibrations there is no essential difference: 


Lemma 1.3.9 ZLetIl:C — T be a cloven (resp. split) fibration, and F: S > T 
an arbitrary functor. Then the strict pullback of I along F is a cloven (resp. 
split) fibration, and it is also a pseudo-pullback. 

C 


Seat E 


Proof Let us denote the (strict) pullback by 


Jalas 


Then, given an object A of D and z: J — @A in S, it is easily verified that 
(x,o(GA, Fx)) is a prone morphism in D with codomain A for any cleavage o 
of IJ, and that the assignment (A, x) +> (x,o (GA, F2)) is a cleavage of ® (and 
a splitting if o is). For the second assertion, we note that the usual construction 
of the pseudo-pullback of F and I in CAT would produce a category D whose 
objects are triples (J,z, A) with I € ob S, A € ob C and z: FJ > IIA an 
isomorphism in 7. Clearly the assignment (J, A) > (J, lr, A) defines a functor 
D => D; and when II is a cloven fibration we can construct an inverse for it 
up to natural isomorphism, sending (I, z, A) to (J,dom(o(z, A))). So the strict 
pullback is equivalent to the pseudo-pullback, and hence shares the universal 
property of the latter. E 


Our convention that a cartesian category comes equipped with a choice of 
‘canonical’ finite limits will ensure that virtually all the fibrations we have to 
consider will come equipped with cleavages, although many of them will not 
have splittings. However, if we are simply given a fibration (or even an arbi- 
trary functor), we can always replace it by a ‘best approximation’ amongst split 
fibrations, by the following device: 


Definition 1.3.10 Let II: C — S be an arbitrary functor. The associated split 
fibration of I is the functor II: Ê — S, where an object of C consists of an object 
A of C together with a functor 0: S/HA — C/A such that every o(z) is II-prone 
and satisfies I(o(x)) = x. (As usual, we also demand that o(1na) = 14.) A 
morphism (A,a) — (A’,o’) in C consists of a morphism z: HA > ITA’ in S 
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together with a morphism f: A — dom o’(z) in C™4 (which of course induces, 
for each y with codomain IIA, a unique morphism dom o(y) — dom o’(zy) in 
càm y), Composition and identity morphisms in Ĉ are defined in the obvious 
way; the functor II sends (A, c) to ILA and (æ, f) to z. 


Il is a split fibration: to define the splitting at an object (A,o) and a mor- 
phism xz: J — IIA, we take its domain to be (dom o(z), a’) where o’(y) = o(y) 
for all y: J > I (and y is considered on the left of the latter equation as an 
object of S/I, and on the right as a morphism ry > z in S/JIA). There is a fac- 
torization U: C — C of II through II, sending (4,0) to A and (z, f) to o’(z) of; 
this functor is always full and faithful. If I is a cloven fibration with cleavage ø, 
then we can define an inverse (up to isomorphism) for U, sending an object A 
of C to (A, o(A, —)). Thus we have shown that every cloven fibration over S is 
equivalent (in CAL/S) to a split fibration. 

There is an important special case in which no arbitrary choices are required 
to obtain a splitting of a fibration. We shall say a category is stiff if its only 
isomorphisms are identity morphisms; and that a fibration II: C — S is stiff if 
its fibres C/ are all stiff categories, i.e. the only vertical isomorphisms in C are 
identities. (Stiffness is the conjunction of two more familiar categorical properties 
usually called ‘skeletal’ and ‘rigid’.) If II has this property, then any morphism 
x: I — J of S has a unique prone lifting with codomain A, for each A € ob CY; so 
II has a unique cleavage, which is in fact a splitting. As a further specialization, 
we may consider discrete fibrations, which are those in which every fibre C7 is a 
discrete category. An important property of discrete fibrations is worth recording: 


Lemma 1.3.11 Let Il:C — S and ®:D—S be fibrations, ® being discrete, 
and let F:C — D be a functor satisfying OF = II. Then 


(i) F is a morphism of fibrations. 
(ii) F is itself a fibration. 
(iii) F is a discrete fibration iff II is. 


Proof (i) Every morphism of D is ®-prone, for in its vertical-prone factoriza- 
tion the vertical part must be an identity morphism. So this is immediate from 
the definition of a morphism of fibrations. 

(ii) First we claim that a morphism f: A — B of C is Il-prone iff it is F-prone. 
For suppose f is I-prone and we are given k: C — B and a factorization F(h) = 
F(f)g in D; then H(A) = II(f)®(g) so that we have a unique k: C — A with 
fk = h and W(k) = ®(g), and then F(k) = g since both are (necessarily prone) 
liftings of ®(g) with codomain F(A). Conversely, if f is F-prone, then since F(f) 
is -prone it follows easily that f is II-prone. 

Now, given an object A of C and a morphism f: B — F(A) in D, we have a 
Il-prone g: C — A with II(g) = ®(f), and then F(g) = f since both are liftings 
of ®(f) with codomain F(A). So g is an F-prone lifting of f. 
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(iii) Given (ii), this is almost immediate from the definition of a discrete 
fibration. E 


Given the fact that discrete fibrations over a given category S with small 
fibres correspond to (necessarily strict) functors S°P — Set, and that morphisms 
of fibrations between them correspond to natural transformations, the conjunc- 
tion of (i) and (iii) of Lemma 1.3.11 is simply a reformulation (for contravariant 
functors, this time) of Proposition A1.1.7. 

An important concept in the theory of fibrations (or of indexed categories) is 
that of comprehensibility (and the special case of it known as definability). This 
concept is so widespread that it is hard to give a simple definition of it which 
will cover all cases of interest. However, the basic idea is as follows: suppose 
given a construction which, when applied to some finite collection of data is an 
ordinary category C, would produce a class (or if C were ‘small enough’, a set) 
of ‘structures’ on the given data. Then, if we apply the construction to data 
in the total category of a fibration Il: C — S, we should expect to obtain not 
just a single class but a contravariant functor defined on S/I x --- x In (where 
the given data lie in the fibres C",...,C!"); and we say that the result of the 
construction is comprehensible in C if this functor is representable (for all choices 
of the given data). 

To make this more precise, we shall interpret the notion of ‘collection of data’ 
as being given by a diagram of some (fixed) finite shape, and ‘structure’ as being 
given by an extension of the diagram to a larger one. Given a fibration I: C — S 
and a finite category D, we define the rectangular diagram category Rect(D,C) to 
be the non-full subcategory of [D, C] whose objects are the ‘vertical diagrams’ of 
shape D (i.e. diagrams with all edges vertical) and whose morphisms are natural 
transformations with all components prone. Then we say that II: C — S satisfies 
the comprehension scheme for a functor G: D' — D if the functor Rect(D,C) + 
Rect(D’,C) obtained by composition with G has a right adjoint. 

In order to understand this rather opaque definition, it is best to consider a 
few examples. 


Example 1.3.12 We say that a fibration Il: C — S (or the corresponding 
S-indexed category) is locally small if it satisfies the comprehension scheme for 
the inclusion functor 2 —> 2, where 2 is the discrete category with two objects 
and 2 is the category (e — e). In elementary terms, this means the following: 
given any two objects A and B of C, there exists an object (x,y): J — IA x IIB 
of S/TIA x IIB, and a morphism f: 2* A —> y*B in the fibre C? which is generic 
in the sense that, given any (z,w): J > HA x IIB and any g: z*A — w*B in 
C7, there is a unique u: J > I such that ru = z, yu = w and u*f is identified 
(modulo the coherence isomorphisms) with g. 


If C is any locally small category (in the ‘classical’ sense), then the naive 
indexing of C over Set is a locally small indexed category: given objects A = 
(A; | j € J) and B = (Bp | k € K), we take J to be the disjoint union, over all 
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pairs (j,k), of the hom-sets C (A;, Bk), with the obvious projection to J x K. (It is 
easy to see that the converse is true, by considering objects in the fibre over 1.) 
However, local smallness of the fibres in the classical sense is not a sufficient 
condition for local smallness of a fibration over Set: for example, the fibration 
Sp — Set of 1.3.7(d) is not locally small, even though its fibres are preorders. (To 
see this, let U¢ and Y be topologies on sets J and K respectively; then since there is 
a unique topology on a singleton set we see that any morphism (j,k): 1 = J x K 
would have to factor uniquely through the morphism J — J x K of 1.3.12 if it 
existed, and hence the latter would have to be bijective. But this would imply 
that we had mU > 73¥Y as topologies on J x K, which is clearly not true in 
general.) 

It is also easy to see that the discrete fibration corresponding to the indexed 
category I of Example 1.2.5 is locally small iff S has pullbacks over J (ie. the 
slice category S/I has finite products), since the object P — J x K representing 
vertical morphisms from z: J — I to y: K — I must be exactly a pullback of x 
against y. We shall investigate local smallness for indexed categories in greater 
detail in Section B2.2; in particular, we shall see there that the canonical indexing 
S of a cartesian category S over itself is locally small iff S is locally cartesian 
closed. 

The comprehension scheme works best when (as in the above example) the 
functor G: D’ — D is bijective on objects. If we apply it to a functor without 
this property, we can obtain surprising results, as the following example shows. 


Example 1.3.13 One might be tempted to call a fibration II: C — S small if it 
(is locally small and) satisfies the comprehension scheme for the inclusion 0 — 1, 
where 0 is the empty category and 1 the singleton category. In elementary terms, 
this means that the subcategory C, of prone morphisms of C has a terminal 
object. However, although this condition is of use in certain contexts, when 
considered as a definition of smallness it says both rather less and rather more 
than one might wish. On the one hand, it asserts the representability of the 
functor which to an object J of S assigns the collection of isomorphism classes 
of objects of C’, rather than ob C7 itself; thus it is really a notion of ‘essential 
smallness’. (Of course, for a fibration which does not possess a splitting, the 
assignment J ++ ob C7 cannot be made into a functor on S°, so it does not 
make sense to ask that it should be representable.) But it also implies that 
the fibres of C are rigid, i.e. that any vertical automorphism in C is an identity 
morphism; for if an object A admits a vertical automorphism f and a prone 
morphism g: A —> B, then gf is also prone, and the equality gf = g forces 
f = 1a. In Section B2.3 below, we shall develop a more satisfactory notion of 
‘small S-indexed category’ from an entirely different starting-point. 


In certain cases, we shall wish to consider a ‘generalized comprehension 
scheme’ where we seek a right adjoint not for Rect(D,C) — Rect(D’,C) but 
for the restriction of this functor to some full subcategory of its domain, defined 
by requiring the objects to be diagrams having some additional property (e.g. 
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that certain edges should be monic, or that certain parts of the diagrams should 
be limits). Of course, in this case we must restrict our attention to fibrations 
whose re-indexing functors preserve the given properties of diagrams. A good 
example is the following: 


Example 1.3.14 Let II: C — S bea fibration such that the transition functors 
z*:C! + C4 induced by morphisms z: J — I of S preserve monomorphisms. 
(Note that this will be the case if the x* have left adjoints — a condition 
which we shall explore more fully in the next section.) We say that II (or 
the corresponding S-indexed category) is well-powered if the forgetful functor 
U: Q(2,C) — Rect(1,C) has a right adjoint, where Q(2,C) is the full subcate- 
gory of Rect(2,C) whose objects are vertical monomorphisms and U sends each 
monomorphism to its codomain. In elementary terms, this means that for each 
A € ob C we have a morphism z: J — ITA in S and a subobject A’ > 2*A in 
C! such that, for any y: J > IIA and any A” >> y*A in C’, there is a unique 
z: J + I such that rz = y and 2*(A’) = A” in Subes(y*A). (Note that, once 
again, the fact that we are dealing with (possibly) non-split fibrations forces 
us to interpret the word ‘subobject’ here as meaning an isomorphism class of 
monomorphisms with given codomain, and not an individual monomorphism; 
but in the present context this is what we should wish to do anyway. Note also 
that the enforced rigidity, which we observed in 1.3.13, does not bother us here, 
since categories of the form Sub(A) are (preorders, and hence) rigid in any case.) 

It is clear that the naive indexing of an ordinary category C over Set yields 
a well-powered fibration iff C is well-powered in the classical sense, as defined in 
Section A1.4. More interestingly, the canonical indexing S of a cartesian category 
S over itself is well-powered iff each of the slice categories S/I have power objects 
as defined in Section A2.1; but by A2.3.2 this is equivalent to saying that S 
is a topos. 

For future reference, we note that the canonical indexing of a topos S over 
itself is also well-copowered (that is, S°P is well-powered). Since a topos is effec- 
tive regular (and since all epimorphisms in it are regular), we know that quotients 
of a given object I of S correspond bijectively to equivalence relations on J, in 
the sense of A1.3.6; and the latter can be ‘classified’ by an appropriate subobject 
E(I) of P(I x I). Moreover, this construction works in any slice category S/J; 
and it is preserved by pullback functors since they are logical. So we deduce that 
S/E(1) contains a generic quotient of J, in the sense required by the definition. 


Definability is the special case of comprehensibility where we are concerned 
with properties of a collection of data, rather than constructions on it (or equiv- 
alently, with constructions that produce either a singleton or nothing at all). 
Typically, this occurs when the functor G along which we are trying to extend 
diagrams is an epimorphism in Cat. Two examples, which will be of importance 
in Section B2.4, are definability of equality for parallel pairs of morphisms, and 
definability of invertibility for morphisms. The first is simply the comprehension 
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scheme for G: P — 2, where P is the parallel-pair category (e <3 e) and G iden- 
tifies the two non-identity morphisms of P, and the second is the comprehension 
scheme for the inclusion 2 — T, where Z is the trivial connected groupoid with 
two objects. 


Lemma 1.3.15 Let S be a category with equalizers. Then any locally small 
fibration 0: C — S has definable equality. If S also has pullbacks, then any such 
C also has definable invertibility. 


Proof (i) Given (f,g): A= B in C! for some I, let J — I x I be the object 
of S/I x I indexing morphisms from A to B. Then f and g correspond to 
morphisms J = J in S, and we have simply to take the equalizer of this pair to 
obtain a subobject I’ — I such that an arbitrary z: K — I factors through I’ 
iff z* f = x*g. But this is precisely what we seek. 

(ii) Now consider a single morphism f: A — B in C’. Let z: J — I be the 
pullback along the diagonal J — I x I of the object indexing vertical morphisms 
B— Ain C, and let g: x*(B) — x*(A) be the generic such morphism in C7. If 
we form the intersection K of the two subobjects of J measuring the equality of 
the pairs (x*(f) og, 1,+(p)) and (go z*(f), 12+(ay) in C7, then factorizations of 
an arbitrary y: L — I through the composite K — J — I correspond bijectively 
to inverses for y*(f) — but since y*(f) has at most one inverse for any given y, 
it follows that this composite must be a monomorphism, and it is the subobject 
of I which we seek. 0 


Generalizing the proof of 1.3.15, it can be shown that if S is cartesian and 
II: C > S is a locally small fibration, then IJ satisfies the comprehension scheme 
for any G: D! > D which is bijective on objects. 

We shall also be interested in definability of subfibrations of a given fibration. 
Of course, if C is equipped with a fibration H: C —> S, we say that a subcategory 
C’ of C is a subfibration if, whenever a prone morphism of C has its codomain in 
C’, then the whole morphism belongs to C’; this clearly implies (and, for replete 
subcategories, is equivalent to saying) that the restriction of II to C’ is still a 
fibration and that the inclusion C’ — C is a morphism of fibrations. We say that 
a subfibration C’ is definable in C if the inclusion Rect(2,C’) — Rect(2,C) has a 
right adjoint. (Note that this inclusion is full even if C’ — C is not; so this is a 
generalized comprehension scheme as we defined it earlier.) In elementary terms, 
it means that for every f: A— B in C’, we can find a subobject I’ > I such 
that an arbitrary x: J — I factors through I’ iff z*f lies in C”. 

For example, if an ordinary category C is indexed over Set in the naive way, 
as in 1.2.2(a), then the definable subfibrations of the corresponding fibration are 
just (the fibrations corresponding to the naive indexings of) the subcategories 
of C: if C’ is such a subcategory, and (f; | i € T) is a morphism of C7, then the 
subset I’ C I which measures the extent to which this morphism is in C’ is just 
{i € I | fı € mor C’}. But this fibration may have other full subfibrations which 
are not definable, for example that described (for C = Set) in 1.2.2(b). 
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Slightly less trivially, we note that if S is a Heyting category, then (the 
fibration corresponding to) the S-indexed category Sub of 1.2.2(g) is definable 
as a full subfibration of that corresponding to the canonical indexing S of 1.2.2(c). 
To see this, let x: K — J be an arbitrary morphism of S. We have to show that 
there is a subobject y: I — J such that, for any z: L > J, z*(z) is monic iff 
z factors through y. To this end, form the kernel-pair (a,b): R= K of x, and 
the diagonal morphism d: K — R; now take y to be V,,_(d). Then we see that z 
factors through y iff z*(d) is an isomorphism; but since z* preserves kernel-pairs 
and equalizers this is equivalent to saying that the two morphisms forming the 
kernel-pair of z*(x) are equal, i.e. that z*(x) is monic. 


Lemma 1.3.16 

(i) If C’ is a definable subfibration of C, then the inclusion Rect(1,C') > 
Rect(1,C) has a right adjoint. 

(ii) Suppose S has pullbacks. If C' is a full subfibration of C such that 
Rect(1,C’) — Rect(1,C) has a right adjoint, then C' is definable. 

(iii) Suppose S has pullbacks and TI: C — S is locally small. Then a subfibration 
C' of C is definable iff (a) C' is locally small and (b) the full subfibration 
C” of C with the same objects as C' is definable. 


Proof (i) We may obtain this right adjoint by restricting the right adjoint of 
Rect(2,C’) — Rect(2,C) to the full subcategory of Rect(2,C) whose objects are 
identity morphisms in C. 

(ii) Conversely, if C’ is full then a (vertical) morphism of C belongs to it iff 
its domain and codomain both do so. So, given a morphism f: A — B in C’, we 
form the intersection of the subobjects J’ — I and I” > I which ‘measure the 
extent to which A and B are in C”, and this yields the subobject which measures 
the extent to which f is in C’. 

(iii) If C’ is definable, then the definability of C” follows from (i) and (ii), since 
Rect(1,C’’) coincides with Rect(1,C’). Also, if A and B are any two objects of 
C’, let (x,y): I — IIA x IIB be the object indexing vertical morphisms from A 
to B in C, and let u: I’ — I measure the extent to which the generic vertical 
morphism f:2* A — y*B is in C’. Then it is clear that (ru, yu): I’ > ITA x HB 
indexes morphisms from A to B in C’, so C’ is locally small. 

Conversely, suppose C” is definable and C’ is locally small. Given a morphism 
f: A—B in C!, we can find u: I’ — I measuring the extent to which A and 
B both belong to C’, as in (ii). The morphism u*f may not lie in C’, but it 
corresponds to a commutative diagram 


i ruf) 
Toe 
(x,y) 


xI 
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where (z,y) is the object indexing morphisms from u*A to u*B in C. Now let 
(x, y’): J! — T x I’ be the object indexing morphisms from u* A to u*B in C’; 
then the generic morphism z’*u* A — y'*u*B defines a morphism J’ — J in 
S, which is easily seen to be monic. Pulling back this monomorphism along 
ru* f, we obtain a subobject I” of I’; and then the composite monomorphism 
I" — I’ > I measures the extent to which f lies in C’. So C’ is definable in C. 


We note, in particular, that a locally small fibration may have subfibrations 
which are not locally small. 

A question of considerable interest to us is when comprehensibility properties 
are preserved under change of base. Despite appearances to the contrary, the fol- 
lowing proof does not require the base categories S and T to have finite products, 
but only pullbacks. However, we shall temporarily use the notation S/I x---xIn 
(where [\,..., In are objects of S for the category whose objects are objects J 
of S equipped with an n-tuple of morphisms (zp: J > Ip | 1 < p < n); of 
course, if S does have products, this category is isomorphic to the slice category 
usually denoted thus. Note also that if we are given an n-tuple of morphisms 
(yp: Ip > Kp |1<p < n), then the functor S/D x---x I, > 8/K, Xx- x Kn 
obtained by composing with the y, still has a right adjoint, obtained by pulling 
back consecutively along each of the y, (in any order). 


Proposition 1.3.17 Let S and T be categories with pullbacks, F: S > T 
a functor having a right adjoint R, and 1:C—>T a fibration. Then 
F*tl: F*C — S satisfies any (generalized) comprehension scheme satisfied by II. 


Proof For simplicity, we shall deal only with the original case of a right adjoint 
for the functor Rect(D,C) — Rect(D’,C) induced by a functor G: D' — D. The 
extension to appropriate full subcategories of Rect(D,C) is an easy exercise. 

Let noD denote the set of connected components of D: then it is easily 
seen that applying II to objects and morphisms of Rect(D, C) yields a functor 
Rect(D,C) > T°. Hence we may construct a diagram 


Rect(D, F*C) ———_—_——_> Reet(D’, F*C) 


Rect(D, C) Rect. 0) 


SP > gmP’ 


ToP ` qro 
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in which all faces commute, and the left and right vertical ones are 
pullbacks. 

From now on, we shall simplify further by assuming that D is connected, i.e. 
that roD is a singleton. (If not, we may deal separately with each component of 
D; note that G maps each component of D’ into a single component of D.) How- 
ever, we allow D’ to have n components (n > 0). Thus an object A: D! > F*C 
of Rect(D’, F*C) lies over an n-tuple of objects (h,...,In) of S; and its 
image FA in Rect(D’,C) lies over (F),...,FI,). Applying the right adjoint of 
Rect(D,C) — Rect(D’,C) to FA, we obtain an object (yp: J > FI,|1<p<n) 
of T/FI, x --- x FI, and a G-extension B of (y1,..., Yn)" (FA) (that is, a dia- 
gram of shape D in C’ whose restriction along G is the diagram obtained by 
applying y% to the pth component of F'A, for each p) which is universal in the 
appropriate sense. 

Now, by a straightforward extension of 1.2.3, the functor S/D x --- x In > 
T/Fh x --- x FI, induced by F has a right adjoint, obtained by applying 
R and then pulling back along (n7,: Ip + RFI, | 1 < p < n) where 7 is the 
unit of (F 4 R). Applying this to (yp | 1 < p < n), we obtain an object 
(tp: K >I, |1<p<n) of S/I, x---x I, together with a (counit) morphism 
z:FK > JinT/FI,x---x FIn. So if we apply z* to the diagram B, we obtain 
a G-extension of (y1z,.-.,Ynz)*(F'A) in C¥*, or equivalently of (£1, -- , £n)” (A) 
in (F*C)*. And since, for any object (wp: L > I, |1<p<n) of S/hx---xIn, 
morphisms L — K in S/D x --- x I, correspond to morphisms FL — J in 
T/FI, x---x Fln, it is easy to see that this extension has the required universal 
property for the value at A of a right adjoint to Rect(D, F*C) — Rect(D’, F*C). 

g 


Corollary 1.3.18 Let S be (properly) cartesian closed, let I be an object of S 
and let C be an S-indexed category. Then (the fibration corresponding to) the 
S-indexed category C! of 1.2.2(f) satisfies any comprehension scheme satisfied 
by C. O 


In 2.2.6 below we shall obtain a partial converse to 1.3.18: if T is locally 
cartesian closed, then change of base along F: S — T preserves local smallness 
iff F has a right adjoint. 


Suggestions for further reading: Bénabou [101, 102, 105}. 


B1.4 Limits and colimits 


An important part of the reason for introducing the machinery of S-indexed 
categories and fibrations is the desire to be able to talk about ‘infinite’ limits 
and colimits in elementary terms. We saw in Section A1.2 that finite limits (and 
colimits) are an elementary concept, and we can therefore extend these notions 
to indexed categories in a straightforward way: thus we say that an S-indexed 
category C has S-indexed finite limits (or is S-cartesian) if each of the fibres 
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C! has finite limits, and each transition functor z*: C7 — C! preserves them. 
(Similarly for finite limits of some particular type, such as equalizers. Note that 
the assertion that C has S-indexed limits of some particular (finite) shape D is 
just the assertion that the indexed diagonal functor C — [D, C] has an indexed 
right adjoint, where [D, C] is the indexed functor category defined after 1.2.5.) 

The possession of S-indexed finite limits may also be neatly characterized in 
terms of fibrations: 


Lemma 1.4.1 Let S be a cartesian category, and II: C — S a fibration. Then 
the corresponding S-indexed category C is S-cartesian iff C has finite limits and 
Il preserves them. 


Proof First suppose C has and II preserves finite limits. Then it is clear that 
each fibre C? of IT has finite connected limits (in particular, pullbacks) which are 
preserved by the inclusion C7 — C, since the limit in S of a constant connected 
diagram on the object I is I itself. Moreover, it follows readily from 1.3.3(ii) that 
the transition functors z*: C! — C7 preserve pullbacks. So we need only verify 
that the fibres have terminal objects, which are preserved by the transition 
functors; but this is simply a matter of definition — if we define 1; to be the 
domain of the prone morphism to the terminal object of C lying over J — 1 in S, 
then it is immediate from the definition of prone morphisms that 1; is terminal 
in C7. 

Conversely, suppose C is S-cartesian. Given a finite diagram D: 7 — C, we 
first form a limit cone (with summit J, say) over the diagram ITD in S, and then 
use the transition functors to pull back the vertices of the original diagram to 
the fibre C’. The definition of prone morphisms ensures that the edges of the 
original diagram pull back to vertical morphisms forming a diagram of shape J 
in C7; and if we form a limit cone over this diagram, it is again easy to see that 
its summit yields a limit for the original diagram in C. Thus C has limits of shape 
J, and II preserves them. B 


We shall use the term cartesian fibration for a fibration satisfying the 
equivalent conditions of 1.4.1. 

Since, in the classical case, all limits may be constructed from finite limits 
(in fact, from equalizers) and products, we therefore need only worry about the 
S-indexed analogues of infinite products and coproducts. (We shall prove the 
S-indexed analogue of the theorem just quoted in 2.3.20 below.) 

To say that an ordinary category C has J-indexed products, where J is a set, 
is to say that the diagonal functor C — C! has a right adjoint. But to say that 
C has all small products implies, more generally, that all the transition functors 
z*: C4 — C! in the naive indexing of C over Set have right adjoints, and that 
these right adjoints themselves form indexed functors, i.e. they commute with re- 
indexing in a suitable sense. Because we do not wish to accord any special status 
to the terminal object of our base category S, we take the latter characterization 
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as the basis of our definition. Until further notice, we shall assume that our base 
category S has pullbacks. 


Definition 1.4.2 (a) Let C be an S-indexed category. We say that C has 
S-indezed products if we are given, for each 2: [— J in S, a right adjoint 
Il, : C? — CY for x*, and for each pullback square 


j ee 
w 


K ——> L 


in S, the canonical natural transformation w in the diagram 


ci ———_>¢! 


+ 


ch aaus ck 


(defined by taking y4: w* I, A — II,x* A to be the transpose across (y* 4 II,) of 
the composite y*w*Il, A = «2*2z*I, A — «x* A induced by the counit of (z* 4 II,)) 
is an isomorphism. 

(b) We say that C is S-complete if it has S-indexed finite limits and S-indexed 
products. 


The condition on pullback squares is commonly known as the Beck—Chevalley 
condition; cf. A2.2.4. It is often rather sloppily stated as the existence of a natural 
isomorphism w*II, = Iyz*, rather than the requirement that the particular 
natural transformation Y defined above should be invertible; but in practice, 
once one has verified the former, it is usually a fair bet that the latter will 
be true. 


Remark 1.4.3 The two conditions ‘C has S-indexed products’ and ‘C has 
finite limits’ of 1.4.2(b) are not entirely independent. We shall see in the next 
section that if the base category S is positive coherent, and C is a stack for the 
coherent coverage on S, then for every I € ob S we have C4 ~ C! x C!, by an 
equivalence identifying V* (where V: I H I — I is the codiagonal map) with the 
diagonal functor C? — C! x C!. So the assertion that the V* have right adjoints 
is equivalent to saying that the categories C? all have binary products; and the 
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Beck—Chevalley condition for pullback squares of the form 


Fp ag 


zuz T 


IUI —— I 


is equivalent to saying that these binary products are preserved by the transition 
functors z*. Similar arguments involving the initial object of S ensure that, if C 
has S-indexed products, then its fibres have terminal objects which are preserved 
by the transition functors. Thus, under the above hypotheses on S and C, the 
condition ‘C has finite limits’ of 1.4.2(b) can be reduced to ‘C has equalizers’ (or 
even to ‘C has coreflexive equalizers’; cf. A1.2.10). 


By the uniqueness of adjoints, it is clear that if the right adjoints II, exist 
for all z they are (covariantly) pseudo-functorial in z. At first sight, this gives 
another way of defining a natural transformation in the square appearing in 
1.4.2(a), as the transpose across (w* ~ ILo) of the composite I, A —> I,0,2* A = 
IL, I1,2*A induced by the unit of (x* ~ IL), but in fact this transformation is 
equal to w — this follows from the way in which the coherence isomorphism 
MUI, = I, IL, is constructed from y*w* = x*z*. 

Of course, we say that C has S-indezxed coproducts if each x* has a left adjoint 
vz: C7 — C7, and the Beck-Chevalley condition holds for pullbacks in S. 


Examples 1.4.4 (a) To show that the Beck—Chevalley condition is not always 
a triviality, consider the Set-indexed category T of 1.2.2(h). We have already 
observed that its transition functors have left adjoints; but in the pullback square 


T 
E 


0 i 
jac SS) 


(where n denotes an n-element set, and z and w are constant maps with disjoint 
images), the composite w*z, sends the unique topology on 1 to the indiscrete 
topology on 2, whereas y,x* sends it to the discrete topology. 

(b) A rather more substantial example of the same phenomenon is given by 
the Cat-indexed category which to a small category C associates the functor 
category [C, Set], and to a functor f: C — D the functor f*: [D, Set] — [C, Set] 
obtained by composition with f. As we observed in A4.1.4, these functors have 
adjoints on both sides; but they fail to satisfy the Beck-Chevalley condition in 
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general. For a simple counterexample, consider any pullback square of the form 
1 
C ———>C 
1 | 
EEEL y 
where C is a non-full subcategory of D: the composite (for either left or right 
adjoints) around the top and left sides of the square is clearly the identity func- 


tor, but the composite i* o lim; is the identity only if i is full and faithful 
(cf. A4.2.12(b)). Another counterexample is provided by the pullback 


0 ———_>1 


0 
1 ——— 2 
where n denotes an n-element totally ordered set and 0,1 are the bottom and 
top elements of 2; it is easily checked that 1* o limg is the identity functor 
Set — Set, but going the other way round the Beck—Chevalley square produces 
the constant functor with value 0. 


The next result generalizes a phenomenon which we observed in 1.3.7(a) 
and (d). 


Lemma 1.4.5 Let Il: C — S bea fibration, and C the corresponding S-indexed 
category. Then 


(i) the transition functors of C have left adjoints iff II is also an opfibration; 


(ii) C has S-indexed coproducts iff II is an opfibration and the pullbacks of 
supine morphisms of C along prone morphisms (exist and) are supine. 


Proof (i) Suppose the z* have left adjoints ©,. Then, for each z: I + Jin S 
and each A € ob C!, we have a morphism 7{A, xr): A > ©,A in C obtained by 
composing the unit map A — z*,,A in C7 with the prone morphism o(¥, A, £). 
Now, if we are given f: A — B in C such that IIf is the composite of x with 
y: J — K, then we obtain a morphism A — (yz)*B & x*y*B in C? and hence 
a morphism ©,A — y*B in C7; composing this with o(B,y) yields the unique 
factorization of f through 7(A, x) lying over y. Thus 7(A, =z) is supine, and so H 
is an opfibration. Conversely, if If is an opfibration, then by defining /,A to be 
the codomain of a supine lifting of z with domain A, we obtain an adjunction 
(£s 4 2*), since morphisms XA — B in C7 and morphisms A — z*B in C! 
both correspond bijectively to morphisms A — B in C lying over z. 
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(ii) Given (i), we have to show that the Beck—Chevalley condition for the £, 
is equivalent to the stability of supine morphisms under pullback along prone 
morphisms. Given a pullback square 


-™ 


in S and an object A of C7, we may form a commutative square 


ek es 


oS 


222s 


o(A,x 
xz*A e A 
h (A, z) 
a(&, A, w) 


wB A ————> 1A 


in C, where h is the unique factorization of the composite 7(A, z)o(A, x) through 
o(X,A,w) lying over y. By an argument like that of 1.3.3(ii), it is readily seen 
that this square is a pullback in C. So the assertion that its left edge h is supine 
is equivalent to saying that we have a canonical isomorphism Xy yzr* A = w*h,A - 
and this isomorphism is necessarily the natural map appearing in the Beck- 
Chevalley condition. o 


The following lemma is a useful labour-saving device: 


Lemma 1.4.6 Let C be an S-indexed category, and suppose that, for each 
z: I — J in S, 2*:C% — C? has both left and right adjoints Ez and z. Then C 
has S-indexed products iff it has S-indexed coproducts. 


Proof Suppose that C has S-indexed products, i.e. that the Beck—Chevalley 
condition holds for the right adjoints. Then, if 


L 


as 


<a 
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is a pullback square in S, the functors in the diagram 
J x” I 
cy —— cC 
Hz £y 


ch ck 


all have right adjoints, and the latter commute up to isomorphism; so the square 
above commutes up to isomorphism. Moreover, the canonical transformation 
Lyx* — w*d, can be shown to be the ‘mate’ under the adjunctions of the 
canonical transformation z*IL, — Il,y*, so the first is an isomorphism iff the 
second is. o 


The alert reader will recognize that we have seen a particular case of 1.4.6 
before, in A1.4.11: there we observed that the indexed category Subs of 1.2.2(g) 
has S-indexed coproducts iff S is a regular category (cf. A1.3.1(iii)), and 
deduced that it has S-indexed products provided its transition functors have 
right adjoints, i.e. provided S is a Heyting category. Note also that Subs is 
S-cocomplete iff S is a coherent category: since its fibres are preorders, the only 
finite colimits we have to worry about are coproducts (i.e. unions). 

Similarly, we have 


Lemma 1.4.7 Let S be a cartesian category, and let S be the canonical indexing 
of S over itself, as defined in 1.2.2(c). Then 


(i) S always has S-indexed coproducts; 
(ii) S is S-cocomplete iff S has finite colimits which are stable under pullback; 
(iii) S has S-indexed products iff it is S-complete, iff S is locally cartesian 
closed. 


Proof (i) We know that the pullback functor z* : S/J — S/I has a left adjoint 
=z, by A1.2.8; and the Beck—Chevalley condition for the square 


S(J == SSI 
> E, 


$/i——> $/K 


is simply the assertion that a composite of two pullback squares is a pullback. 
(ii) is immediate from (i) and the fact that the forgetful functors S/I — S 
preserve and reflect colimits. 
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(iii) A1.5.3 tells us that S is locally cartesian closed iff the x* all have right 
adjoints Iz, and the Beck—Chevalley condition for the I, follows from that for 
the £z, by 1.4.6. Finally, since S is cartesian, the slice categories S/I all have 
finite limits, and the x* preserve them since they have left adjoints. oO 


In particular, we note that S is both S-complete and S-cocomplete iff S is 
locally cartesian closed and cocartesian. 
The next result is a useful extension of 1.4.7. 


Lemma 1.4.8 Let F:S —C be a functor between cartesian categories, and 
let C denote the S-indexed category I œ C/F (I) of 1.2.2(d). The following are 
equivalent: 


(i) F preserves pullbacks; 
(ii) C has S-indexed coproducts; 
(iii) (if C is locally cartesian closed) C has S-indezed products. 
Proof As before, the pullback functors F(x)*:C/F(J) + C/F(J) automat- 
ically have left adjoints, and they have right adjoints if C is locally cartesian 
closed; the question is thus whether the Beck-Chevalley condition holds. If F 


preserves pullbacks, then the condition holds for the same reason as in 1.4.7. 
Conversely, if the condition holds for a pullback square 


= 


in S, then on chasing the terminal object 1p(z) of C/F(J) around the square 


nei 


Hec S 


JS 


C/F(J) => C/F(D) 
5, Sy 
C/F(L) > C/F(K) 


we see that F must preserve this pullback. o 
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We recall from A1.2.9 that any pullback-preserving functor F: S — C can be 
factored as 


a 


F 
S ——— > C/Fi ——> C 
where F is cartesian; and the S-indexed category obtained from F via 1.2.2(d) 
coincides with that obtained from F’. So there is no real loss of generality, in this 
context, in assuming that F is cartesian. 
Lemma 1.4.8 can be further generalized: 


Lemma 1.4.9 Let F:T — S be a functor between cartesian categories. Then 
the change of base functor F*: CATs — CALzr of 1.2.2(e) preserves the 
property of having indexed coproducts (and products) iff F preserves pullbacks. 


Proof If C is a T-indexed category with coproducts, then for any 
z:[—+J in S the transition functor 2*:(F*C)’ — (F*C)! coincides with 
F(xz)*:C¥J — CF!, and so has a left adjoint; and if F preserves pullbacks then 
the Beck—Chevalley squares for F*C are particular cases of those for C, and so 
they commute. The argument for products is similar. 

Conversely, suppose F’* preserves the property of having indexed coproducts. 
Then in particular F*S has T-indexed coproducts by 1.4.7, where S is the canon- 
ical indexing of S over itself; but this is the indexing of S over T obtained from 
F as in 1.2.2(d), so by 1.4.8 F must preserve pullbacks. O 


Of course, any change-of-base functor preserves the property of having 
(indexed) finite limits or colimits; so we deduce that if F preserves pullbacks then 
F* preserves indexed completeness and cocompleteness. In particular, since func- 
tors of the form (—) x I: S > S preserve pullbacks, we note that an S-indexed 
category of the form C7 (as defined in 1.2.2(f)) is S-complete or cocomplete 
if C is. 

It is possible to characterize the S-indexed categories arising from carte- 
sian functors F: S — C in fibrational terms: the following argument is due to 
J. L. Moens [820]. First, we need a couple of new definitions. We say that an 
S-indexed category C has stable S-indexed coproducts if the supine morphisms in 
the corresponding fibration II: C — S are stable under pullback along arbitrary 
morphisms, not just along prone ones (cf. 1.4.5). 


Lemma 1.4.10 LetII:C — S be a cartesian fibration, and C the corresponding 
S-indexed category. Then C has stable S-indexed coproducts iff the following 
condition is satisfied: suppose given x: I — J in S, an object A of C! and a 
morphism B — X., A in C7; then if we form the pullback 


C ——> 2*B 


A——— > 2*d,A 
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in C! (where the bottom edge of the square is the unit of (Dz 4 x*)), the transpose 
£C — B of the top edge is an isomorphism. 


Proof By 1.4.5{ii), if C has S-indexed coproducts, they are stable iff the 
supine morphisms in the total category of the corresponding fibration II: C + S 
are stable under pullback along vertical morphisms. But in this category we may 
form the diagram 


o(B,2) 
C ———> «* B ——_—_——__> B 


o({X,A, T) 
A —— t*r A ———> ©, 


where the right-hand square is a pullback by 1.3.3(ii), and the bottom composite 
is the supine morphism 7(A, x). So the outer rectangle is a pullback iff the left- 
hand square is; and the top composite is supine iff the transpose of C — «*B is 
an isomorphism. oO 


Given this result, it is easy to see that the naive indexing of an ordinary 
cartesian category over Set has stable Set-indexed coproducts iff the ordinary 
category has coproducts which are stable under pullback, in the usual sense. 

We may also define disjointness of coproducts for indexed categories: given 
a cartesian fibration II: C — S, we say the corresponding S-indexed category C 
has disjoint S-indexed coproducts if (it has S-indexed coproducts and) for any 
supine morphism f: A — B in C, the diagonal morphism A — A xp A is also 
supine. For the naive indexing of an ordinary cartesian category over Set, it is 
again not hard to see that this is equivalent to disjointness of coproducts as we 
defined it in A1.4.4. 


Lemma 1.4.11 Let Il: C — S be a cartesian fibration, and suppose the corre- 
sponding S-indezed category has disjoint and stable S-indexed coproducts. Then, 
for any z: I — J in S and any f: A — B inC’, the canonical square 


Aam 34) 


ag” 
B 


———> ©,(B) 


is a pullback in C. 
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Proof [It is easy to see that we have a pullback square 


fA 
hss 


f 1x f 


B——> B xy,B B 


and hence that the top edge of this square is supine. But we also have a supine 
morphism B xs,B A— B xy, rA lying over the first projection J xy I > I 
(since this is a pullback of A — ©,A), and so the composite is a supine morphism 
A — Bxs,B%,A lying over the identity morphism on J. But any such morphism 
must be an isomorphism. O 


Theorem 1.4.12 Let Il:C +S be a cartesian fibration, and let C be the 
corresponding S-indezed category. Then C has disjoint and stable S-indered 
coproducts iff there exists a cartesian functor F: S — T such that C is equivalent 
to the S-indexed category induced by F as in 1.2.2(d). 


Proof Suppose there is such an F. Then we may identify C with the glueing 
category GI(F), as we observed in 1.3.7(b); and it is not hard to see that a 
morphism 


| ee 


gens 


in this category is supine iff f is an isomorphism. Given this, it is easy to verify 
the conditions for disjointness and stability. 

Conversely, suppose C has disjoint and stable coproducts. We take T to be the 
fibre C', and we define F: S — T on objects by FI = Z,(17), where 1; denotes 
the terminal object of C7. For a morphism x: I — J of S, Fx corresponds to the 
composite 

z* (n) 
1; = x* (17) —> ed Dy) = PY y(13), 
where 77 is the unit of ($y 4 J*); equivalently, it is the unique vertical morphism 


FI — FJ making 
l; ———-_> FI 


o(lj, £) Fx 


ly ————> FJ 


B1.4 LIMITS AND COLIMITS 289 


commute. It is easy to verify that this is functorial, We have a functor 
©!:¢C! —T/FI which is simply ©; applied to morphisms with codomain 1,, 
and we have a functor W!: T/FI — C’ which sends (f: A> FI = ¥,(1;)) to 
the pullback 


Yf) ——> A 


l; ———> FI 


in C. To verify that ® is an S-indexed functor, consider the cube 
x* A —— (F'x)*¥(A) 


o(A, z) 


1p eS PB 6! (A) 


o(lr, x) Fr 
1, ———— FI 


induced by a morphism xz: J — I in S. Here the front face is a pullback by 
1.4.11, the left face is a pullback by 1.3.3(ii), and the right face is a pullback by 
definition. So the back face is a pullback, and hence its top edge is supine; thus 
we have 6/x*(A) & 2*!(A). The fact that Y is an S-indexed functor is an easy 
application of 1.3.3(ii). 

Since the top edge of the pullback square defining / (f) is supine, it is clear 
that the composite ®/W/ is naturally isomorphic to the identity. The fact that 
W/®! is isomorphic to the identity follows from 1.4.11. 

Finally, since we now know that the S-indexed category I++ 7/FI is equiv- 
alent to C, it has S-indexed coproducts; so F preserves pullbacks by 1.4.8. But 
it is clear from the construction that it also preserves the terminal object; so it 
is a cartesian functor. 0 


Next, we introduce the notion of continuity (= preservation of limits) for 
S-indexed functors. 
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Definition 1.4.13 If F:C-—D is an indexed functor between complete 
S-indexed categories, we say F is S-continuous (or simply continuous) if each F7 
preserves finite limits, and for each x: J — J in S the canonical natural transfor- 
mation FYTI, — II, F, which is the ‘mate’ of the isomorphism 2* FY S F/x*, 
is an isomorphism. 


Lemma 1.4.14 LetC andD be complete S-indexed categories, and F: C — D 
an S-indexed functor having an indexed left adjoint (i.e. such that each F! has 
a left adjoint L', in such a way that the L! form an S-indexed functor and the 
unit and counit of the adjunction are S-indexed natural transformations). Then 
F is S-continuous. 


Proof Since each F! has a left adjoint, it preserves the finite limits which 
exist in C7. For the second condition, we note that the requirement that the unit 
and counit of the adjunction be S-indexed forces the isomorphism up to which 
the diagram 


pi —_sp! 


LI D 


oana 
commutes to be the mate of that up to which the corresponding diagram for F 
commutes. So, on taking the right adjoints of all the functors in this diagram, 
we obtain a diagram 


Il, 
Cs. 3 CY 
F! FI 


p= Esp 
commuting up to an isomorphism, which again must be the mate (across a 
different pair of adjunctions!) of the given isomorphism z*F7 & F'g*. Oo 


Under suitable ‘smallness’ conditions on the categories C and D, 
Lemma 1.4.14 has a converse (the Indexed Adjoint Functor Theorem), which 
we shall prove in 2.4.6 below. 

The notion of indexed monad on an indexed category is self-explanatory; 
it is also easy to see that an indexed monad T = (T,7,) on an S-indexed 
category C gives rise to an indexed category of algebras CT, whose fibre over I is 
the category of algebras for the monad (T’,7’, pu!) on C7. The following result, 
though unsurprising, will be of importance later on. 
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Lemma 1.4.15 Let T be an indexed monad on an S-indexed category C. 


(i) or . S-indexed products, so has CT, and the indexed forgetful functor 
—» C preserves them. 


(ii) IfC has S-indexed coproducts and T preserves them, then CË has S-indexed 
coproducts and the forgetful functor preserves them. 


Proof (i) Let x: I — J be a morphism of S, and (A,a) a T/-algebra in C’. 
Consider the composite 


TIT (A) oA, MT! (A) Tate). I,(A), 


where @ is the mate of the natural isomorphism 2*TY & T!x*. It is straight- 
forward to verify that this composite is a T/-algebra structure on IL,(A); and 
that, given any T/-algebra (B, 3), a morphism B — II,(A) in C7 is a T’-algebra 
homomorphism iff its transpose x*(B) — A is a T/-algebra homomorphism. Thus 
we have constructed a right adjoint for x*: (C7 ae — (C! yr. The fact that the 
forgetful functors from T-algebras to objects commute with these right adjoints 
is immediate from the definitions. Finally, the Beck-Chevalley condition for CT 
can be ‘lifted’ from that for C, since the forgetful functors reflect isomorphisms. 

(ii) Given a morphism z: J > J and a T!-algebra (A, a) in C7, we consider 
the morphism 


Ela) 
T’S,(A) S ET (A) ——> B(A). 


It is again straightforward to verify that this is a T7-algebra structure on £,(A), 
and that this construction yields a functor (C/)™ — (C/)T” left adjoint to z*. 
Once again, the Beck—Chevalley condition for CT may be lifted from that for C. 

o 


Given an arbitrary S-indexed category C, we may freely adjoin S-indexed 
coproducts to it, by the following construction. Let Fam(C), the category of 
(S-indexed) families of objects of C, be the S-indexed category whose J-indexed 
families of objects are pairs (x, A) where x: J — I is a morphism with codomain 
I in S and A is an object of C7, and whose morphisms (x, A) — (2’, A’) 
are pairs (y,f) where y:2—2’ in C/I and f: A—y*A' in CY. (Note in 
particular that Fam(C)! is simply the total category of the fibration corre- 
sponding to C.) If z: K — I is a morphism of C, then the re-indexing functor 
z*: Fam(C)! — Fam(C)* sends (z, A) to (11,73A) where the 7; are the projec- 
tions from the pullback K x, J. The left adjoint ©, of z* is given by composition: 
E(w, B) = (zw, B). It is straightforward to verify that the above formulae do 
define an S-indexed category with S-indexed coproducts (the verification of the 
Beck-Chevalley condition is like that in the proof of 1.4.7(i) — indeed, 1.4.7(3) is 
a special case of it, since the canonical indexing S is none other than Fam(1), 
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where 1 is the indexed category with one object and one morphism in each fibre); 
and we have 


Proposition 1.4.16 Let C be an S-indexed category. 


(i) There ts an S-indexed full embedding C — Fam(C). 


(ii) Jf D ts an S-tindexed category with S-indexed coproducts, then any 
S-indexed functor F:C — D extends, uniquely up to canonical isomor- 
phism, to a coproduct-preserving functor F : Fam(C) — D. 


(iii) The assignment C ++ Fam(C) has the structure of a KZ-monad (cf. 1.1.11) 
on Cats; and the algebras for this monad are exactly the S-indexed 
categories with S-indexed coproducts. 


Proof (i) The embedding is defined on each fibre C7 by A +> (17, A); it is 
straightforward to verify that this works. 

(ii) Again, it is straightforward to verify that F may be defined (and, up to 
isomorphism, must be defined) by F'! (x, A) = Ez (F7 (A)). 

(iii) In particular, if D has S-indexed coproducts, then the identity functor on 
D must extend to a functor Fam(D) — D, sending (z, A) to D(A). It is easy to 
verify that this functor is an indexed left adjoint for the embedding of (i): the unit 
of the adjunction at an object (x, A) is given by (2,74): (x, A) — (17, Uz (A)) 
where n is the unit of (X, 4 2*). Applying this to the case D = Fam(C), we obtain 
the multiplication of the monad, together with the fact that it is a KZ-monad. 
Then by 1.1.13 we know that the Fam-algebras are the indexed categories for 
which the embedding of (i) has a left adjoint; but these are exactly the categories 
with S-indexed coproducts. oO 


Remark 1.4.17 If our base category S is a topos with a natural number 
object, and we merely wish to adjoin finite coproducts to an S-indexed cate- 
gory C, then we may do so by forming the full subcategory Fam; (C) of Fam(C) 
whose objects are pairs (x, A) such that z: J — I is a finite cardinal in S/I 
(cf. A2.5.14). (More generally, given any particular morphism x: X — I in S, 
we could form the indexed category obtained by ‘freely adjoining coproducts 
indexed by fibres of x’; the particular case under discussion is that in which x 
is the generic finite cardinal in S/N.) By a result to be proved in D5.2.12, if 
we are given a composable pair of morphisms z: J — I, y: K — J in S such 
that z is a finite cardinal in S/I and y is a finite cardinal in S/J, then the 
composite xy is (isomorphic to) a finite cardinal in S/I; hence Famy(C) does 
indeed have coproducts indexed by finite cardinals in S (that is, if x: J > I is 
a finite cardinal in S/I, then z*: Fams (C)! — Fam,y(C)/ has a left adjoint, and 
the Beck—Chevalley condition holds for all pullback squares for which it makes 
sense). Moreover, the proof of 1.4.16 can easily be adapted to show that Fam; (C) 
is the free category with finite-cardinal-indexed coproducts generated by C. 


Of course, we could also regard the problem of adjoining finite coproducts to 
an indexed category as one to be tackled ‘fibrewise’, using the ‘external’ analogue 
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of the Fam;(—) construction on each fibre. By the argument of 1.4.3 above, this 
will produce the same result (up to equivalence) if the category C is a stack for 
the coherent coverage on S. (See also D5.2.13.) 


Suggestions for further reading: Moens [820], Street [1137], Wood [1232]. 


B1.5 Descent conditions and stacks 


One problem with the notion of indexed category is that it is a little too ‘relaxed’, 
in the sense that the relationships between the fibres C? for different I are not 
strictly enough controlled. We have already observed the effect of this in 1.2.2(0) 
above: if we take our base category S to be Set, then since 1 is a generator 
for Set we might expect the entire structure of a Set-indexed category C to be 
controlled (up to equivalence) by the fibre C, but we have seen that a given 
ordinary category may admit indexings over Set which are genuinely different 
from the naive one. 

One way of excluding at least some of the ‘pathological’ possibilities for index- 
ings is to require them to satisfy descent conditions for some given coverage on 
S. (Here we are using the term ‘coverage’ in the sense in which it was defined in 
A2.1.9 (or C2.1.1); however, since we shall be concerned exclusively with base 
categories having finite limits (and in particular pullbacks), we shall generally 
assume that our coverages satisfy the strong form (C”) of the ‘pullback-stability’ 
condition rather than the original weak form (C).) The idea of a descent condi- 
tion is that, given a covering family (x: J, — I | y € T) in S, we should be able 
to ‘reconstruct’ the fibre C? up to equivalence from the categories C77, together 
with additional ‘descent data’. 

To explain the form which the descent data takes, consider an object A of 
C’, and let A, denote the object 2*(A) of C77, for each y € T. For each pair 
(y, 6) of (not necessarily distinct) indices in I, we may form the pullback 


Yyő 


Ly 6 Ty 


Ts 
Js ———— I 


in S; then since the above square commutes we have a canonical isomorphism 
fas: yzs(Ay) — z%(Ás) in C*+5_ Moreover, these isomorphisms are compatible, 
in the following senses: 


(a) If wy: J, + Ky is the diagonal map, then w%(f,,,) is the canonical iso- 
morphism wy},(A,) = A, = wi2%,(A,) arising from the equations 
YyyWy = lj, = Zyywy. 
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(b) If we form the ‘triple pullback’ 


AX 


Lyõe E ER 


NZ 


together with the three projections tye: Lye > Ky5, Uyse: Lyse > Kye 
and Uyse: Lyôe — Kse, then the diagram 


t se( fa ) 


th syy (Ay ) — > tse? 76(As) —> V36eY5e(As) 


U5 5¢( fee) 
Ur se( Fre) 


x 

Ur 5eYre(Ay ) > Ur 56 27 ¢(Ae) > Ux6e25e(A e) 
commutes, where the unnamed arrows are the canonical isomorphisms 
arising from commutative squares in S. 


Definition 1.5.1 Let C be an S-indexed category, and let R = (x,: J, —> I | 
y € T) be a family of morphisms of S with common codomain. The category 
Desc(C, R) of descent data for R in C has as objects pairs ((A,), (f5)) where 
A, is an object of C77 for each y and the fys: yz (Ay) > 235(As) are morphisms 
satisfying the compatibility conditions (a) and (b) above, and as morphisms 
((Ay)(f98)) > ((By), (975)) families (hy: A, > B, | y E T) such that 


* fas * 
Y35(Ay) ——> 235(As) 
Ya (hy) 235 (hs) 


* 975 * 
yza(By) ==> 256(Bs) 
commutes for each (y, ô). 


At first sight, Definition 1.5.1 may seem rather arbitrary: it is not clear why 
the conditions (a) and (b) should be singled out from all the things we might say 
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about a family of morphisms ( fys). However, it can be given a formal justification 
in terms of 2-categorical (pseudo-)limits: we shall discuss a particular case of this 
in 3.4.11 below. Note that we do not need to specify, in defining the objects of 
Desc(C, R), that the morphisms f,s5 should be isomorphisms, since this is forced 
by conditions (a) and (b): if wys: Ky — Ks is the ‘twist’ isomorphism in S, 
then (modulo the appropriate canonical isomorphisms) wžs(fsy) is necessarily 
an inverse for fys- 

We clearly have a functor 0g: C! — Desc(C, R) sending an object A to the 
family (x*(A) | y € T) with the structure maps described above, and a morphism 
h: A — B to the family (x*(h) | y € T). 


Definition 1.5.2 (a) We say an S-indexed category C satisfies the descent 
condition (resp. the pre-descent condition) for a family R of morphisms of S if the 
functor ĝpr just described is one half of an equivalence of categories Dese(C, R) ~ 
C! (resp. is full and faithful). 

(b) We say C is a stack (resp. a pre-stack) for a coverage T on S if it satisfies 
the descent condition (resp. the pre-descent condition) for every covering family 
in T. 


Our use of the terms ‘pre-descent’ and ‘descent’ is non-standard: the conven- 
tional usage is to call these notions ‘descent’ and ‘effective descent’ respectively. 
However, since the stronger notion is clearly the more important one, it seems 
appropriate to give it the unadorned name. 

A stack may be thought of as a ‘higher-dimensional version’ of the notion 
of sheaf for a coverage, introduced in A2.1.9. Indeed, if C is a discrete indexed 
category (one such that the fibres C! are discrete categories), and we ignore 
questions of size, then C satisfies the descent condition for a family R iff the 
functor (I ++ ob C7) satisfies the sheaf axiom for R. However, when C contains 
non-identity isomorphisms, this equivalence breaks down: even if the assignment 
(I + ob C?) is strictly functorial (as opposed to pseudofunctorial), the assertion 
that it is a sheaf does not imply that C is a stack. 


Example 1.5.3 Suppose S is regular, and let A and B be two objects of S 
which are locally but not globally isomorphic: that is, there exists a cover I -> 1 in 
S and an isomorphism f: I*A — I*B in S/I, but no isomorphism A —> B in S. 
(For example, in the topos [G, Set] where G is a group, we could take A to be G 
acting on itself by left translations, and B to be G with trivial G-action.) Let C be 
the S-indexed category defined by taking C! to be the monoid of endomorphisms 
of I* A; since ob C! is a singleton for all I, the functor I > ob C? is clearly a 
sheaf for the regular coverage (and indeed for any other coverage on S). If S is 
cartesian closed, then J ++ mor C! is also a sheaf, since it is representable by the 
object A^. Nevertheless, the composite 


(I x I)*(A) OL (I x I)*(B) mall”), (Ix I)*A 
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satisfies the coherence conditions of 1.5.1 for the family consisting of the sin- 
gle morphism (J —> 1), and so defines an object of Desc(C, (J —p1)). And 
this object is not isomorphic to the unique object in the image of the functor 
07-41): Ct > Desc(C, (I -»1)), for if it were, then we should have an automor- 
phism g of I* A such that 77(fg) = 73(fg), and the composite fg would ‘descend’ 
along J —> 1 to yield an isomorphism A = B. So @(;_,1) is not an equivalence. 


On the other hand, if C is a pre-stack, then for any pair of objects (A, B) of 
C1 the assignment 
I+ home: (I* A, I* B) 


is a Sheaf; indeed, the assertion that C is a pre-stack can be viewed as a ‘localized’ 
version (that is, one which applies to all the objects of C and not just those in 
the fibre over 1) of the assertion that its hom-sets are sheaves. In particular, 
if the coverage on S is subcanonical (that is, such that all the representable 
functors on S are sheaves — cf. A2.1.11(a); this condition will be satisfied by all 
the coverages we wish to consider on S), then any locally small indexed category, 
as defined in 1.3.12 above, is a pre-stack. 


Example 1.5.4 If our base category S is Set, we may take T to be the coverage 
whose members are all jointly-surjective families of functions. Then it is easy to 
verify that an indexed category C is a stack iff it satisfies the descent condition 
for the family of all maps 1 — J, for each set J. In turn, this happens iff the 
functor 
t —_> Je 
ie] 

whose ith component is i* is part of an equivalence of categories, i.e. iff C is 
equivalent (as an indexed category) to the naive indexing of C!. 


On a more general base category, there are two particular coverages which 
will be of interest to us: they are the regular coverage on a regular base cat- 
egory (cf. A2.1.11(a)), and the coherent coverage on a positive coherent base 
(cf. A2.1.11(b)). We next consider the interpretation of the notion of stack in 
these two particular cases. 

For the regular coverage, each cover consists of a single morphism z: J — I. 
For such a cover, the notion of descent data has a simpler appearance: an 
object of Dese(C, (x)) consists of a single object A of C7 equipped with a single 
morphism f: 1*(A) — 73(A) in C7*!7, subject to the ‘unit condition’ that 


* * A*(f) * oO 
A = Atnt(A) ——> A'nh(A) = A 


is the identity morphism on A, and the ‘cocycle condition’ that the re-indexings 
of f along the three projections J x; J x; J — J x, J fit together (modulo 
canonical isomorphisms) into a commutative triangle. (As before, these con- 
ditions are sufficient to ensure that f is actually an isomorphism, and this is 


B1.5 DESCENT CONDITIONS AND STACKS 297 


sometimes explicitly assumed in the definition of descent data.) In a general 
(cartesian) category S, we say that a morphism z is a descent morphism (resp. 
a pre-descent morphism) for a particular S-indexed category C if C satisfies the 
descent condition (resp. the pre-descent condition) for the family whose only 
member is x. If we simply say ‘x is a descent morphism’ without mentioning 
a particular indexed category, we mean that it is a descent morphism for the 
canonical indexing of S over itself. 

In verifying that a particular morphism is a descent morphism, the following 
result (due to J. Beck; it was this result that first emphasized the importance of 
the Beck—Chevalley condition) is often useful. 


Proposition 1.5.5 Suppose C is an S-indered category having S-indexed 
products, and let x: J — I be a morphism of S. Then 


(i) Desc(C, (x)) is isomorphic to the category of coalgebras for the comonad 
G on C7 induced by the adjunction (x* 4 Iz). 


(ii) x is a descent morphism for C iff x*:C! — C7 is comonadie. 


(iti) In particular, if C is S-complete, then x is a descent morphism for C iff 
xz*:C! —C/ is conservative. 


Proof (i) If A is an object of C7, then morphisms f: 7*(A) — 73(A) in 
C7*IJ correspond bijectively to morphisms A — II,,73(A) in C7; but by 
the Beck—Chevalley condition the latter correspond bijectively to morphisms 
f: A~ x*II,(A). Now A*(f) corresponds to the composite 


f 


A —— > T (A) = Hr mi (A) — in Ha A*r (A) SA 
where the second factor is induced by the unit of (A* 4 Ha); but it is easily 
verified that the Beck—Chevalley isomorphism makes the diagram 


(Aye 


~ fan 


In T(A) ——> Ty, In A* r(A) 


commute, where € is the counit of (<* 4 H). Thus (A, f) satisfies the unit 
condition for an object of Desc(C, (z)) iff (A, F) satisfies the unit condition for a 
G-coalgebra. A similar diagram-chase shows that the composite 7133(f) o tfo(f) 
and the morphism 1%,(f) in C7*!7*!/ transpose respectively to the composite 


Tat x*Il,(f) : 
A ——> 2° JI,(A) ———> 21,2", (A) 
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and to 


f 


ô 
A——> 2*,(A) —*> rIlLr"Il(A), 


where 6 is the comultiplication of G; so (A, f) satisfies the cocycle condition iff 
(A, f) satisfies the coassociativity condition for a coalgebra. Moreover, a mor- 
phism h: A — B between two objects equipped with such structures (A, f) and 
(B,g) clearly commutes in the appropriate sense with f and g iff it commutes 
with f and g; so we have the required isomorphism of categories. 

(ii) A further straightforward diagram-chase shows that this isomorphism 
identifies the functor 6/2): C? —> Desc(C, (x)) with the comparison functor C? > 
(C’)g; so this is immediate from the definitions. 

(iii) If C is S-complete, then x* (has a right adjoint and) preserves equalizers; 


so (iii) follows immediately from the Crude (co)Monadicity Theorem (A1.1.2). 


Regarding descent morphisms for the canonical indexing, we have the 
following: 


Proposition 1.5.6 Let S be a cartesian category, and x: J — I a morphism 
of S. Then 


(i) x is a pre-descent morphism (for the canonical indexing of S over itself) 
iff it is a stable regular epimorphism (i.e. every pullback of x is a regular 
epimorphism). 

(ii) In particular, if S is regular, then x is a pre-descent morphism iff it is a 
cover. 

(iii) Suppose either that S is effective regular, or that S is cartesian and has 
pullback-stable coequalizers of reflexive pairs. Then every cover in S is a 
descent morphism. 


Proof (i) Let us write (y,z): K = J for the kernel-pair of z, and w for the 
composite zy = xz. The assertion that x is a stable regular epimorphism is 
equivalent to saying that, for every f: A — I, the top row of the diagram 


(in which all the squares are pullbacks) is a coequalizer. Suppose this holds, 
and let g: B — I be another object of S/I. Then the assertion that a morphism 
z*(A) — 2x*(B) over J is a morphism of Desc(S, (x)) (when its domain and 
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codomain are equipped with their canonical descent data) is equivalent to saying 
that there is a morphism w*(A) — w*(B) making both the squares 


w*(A) Z3 2*(A) 


w*(B) ——S 2*(B) 


commute, and hence to saying that the composite x*(A) — «*(B) — B factors 
uniquely through x*(A) — A. Thus 2* is full and faithful as a functor S/I — 
Desc(S, (z)). 

Conversely, suppose x is a pre-descent morphism. Then, for any f: A— J 
and any object B of S, we see that morphisms x*(A) — J*(B) = x*I*(B) 
in Desc(S, (x)) correspond bijectively to morphisms A — 1*(B) in S/I. Dis- 
entangling the definitions, this means that morphisms 7*(A) — B in S which 
have equal composites with w*(A) = x*(A) correspond bijectively to morphisms 
A — B. So the top row of the diagram on the previous page is a coequalizer, i.e. 
x is a stable regular epimorphism. 

(ii) follows immediately from (i) and A1.3.4. 

(iii) First suppose S is effective regular. Let (h: A— J, f) be an object 
of Desc(S, (x)); then the projection y*(A) — A and the composite y*(A) — 
z*(A) — A define a pair of morphisms y*(A) = A, which is readily seen to 
be an equivalence relation on A. (Reflexivity and transitivity follow from the 
unit and cocycle conditions on the isomorphism y*(A) — z*(A); symmetry from 
the fact that it is an isomorphism and (y, z) is symmetric.) So we may form its 
coequalizer q: A — B, and we may factor xh through q to obtain a morphism 
k: B — I. We claim that the canonical morphism A — x*(B) induced by q and 
h is an isomorphism. It is a cover, because the composite y*(A) — A — x*(B) 
may be regarded as the pullback of q: A — B along x (where A is regarded as 
sitting over I by the morphism xh), and is thus a cover. And it is monic, because 
the pair (q, h) is jointly monic: if we have a pair (l,m): C 3 A with ql = qm 
and Al = hm, then we obtain a factorization n: C — y*(A) of (l,m) through 
the kernel-pair y*(A) = A of q, and then the composite C — y*(A) — K fac- 
tors through the equalizer of (y, 2): K 3 J — but the latter is the diagonal map 
J — K, so n factors through the corresponding pullback of y*(A), which forces 
l = m. Thus we have shown that every object of Desc(S, (x)) is isomorphic to 
one in the image of x*: combining this with (ii), we have the desired result. 

In the second case, it follows from A1.3.5 that S is regular; so the result 
follows immediately from (the dual of) 1.5.5(ii) and the Crude Monadicity 
Theorem (A1.1.2), since we saw in 1.4.7(i) that S has S-indexed coproducts, 
and in A1.3.2(iii) that x is a cover iff 2*: S/I — S/J is conservative. (5) 
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Before proceeding further, we pause to note a result for non-regular categories 
which may be extracted from the proof of 1.5.6(iii), and which will be of use in 
Section C5.1: 


Scholium 1.5.7 Let S be a cartesian category, and let P be a class of 
morphisms of S such that 


(i) P contains all isomorphisms, and is closed under composition; 
(ii) P is stable under pullback; 
(iii) every member of P is a coequalizer; 


(iv) if a,b: R = A is an equivalence relation for which a (equivalently, b) is in 
P, then it has a pullback-stable coequalizer which belongs to P. 


Then every morphism of P is a descent morphism. 


Proof Conditions (ii) and (iii) imply that the members of P are pre-descent 
morphisms, by 1.5.6(i). Suppose given a morphism x: J — I in P, with kernel- 
pair y,z: K 3 J, and an object h: A — J of S/J equipped with descent data 
relative to x. To simplify the notation, we shall use the descent data to identify 
the pullbacks of A along y and z, and denote their common value by A’. Thus 
we have a parallel pair h*(y),h*(z): A’ 3 A, and as in the proof of 1.5.6 we 
know that this is an equivalence relation; moreover, h*(y) and h*(z) are in P, 
since they are pullbacks of y and z, which are in turn pullbacks of x. So we may 
form their coequalizer q: A > B, with the induced map k: B — I. However, we 
do not have the information that A’ = A is effective, so we cannot argue as in 
the proof of 1.5.6(iii) to show that A — 2x*(B) is an isomorphism. Instead, we 
proceed as follows. 

Let L denote the triple pullback J x; J x; J; then the cocycle condition 
for the descent data on A says that we may identify its pullbacks along the 
threé projections L — I, and we denote their common value by A”. Let u,v, w 
denote the three projections L — K. Now form the pullback of the coequalizer 
A’ = A->B along z*(B) — B; since the latter is itself the pullback of x along 
k, and since gk = xh, it is easy to see that this has the form 


h*(v) 


A’ ZOT? A! ——-> 2*(B) 


h*(w) 
h*(u) h*(y) 
B 
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where the right-hand arrow on the top line is the composite of h*(z) with the 
comparison map A — x* (B). But it is also easy to see that 


h*(v) * 
AC — ae Al BOL A 
h* (w) 


has the structure of a split coequalizer, with splittings induced by pulling back 
the diagonal map d: J + K and 1 x d: K — L along h. Hence A — x*(B) must 
be an isomorphism. O 


We now turn to the coherent coverage on a positive coherent category S. If 
(xi: Ji +I |1<i<7n) is a covering family for this coverage, we may ‘factor’ it 
as the composite of the family of coprojections (vi: J; > J | 1 <i < n) (where 
J is the coproduct of the J;) and the singleton cover (x: J — I) induced by the 
xi. Clearly, an S-indexed category is a (pre-)stack for the coherent coverage iff 
it satisfies the (pre-)descent condition for families of these two kinds. But the 
singleton families are covering for the regular coverage, so we have already dealt 
with these in 1.5.5 and 1.5.6; the other families consist of monomorphisms, and 
here again we may simplify the definition of descent data. Since a morphism 


x: J — I is monic iff 
1 
J ———> J 


J ——_—> I 
is a pullback, the ‘unit’ condition (a) in the definition of descent data on a family 
(A; € ob C% | 1 < i< n) (relative to a family of monomorphisms (z; : J; — J)) 
becomes simply the assertion that the ‘diagonal’ components f of the data are 
(modulo canonical isomorphisms) simply the identity morphisms on the A;. So 
we can forget about these components, and simply describe the descent data as 
a family of isomorphisms (fij : yz (Ai) — 23,4; | i < j), subject to the condition 


that 
Pare tiia) o, oe 
ti sR Yij (As) Se tiko (Aj) = VkYzk (Áy) 


ijn Sik) 
* * wage Fix) * * * * 
UijkYik(Ai) A UijkZik(Ák) ae Vink 2jk (Ák) 


commutes for all i < j < k. (Here we need to assume explicitly that the f;, are 
isomorphisms, because we have suppressed mention of the f;; which are their 
inverses. ) 
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In practice, we may usually restrict our attention still further, to families 
consisting of just two monomorphisms (plus the empty family), since more 
general finite families can be built up from these by induction. For a family 
(x1: Jı ~ J, £2: J2 — J) the ‘cocycle’ condition above becomes vacuous, and 
we have 


Lemma 1.5.8 An S-indered category C satisfies the descent condition for a 
pair of monomorphisms (x1: Jı = J, £2: Jog J) in S iff the diagram 


a* 
ci "> cn 


is a (pseudo-)pullback in CAT. g 


Corollary 1.5.9 Let S be a positive coherent category. Then a nonempty 
S-indexed category C is a (pre-)stack for the coherent coverage on S iff it is 
a (pre-)stack for the regular coverage, and in addition we have C° ~ 1 and, for 
each pair of objects (I, J) the functor (v*,v3): C'HET — C! x C7 is an equivalence 


(resp. full and faithful). 


Proof We have already observed that covering families can be decomposed 
into finite covering families of (pairwise disjoint) monomorphisms and regular 
covers. The initial object 0 of S is covered by the empty family, and the descent 
condition for this family says that C? ~ 1. (The pre-descent condition for it says 
that C° is either empty or equivalent to 1; but if any fibre C7 is nonempty then C° 
must be nonempty.) Hence, for a disjoint pair of monomorphisms, the pullback 
diagram of 1.5.8 reduces to a product diagram. a 


As before, we investigate what these conditions mean for the canonical 
indexing of S over itself. 


Lemma 1.5.10 
(i) If S is a coherent category, then S is a pre-stack for the coverage whose 
covering families are finite families of monomorphisms 


(zti: Ji = J|1<i<n) 


in S such that J = |J Ji - 
(ii) If S is a pretopos, then S is a stack for the coherent coverage on S. 
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Proof (i) It suffices to consider the empty family and families with two mem- 
bers. For the first, we use the fact that the initial object of S is strict (A1.4.1). 
For doubleton families, wé use an argument much like that of 1.5.6(i), relying on 
A1.4.3 and the stability of binary unions under pullback in S. 

(ii) The reason why we get a pre-stack rather than a stack in (i) is that we 
cannot, in a general coherent category, form the pushout of a pair of monomor- 
phisms with common domain. But in a pretopos we can do so (A1.4.8), and so 
in this case S is a stack for the coverage described in (i), by an argument like 
that of 1.5.6(iii). Since it is also a stack for the regular coverage by 1.5.6(iii), the 
result follows. o 


We remark in passing that a finitary disjunctive category (also called a lex- 
tensive category; cf. [227]) is one with finite limits, a strict initial object and 
binary coproducts which are disjoint and stable under pullback. Equivalently, it 
is a cartesian category S with finite coproducts, such that S is a stack for the 
coverage consisting of finite families of coprojections. We shall meet disjunctive 
categories again, albeit briefly, in Part D (see D1.3.6). 


Suggestions for further reading: Bourn [171], Bunge & Paré [205], 
Giraud [406, 407], Grothendieck [421], Reiterman & Tholen [1003]. 


B2 


INTERNAL AND LOCALLY 
INTERNAL CATEGORIES 


B2.1 Review of enriched categories 


The theory of enriched categories is normally studied in the context of a base 
category V with a monoidal structure (usually, but not always, symmetric). 
However, we are primarily interested in studying categories enriched in a topos, 
where the monoidal structure is taken to be the cartesian one (i.e. that induced 
by finite products), and so we shall save ourselves a bit of extra work by develop- 
ing the theory only for cartesian monoidal structures. Throughout this section, 
therefore, our base category V will be assumed to have finite products. 


Definition 2.1.1 (a) A V-category (or V-enriched category) C consists of a 
collection of objects together with 


(i) for each pair of objects (A, B), an object C (A, B) of V; 
(ii) for each object A, a morphism 14: 1 —> C (A, A) in VY; 
(iii) for each triple of objects (A, B, C), a morphism 


CA,B,C: C (B,C) x C(A, B) => C(A, C) 
in V, 


such that the c4,g,c are associative in the sense that 
1 
C(C, D) x C(B,C) x C(A, B) —S cC (C, D) x C(A,C) 
cxi C 


CBD) <x CAB) SS CAD) 


commutes for each quadruple (A, B,C, D), and the 1,4 act as identities for c in 
a similar sense. 

(b) If C and D are \-categories, a V-functor F: C > D consists of a function 
A |= FA from objects of C to objects of D and, for each pair of objects (A, B), 
a morphism F'4.3:C(A,B)— D(FA, FB), such that the F4,g are compatible 
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in the obvious sense with composition and identities. We call F a V-embedding 
(resp. a V-full embedding) if each F4, B is monic (resp. an isomorphism) in V. 

(c) If F and G: C > D are )-functors, a V-natural transformation a: F > G 
is a function assigning to each object A of C a morphism a4: 1 —> D(FA,GA), 
such that the diagram 


QB X Fas 
C (A, B) ————_———_—__> D (FB, GB) x D(FA, FB) 


GA,B X aM, CFA,FB,GB 


CFA,GA,GB 
D (GA, GB) x D(FA,GA) ——— Z > D (FA, GB) 


commutes for each pair of objects (A, B). 


We write V-Cat for the 2-category of V-enriched categories, functors and 
natural transformations. 


If C is a V-category, the underlying ordinary category |C| of C has the same 
objects as C, and its morphisms A — B are defined to be morphisms 1 — 
C (A, B) in V, with composition defined by gf = ca,B,c(g, f). We similarly 
define underlying ordinary functors and natural transformations of V-enriched 
ones; note in particular that the underlying ordinary functor of a V-embedding 
(resp. a V-full embedding) is faithful (resp. full and faithful). We refer to C as a 
y-enrichment of the ordinary category |C]. 

In the case when V = Set, a V-enriched category is just a locally small cate- 
gory; and when V = Cat, a V-enriched category is a locally small 2-category. In 
the former case, a given ordinary category has at most one V-enrichment (up to 
canonical isomorphism); this is because the terminal object 1 is a generator for 
Set. In general, however, there may be many different ways of enriching a given 
category over a given V. For example, the ordinary category Cat has (at least) 
three different enrichments in itself: for the first we take Cat (C, D) to be the 
usual functor category [C, D], for the second we take the subcategory of all func- 
tors C > D and natural isomorphisms between them, and for the third we take 
the discrete category of functors C — D. Similar remarks apply to enrichments of 
ordinary functors: any functor between locally small categories admits a unique 
Set-enrichment, but (for example) the ordinary functor (—)°?: Cat — Cat is 
Cat-enrichable with respect to the second and third Cat-enrichments of Cat 
defined above, but not with respect to the first. 


Lemma 2.1.2 A cartesian closed category V has a canonical enrichment over 
itself. 
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Proof We define V (A, B) to be the exponential B4, 14 to be the transpose 
of 72:1 x A —> A, and c4,B,c to be the transpose of 


1x ev 
CP x BAxA——s C8 x B “sc. 
The remaining details are straightforward. o 


For the reasons indicated above, we cannot hope to have any sort of converse 
to 2.1.2; an enrichment of V over itself need not derive from a cartesian closed 
structure. In contrast, we shall see in the next section that an indered enrich- 
ment of a category over itself necessarily derives from a locally cartesian closed 
structure. 

If Y is a cartesian closed category and C is a V-category, then for each 
object A of C we have a V-functor C (A,—):C —> V sending B to C(A, B); 
C (A, —)B,c: C (B,C) > C (A, C)E (4B) is of course just the exponential trans- 
pose of the composition map ca,g,c of C. We have an enriched version of the 
Yoneda lemma: 


Lemma 2.1.3 Let V be a cartesian closed category, C a V-category, F: C —> V 
a VY-functor and A an object of C. Then there is a bijection between V-natural 
transformations C (A, —) —> F and morphisms 1 > FA in C. 


Proof Given a V-natural transformation a, we have the composite 


la QA 
1 ———> C (A, A) ——> FA; 


conversely, given a morphism z: 1 — FA, we have the natural transformation 
whose B-component is 


FaBxz 
C(A, B) —_—-> FBFA x FA- S> FB. 


The verification that these constructions are inverse to each other is 
straightforward. O 


If C is a small V-category (i.e. one with a set of objects) and V is complete, we 
may make the V-functors C — V into the objects of a V-category [C, V]; and then 
the Yoneda lemma yields a V-full embedding C°? — [C, V]. However, we shall 
not need to develop this aspect of enriched category theory in the present work. 

The following characterization of enriched functors from a cartesian closed 
category to itself is often useful. 


Lemma 2.1.4 Let V be a cartesian closed category, and let F: V— V be a 
functor. Then specifying a V-enrichment of F is equivalent to specifying a natural 


B2.1 REVIEW OF ENRICHED CATEGORIES 307 


transformation 
TU,V 
Ux FV —> F(U x V) 


(between functors V x V — V), such that Tiy is the canonical isomorphism 
1x FV S FV S F(1 x V) and the diagram 


ly x Tv,w 
U x V x FW ————> Ux F(V xW) 


TU,V x 
TUXV,W Me 


F(U x V x W) 


commutes for all U,V, W. 


Proof Given an enrichment Fy,y: VY — FV¥Y of F, we define ty,v to be 
exponential transpose of the composite 


Fyuxv 
pas (U x VY <25 F(U x V}FY 


where À is the unit of the exponential adjunction; conversely, given 7, we define 
Fy,v to be the transpose of 


y Tyu U y F(ev) 
VY x FU ——> F(VY x U) ——> FV. 


A straightforward diagram-chase shows that the two conditions on 7 are equiva- 
lent to the compatibility of the Fy,y with identity morphisms and composition, 
and that the two constructions above are inverse to each other. The fact that 
Fy,v is an enrichment of the original functor F (i.e. that the effect of composing 
it with morphisms 1 — VY corresponds to the effect of F on morphisms U — V) 
is built into the naturality of 7 in its first variable. CJ 


Enriched functors from a cartesian closed category to itself are sometimes 
called strong functors. If F:V — V is a functor from a category with finite 
products to itself, a natural transformation 7 satisfying the conditions of 2.1.4 
is called a strength for F, even if V is not cartesian closed. 


Example 2.1.5 Suppose V has list objects in the sense of A2.5.15. Then the 
assignment U ++ LU can be made into a functor Y — V, as we observed there; 
and this functor has a canonical strength Tyv: U x LV > L(U x V), namely 
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the unique morphism making 


ly x ly x 
fa E a sta OR 5 OO EOF 


TU,V 71,72, TU V T3 


(4) S 
ano ye hO xy 


commute. (Intuitively, Ty,v is the operation which takes an element of U and 
a list of elements of V, and produces a list of elements of U x V whose first 
coordinates are all equal to the given element of U.) It is straightforward to 
verify that Ty, y satisfies the conditions of 2.1.4. In the case when V is a topos 
with a natural number object, the fact that L is a Y-enriched functor may 
alternatively be deduced from the fact that it is a V-indexed functor (i.e. the 
slice categories V/U all have list objects, and pullback functors preserve them), 
as we shall see in the next section (cf. 2.2.2). 


It is also not hard to see that, for an ordinary natural transformation 
a: F — G between strong functors V — V, the condition that a should be 
V-enriched (i.e. that it should make the diagram of 2.1.1(c) commute) is equiva- 
lent to requiring that it should commute with the strengths of F and G, in the 
sense that 


TU 
U x FV —-> F(U x V) 


ly x ay QUxV 
i 


Tyv 
U x GV ——> G(U x V) 


commutes. Of course, a natural transformation with this property is called 
a strong natural transformation. The following result can thus be stated for 
categories with finite products, since its proof makes no use of the closed struc- 
ture — although, in the applications, we shall always use it for cartesian closed 
categories. 


Lemma 2.1.6 Let V be a category with finite products, and T: V — V an (ordi- 
nary) functor which preserves finite products. Suppose further that there is a 
natural transformation n: ly — T. Then there is a unique strength on T making 
n into a strong natural transformation. If n is the unit of a monad structure 
T = (T,n, u), then this strength also makes u a strong natural transformation. 
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Proof We define Ty,y to be the composite 


ny x ly 
Ux TV ——S TU x TV © TU x V); 


it is clear that this is natural in U and V, and that it satisfies the first condition 
of 2.1.4. For the second, we need to observe that 


u xv 22% TW xv) 


~ 


nu X ny 


TU xTV 


commutes; but this is immediate from the naturality of n, since the vertical 
isomorphism is obtained by applying T to the two projections from U x V. The 
same commutative diagram verifies that 7 is a strong natural transformation 
ly —> T. To verify that u, if it exists, is also strong, we need the similar diagram 
identifying yxy With py Xx py, plus the monad identity wy -Tny = iry. 

For the uniqueness, suppose oy,y:U x TV — T(U x V) is any strength 
for T. Composing it with Try and Trz, we get natural transformations 
ayv:UxTV — TU and By y:U x TV — TV, which together determine o. 
But buv = Biv © m2 by naturality, and 61y = try by the first condition of 
2.1.4. Similarly, ay,v = au, ° T1, and ay, = ny by the fact that 7 is strong. So 
we have identified o with the strength 7 defined above. CJ 


Remark 2.1.7 If Visa category with finite products, andT: V — V is astrong 
functor which preserves 1, then there is a unique strong natural transformation 
from the identity functor to T, namely that consisting of the components 7(_) 
of the strength of T. (For cartesian closed V, this is a particular case of 2.1.3, 
since we can identify the identity functor with V (1, —).) Thus 2.1.6 tells us that, 
for an ordinary functor T which preserves finite products, there is a bijection 
between the possible strengths for T and the (ordinary) natural transformations 
ly >T. 


If F: V — W is a functor preserving finite products, then for any V-enriched 
category C we may define a W-enriched category F.C having the same objects 
as C, but 


and composition and identity operations obtained by applying F to those of C. In 
fact it is not hard to see that F, is a 2-functor V-Cat + W-Cat. Note, however, 
that C and F.C do not in general have the same underlying ordinary category, 
unless either F happens to be full and faithful, or F has a left adjoint which 
preserves 1. (The latter case is of importance for us, since it applies when F is 
the direct image of a geometric morphism. ) 
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If F: V — W is a finite-product-preserving functor between cartesian closed 
categories, then the comparison maps 6y,v: F(V") — FV*", defined before 
A1.5.2, give rise to a W-functor (which we shall also denote by F) FeV > W. 
We note that this functor is a W-full embedding iff F is a cartesian closed functor 
(and it is a W-embedding iff F is sub-cartesian-closed, as defined before C3.1.1). 
In particular, we have 


Lemma 2.1.8 If V is a locally cartesian closed category, then for any 
z:U—-+V in V the pullback functor x* induces a V/U-full embedding 
(x*). (V/V) > V/U. 


Proof We saw in A1.5.2 that pullback functors are always cartesian closed. 
o 


The above result is an important part of the motivation for the definition of 
a locally internal category at the start of the next section. 


Suggestions for further reading: Kelly [578], Kock [603, 604]. 


B2.2 Locally internal categories 


In this section we introduce the notion of locally internal category, due to 
J. Penon [955], which synthesizes the concepts of indexed category and enriched 
category. Since the definition involves enrichments in all the slice categories S/I 
of our base category S, we must henceforth require the latter to have all finite 
limits. 


Definition 2.2.1 Let S be a cartesian category. A locally internal category 
C over S assigns to each object J of S an S/I-enriched category C7, to each 
morphism z: J > J in S an S/I-enriched full embedding 2*: (z*)e(C7) — C', 
and to each composable pair (x: I — J,y: J — K) an S/I-enriched natural 
isomorphism 6, , as in 


(x*)e(y*)eC* = (x*)C” 


l Oxy x* 


((yz)").C* a cr 


(where the left vertical arrow is induced by the canonical isomorphism z*y* = 
(yx)*), subject to the usual coherence conditions. Locally internal functors and 
locally internal natural transformations are defined in the obvious way; they 
form the 1-cells and 2-cells of the 2-category of locally internal categories over 
S, which we denote by Cats. 
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It is clear that, just as an enriched category has an underlying ordinary 
category, so a locally internal category over S has an underlying S-indexed cat- 
egory. What is, perhaps, more surprising is that a locally internal category is 
determined up to isomorphism by its underlying indexed category. 


Theorem 2.2.2 For any cartesian category S, Cats is strongly equivalent 
to the full sub-2-category of CATs whose objects are locally small S-indexed 
categories, as defined in 1.3.12. 


Proof First suppose given a locally internal category C. If A and B are objects 
of CY and C* respectively, let (x,y): I + J x K be the object C7** (a+ A, n3 B) 
of S/J x K. Then, for any object (z,w): L + J x K of S/J x K, morphisms 
u: L — I over J x K correspond to morphisms 1z, — (z, w)*(C7** (rž A, 73. B)) S 
C} (2* A, w*B) in S/L; that is, to morphisms z*A — w*B in the underlying 
ordinary category of C”. Thus the underlying indexed category of C is locally 
small. 

Conversely, let C be a locally small indexed category. Given any two objects 
A, B of the same fibre C7, we have an object J — I x I indexing morphisms from 
A to B, as in 1.3.12; we define C? (A, B) to be the pullback of this object along 
the diagonal A: J + I x I. Then C!(A, B) represents the functor on (S/TI)°? 
which sends z: K — I to the class of morphisms 2*A — z*B in C*; using the 
Yoneda lemma it is straightforward to define morphisms 1; — C!(A, A) and 
C'(B,C) x C(A, B) — C'(A,C) giving C! the structure of an (S/I)-enriched 
category. Further, we see that for any z: K — I the objects CX (x* A, z* B) and 
z*(C!(A, B)) represent the same functor on (S/K)°?, so they are canonically 
isomorphic; thus z*: C’ + C* becomes an (S/K)-enriched full embedding. So 
C can be enriched to a locally internal category. 

Further, this is the only possible locally internal structure on C (up to canon- 
ical isomorphism); for, if we have such a structure, then for any z: K 4 I 
morphisms z — C/(A, B) in S/I correspond to morphisms 1g — C* (x* A, z* B) 
in S/K, i.e. to morphisms 2* A > z*B in CË. So C!(A, B), as the representation 
of a functor on (S/J)°?, is determined up to unique isomorphism for each pair 
of objects (A, B) of Cf. 

A similar argument shows that any indexed functor F': C > D between 
locally small indexed categories admits a unique enrichment to a locally internal 
functor: given objects A and B of C7, we have for any x: K — I a mapping 
induced by F* from morphisms z* A — x*B in C* to morphisms 


xz* F! A% FKe*A — > FË r*B % r*FĪB 


in D*, and so there is a canonical morphism (F!)4 5: C/(A,B) 3 D(F" A, 
F!B). This is easily seen to be an enrichment of F!, and to be the unique 
such enrichment which is compatible with the transition functors x2* of C and D. 
In the same way, any indexed natural transformation between locally internal 
functors is automatically enriched. So the operation of equipping locally small 
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indexed categories, and indexed functors and natural transformations between 
them, with the indicated enrichments defines a functor from the indicated sub-2- 
category of CALs to Cats, which is inverse to the forgetful functor up to natural 
isomorphism. O 


From now on, we shall use the terms ‘locally internal category’ and ‘locally 
small indexed category’ interchangeably, and we shall not bother to distinguish 
notationally between the two concepts. 

Next, we establish a result promised in Section B1.3. 


Lemma 2.2.3 Let S be a cartesian category. Then the canonical indexing of 
S over itself is locally small iff S is locally cartesian closed. 


Proof If S is locally cartesian closed, then by 2.1.8 S is the underlying indexed 
category of a locally interna] category, and therefore locally small. Conversely, 
suppose S is locally small; let x: J +I and y: K — I be two objects of S/T. 
Then, for any z: L — I, morphisms z > (S/I) (x,y) in S/I correspond to mor- 
phisms z*z — z*y in S/L, and hence to morphisms z x z = H,2*2 —> y in S/T, 
that is, (S/I) (x,y) has the universal property of an exponential y” in S/I. O 


Generalizing one direction of 2.2.3, we have 


Lemma 2.2.4 Suppose S is locally cartesian closed. Then, for any locally 
small S-indexed category C, the free category with S-indexed coproducts Fam(C) 
(cf. 1.4.16) is locally small. 


Proof We recall that objects of Fam(C)/ are pairs (x, A), where z: J —> I in 
S and A is an object of C7. Suppose given two such objects (x1, A1) and (x2, A2) 
in the fibres over J; and I respectively: let K — I) x Iz denote the exponential 
(13 (x2))1 (1) in S/T, x Ip, and form the diagram 


Re, oe EE 


zı xil 
J, x Ig ——> hxk 


where the square is a pullback and e is the evaluation map associated 
with the exponential. Now let M — L be the object indexing morphisms 
I*n3(A,) > e*a4(Az) in the fibre C”; then we claim that the composite 
(M) + K — J, x Ip indexes morphisms (z1, A1) —> (x2, Az) in Fam(C). 
For, given any object (z1, 22): Z — I, x Ig and a morphism Z — II, (M) over 
I, x Iz, the composite Z — II,(M) — K defines a morphism y: zï (£1) — z3(z2) 
in S/Z, and then the factorization of this through II,(M) defines a morphism 
zï (21, Aj) nd 24 (£2, A2) in Fam(C)?. o 
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If S is a topos with a natural number object, then the same argument will 
show that the construction Fam,;(—) of 1.4.17 preserves local smallness. 


Lemma 2.2.5 Let S and T be cartesian categories, and F: S + T a functor. 
If the S-indexed category I œ T/FI of 1.2.2(d) ts locally small, then F has a 
right adjoint. 


Proof First suppose F preserves the terminal object. Given an object A of 
T = T/L, we define RA to be the object T (1, A) of S. Then, for any object I of 
S, morphisms 7 — RA in S correspond to morphisms FJ*1 + FI*A in T/FI, 
and hence to morphisms FI — A in T. This correspondence is clearly natural 
in I and A; so R is right adjoint to F. 

In the general case, we can as usual factor F' as 


A 


3 
setai taf 


where F preserves the terminal object; and the argument above shows that Ê 
has a right adjoint. But Xr; also has a right adjoint, namely (F'1)*; so F has a 
right adjoint. E 


Corollary 2.2.6 Let F:S—T be a functor between cartesian categories, 
and suppose T is locally cartesian closed. Then the following assertions are 
equivalent: 


(i) F has a right adjoint. 
(ii) F*: CATr — CATs preserves local smallness. 
(iii) The indexing of T over S induced by F is locally small. 


Proof (i) = (ii) is a special case of 1.3.17; (ii) = (iii) since the canonical 
indexing of T over itself is locally small, by 2.2.3; and (iii) > (i) is 2.2.5. o 


Remark 2.2.7 In 1.4.5, we saw that the change-of-base functor along F 
preserves (co)completeness iff F preserves pullbacks; so, combining this with 
2.2.6, we see that change of base along a functor between toposes preserves 
(co)completeness and local smallness iff it is (what we called in A4.1.13) the 
inverse image of a pre-geometric morphism. This provides another sense in 
which (pre-)geometric morphisms (or at least their inverse image functors) are 
‘structure-preserving maps’ between toposes: the ‘structure’ they preserve is the 
completeness and local smallness of the canonical indexing of a topos over itself. 
We also saw in 1.3.17 that change of base along a functor with a right adjoint 
preserves well-poweredness of indexed categories, and in 1.3.14 that toposes may 
be characterized as those cartesian categories whose canonical indexings over 
themselves are well-powered, so that inverse image functors really do preserve 
‘all the structure required to define a topos’. 


We have seen in the proof of 2.2.2 that any indexed functor between locally 
internal categories is automatically an enriched functor. There is an interesting 


314 INTERNAL AND LOCALLY INTERNAL CATEGORIES B2.2 


special case in which this implication can be reversed: the following argument is 
due to R. Paré. 


Proposition 2.2.8 Let S be a locally cartesian closed category, and F: S —> S 
an S-enriched functor whose underlying ordinary functor preserves pullbacks. 
Then F extends to an S-indexed functor S —> S. Moreover, this extension 
is unique (up to canonical isomorphism) if we require that it should preserve 
pullbacks as an indexed functor, and induce the given enrichment of F. 


Proof Given an object f: A — I of S/I, we define F!(f) to be the morphism 
u: P — I, where 
P — FA 


(u,v) Ff 
TI. 
Ix Fl —> FI 


is a pullback square (and r is the strength of F, as in 2.1.4). Clearly, F? is a 
functor S/I — S/I. To show that the F? form an indexed functor, let x: I‘ — I 
be a morphism of S, and consider the cube 


eee ee A 
Ff’ 
(u’, v’) Piss ee PA 
(u,v) 
l xFi ees — FI’ Ff 
xxi Fr 


TI, 

Ix Fl ——— FI 
where f’ is the pullback of f along x. Since F preserves pullbacks, the right 
vertical face of this cube is a pullback, and the front and back faces are pullbacks 
by definition. So the left face is a pullback, from which it follows that u’ = F” (f’) 
is (isomorphic to) the pullback of uw along z. A similar calculation shows that 
F! preserves pullbacks as a functor S/I — S/I; and since 7; is the canonical 
isomorphism 72: 1 x F1 — F1, it is easy to see that F! is (isomorphic to) our 
original functor F. 
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Next, we must show that the enrichment of F'! induced by the indexing 
is the one originally given. Let J and J be objects of S. Then the mor- 
phism F;,7: J!’ — FJ"! induced by the indexing is the transpose of the 


morphism (J!)*FI — (J')*FJ obtained by applying FUO”) to the generic 
morphism (J/)*I => (J/)* J, i.e. the morphism 


1, eV 
Pak ec a eed 


Now the identification of FU) (JT)*I with (J/)*FI tells us that 


I le! I 
J’ x FI —> F(J' x I) 


Fr, 


I TJI I 
J <P == FG) 


is a pullback square; and similarly with I replaced by J in the second factor. So, 
in order to identify Fr, z with Fr J, it suffices to show that the unique morphism 
J! x FI = J! x FJ making the diagram 


I TILT I 
J! x FI —-> F(J' x 1) 


F'(m, ev) 


I TIES I 
J’ x Fl <— J x FJ —> F(J x J) 


commute is (7, Fy, 7), or equivalently (11, F(ev)oTy: 1). But this is a straightfor- 
ward diagram-chase, using the naturality of r and the two commutative diagrams 


in the statement of 2.1.4. 
For the uniqueness, we observe that for any f: A— I we have a pullback 


square 
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which we may regard as a pullback 


f= SPA 


I*f 


rı—ĉ>rI 
in S/I. So, if we are given that F! preserves pullbacks and that F/I* is 
isomorphic to I*F?, the effect of F? will be determined up to canonical iso- 
morphism once we know what it does to the morphism A: I*1 — I*I. But the 
latter is forced upon us by the enrichment: if we think of A as an J-indexed 
family of morphisms 1 — I, it corresponds to a morphism I — T! (in fact 
the transpose of nı: I x 1 — I), and its image under F? must similarly corre- 
spond to the composite of this morphism with F},z (i.e. to the morphism whose 
transpose is T;,1). 0O 


Combining this result with 2.1.6, we obtain 


Corollary 2.2.9 IfS is a locally cartesian closed category, then any cartesian 
monad on S extends uniquely to a cartesian S-indexred monad on S. O 


We do not, in fact, know any example of an enriched endofunctor of a locally 
cartesian closed category which fails to extend to an indexed functor. In the 
absence of the assumption that F preserves pullbacks, we can still prove 


Lemma 2.2.10 Let F:S— S be an S-enriched endofunctor of a cartesian 
closed category S. Then F extends, uniquely up to canonical isomorphism, to a 
Sub(1)-indered endofunctor of the restriction of S to the full subcategory of S 
consisting of subterminal objects. 


Proof Let U be a subterminal object of S. Since the composite 


Xy U* 
S/U ——~—> S ———> S/U 
is isomorphic to the identity, it is easy to see that if we have any FU: S/U — S/U 
satisfying FUU* = U* F, then FU = U* FD y. Taking the latter as a definition of 
FY , the verification that it yields a Sub(1)-indexed functor amounts to showing 
that for any object I of S we have U x F'(I) = U x F(U x I). But the morphism 
(x1, Tu,1), where 7 is the strength of F as in 2.1.4, is readily checked to be a 
two-sided inverse for 1 x F'(72). o 


Corollary 2.2.11 IfS is a cocomplete, locally small topos in which the subter- 
minal objects form a class of generators, then any S-enriched functor F: S + S 
extends, uniquely up to isomorphism, to an S-indered endofunctor. 
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Proof Let I be an object of S. By the assumptions on S, I can be expressed as 
a colimit of a small diagram D: J — S, whose vertices are subterminal objects. 
Given any object (A — I) of S/I, A is similarly a colimit of the diagram j > 
D(j) x1A, since colimits in S are stable under pullback. Applying F?) (defined 
as in 2.2.10) to the jth vertex of this diagram, and using the fact that the F?) 
form a Sub(1)-indexed functor, we obtain another diagram of type 7 in S, such 
that for any morphism j — j’ in J we have a commutative square (in fact a 
pullback) 


FP) (D(j) x; A) —> F?9)(D(j") xr A) 


DO) ST) 


Taking the colimit of this diagram, we obtain an object of S/I, which we define 
to be F'(A — I). The verification that this construction defines an S-indexed 
functor, and that it extends the Sub(1)-indexed functor already defined in 2.2.10, 
is straightforward. The uniqueness of the extension again follows from the sta- 
bility of colimits under pullback: F/(A — I) must be the colimit of its pullbacks 
along the morphisms D(j) —> I. O 


The hypotheses on S in 2.2.11 are equivalent to saying that it admits a localic 
geometric morphism to Set; cf. A4.1.9 and C1.4.7. 


Suggestions for further reading: Betti [108, 114], Penon [955]. 


B2.3 Internal categories and diagram categories 


As we emphasized in Section Al.1, the theory of categories is an elementary one, 
and so may be interpreted in the internal logic of any category S with sufficient 
structure. In fact, since the theory is a cartesian one, as defined in D1.3.4, all we 
need is that S should have finite limits. Throughout this section S will be (at 
least) a cartesian category. 


Definition 2.3.1 (a) An internal category C in S is specified by the following 
data: 


(i) three objects Co (the object of objects of C), Cı (the object of morphisms) 
and Cz (the object of composable pairs) of S; 

(ii) two morphisms dj,d}:C, — Co (interpreted as codomain and domain 
respectively), a morphism s8: Co — C, (inclusion of identities) and three 
morphisms d2, d2, dż: C2 — C (respectively the first member, the compos- 
ite and the second member of the composable pair); 
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subject to the conditions that 


d do 


Cı ———> Co 


is a pullback, that djs? = 1c, = dis}, that did? = djd? and did? = d}d3, 
that d?sj = 1c, = dst (where s: Cı — C2 is the morphism into the pullback 
induced by s§dj: C1 — Cı and 1c,, and s} is similarly defined), and that d} is 
associative in the sense that, if we form the object of composable triples C3 as 
the limit of the diagram 


Ci Ci Cı 
Co Co 


then two possible composites C3 —> C2 — C, are equal. 

(b) If C and D are internal categories, an internal functor f: C — D consists 
of morphisms fo: Co —> Do, fı: Cı —> Dı and fo: C2 — D2 commuting with the 
structure morphisms in (a)(ii) above. 

(c) If f and g: C > D are internal functors, an internal natural transforma- 
tion a: f — g consists of a morphism a: Co — D; such that dja = go, dia = fo, 
and 

di (ado, fi) = (g, adj): Cy ——> D. 


The internal categories, internal functors and internal natural transforma- 
tions in S form a 2-category, which we denote Cat(S). We also write Cat(S) for 
the underlying 1-category of €at(S). 


The reader may readily verify that Cat(Set) is just the 2-category Cat of 
small categories, as previously defined. We also note that a cartesian functor 
S — T induces a 2-functor Cat(S) —> Cat(T), in an obvious way. Moreover, 
Cat(S) is itself a cartesian category; in fact the forgetful functor Cat(S) — S’ 
which sends C to (Co, C1, C2) creates finite limits. 


Remark 2.3.2 There is some redundancy in Definition 2.3.1, in that the object 
C2 and the morphisms d and d3 do not need to be (and in general, will not be) 
explicitly specified as part of the definition of an internal category, being recover- 
able from the rest via the pullback diagram contained in the definition. Similarly, 
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when we define an internal functor we do not need to specify fz explicitly, since 
it is uniquely determined by fo and fi. 

Nevertheless, the notation we used was intended to be suggestive that the 
definition is in a sense incomplete. If we define Cn, the object of composable 
n-tuples of morphisms in C, by an appropriate finite limit diagram in S (gen- 
eralizing that which appeared in the definition for the case n = 3), then we 
have (n + 1) morphisms d?: Cn > Cy~1 (0 <i < n), of which the first and last 
represent the operations of dropping the last and first members of a composable 
n-tuple, and the others represent those of composing one of the (n — 1) adja- 
cent pairs. Similarly, we have n morphisms sem Cn-1 > Cn, which represent 
the insertion of identity morphisms at each possible point in an (n — 1)-tuple. 
Moreover, these morphisms satisfy the simplicial identities 


didji = dG (i<j), 


sjus = sps (i<j), and 
dhs id (i<j) 
=le, (J <i<j+l) 


that is, they make (n +» Cn) into a simplicial object in S, i.e. a functor A® — S, 
where A is the category of nonempty finite totally ordered sets and order- 
preserving maps. (Note: as an object of A, n denotes the (n + 1)-element totally 
ordered set {0,1,...,2}; this ‘dimension shift’ is not only traditional but seem- 
ingly unavoidable.) Similarly, an internal functor f:C — D extends uniquely 
to a morphism of simplicial objects, i.e. a natural transformation between the 
corresponding functors A°? — S; thus we can identify the l-category Cat(S) 
with a full subcategory of the category Simpl(S) = [A°?, S] of simplicial objects 
in S. (Although we shall not need this fact, it is worth noting that the objects in 
this subcategory are, up to isomorphism, precisely the functors A°P — S which 
preserve those pullbacks which exist in A°?.) The notion of internal natural 
transformation also corresponds to a simplicial concept, namely that of a sim- 
plicial homotopy between morphisms of simplicial objects; however, simplicial 
homotopies cannot be composed in general, so Simpl(S) cannot be extended to 
a 2-category. 


Having got the formal definition out of the way, we shail henceforth omit the 
superscripts from the names of the morphisms d? and s? whenever possible, the 
context will usually make it clear which ‘dimension’ is intended. The simplicial 
object corresponding to an internal category is commonly called its nerve, but 
we shall not find it necessary to distinguish notationally between the two; for us, 
an internal category may be identified with its nerve whenever it is profitable to 
do so. 
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Next, we tackle the relationship between internal and indexed/locally inter- 
nal categories. If C is an internal category in S and J is an object of S, we 
write C/ for the ordinary category whose objects are the morphisms J — Co in 
S, and whose morphisms are the morphisms J — C, the operations ‘domain’, 
‘codomain’, ‘identity morphism’ and ‘composite’ being induced in the obvious 
way by composition with the structural morphisms of C. If z: I > J in S, then 
composition (on the other side) with z induces a functor z* : C7 — C7; the assign- 
ment z ++ z* is (strictly) functorial, and so J + C! becomes an S-indexed 
category. Once again, we shall usually not bother to distinguish notationally 
between this indexed category and the internal category C; but when it is nec- 
essary to make the distinction we shall denote the indexed category by [C]. The 
justification for this abuse of notation is contained in 


Lemma 2.3.3 The assignment C +> [C] is a full embedding of 2-categories 
Cat(S) > CATs. 


Proof [Itis clear that an internal functor f: C — D induces, by composition, 
an indexed functor [C] — [D]; and similarly for internal natural transformations. 
So the assignment is 2-functorial. The fact that, for each pair (C, D), it induces 
an equivalence of categories between €Cat(S) (C, D) and [[C], [D]] is an easy exer- 
cise in the use of the Yoneda lemma: for example, given an indexed functor 
F: [C] =. [D], we define fo = Fæ (1c): Co _ Do and fi = F(1¢,): Cı _ Dy, 
and verify that these define an internal functor C — D; then the coherence 
isomorphisms for F' yield an indexed natural isomorphism between F' and the 
indexed functor induced by f. o 


The reader should be warned not to read into Lemma 2.3.3 more than it 
actually says: the functor is a full embedding of 2-categories, but not a full 
embedding of 1-categories. In particular, although it follows that any indexed 
equivalence between [C] and [D] must be induced by an internal equivalence 
between C and D, it is possible for [C] and [D] to be isomorphic when C and D 
are not. The following simple example is due to R. Paré. 


Example 2.3.4 Let C and D be objects of a cartesian category S which ‘pro- 
vide a counterexample to the Cantor-Bernstein theorem in S’: that is, such 
that there are monomorphisms u: C — D and v: D — C but no isomorphism 
C = D. (Such pairs of objects can easily be found in suitable non-Boolean 
toposes; see D4.1.12.) Let C be the indiscrete internal category with object of 
objects C (that is, take Ca = C”*! for all n, with the d; taken to be appro- 
priate product projections and the s; induced by the diagonal map), and let D 
be similarly constructed from D. Then it is clear that C and D are equivalent, 
but not isomorphic, in Cat(S). However, for each J, composition with u and v 
induces injections in either direction between S(I,C) and S(I, D), so we can use 
the (external) Cantor—Bernstein theorem to find a bijection between these sets, 
which will be an isomorphism between the (indiscrete) categories C7 and D!. We 
cannot hope to choose these bijections so that they are exactly compatible for all 
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I (that is, commute with composition with a fixed x: T — J); but, again because 
the categories are indiscrete, they will do so up to unique natural isomorphism, 
and so we have defined not just an equivalence but an isomorphism of indexed 
categories between [C] and [D}. 


The moral of this example, if it has one, is that isomorphism of indexed 
categories is not really a sensible concept to study: the definition of indexed 
category is so ‘loose’ that CALs has to be considered as a 2-category, not as a 
l-category. We shall say that an S-indexed category is essentially small if it is 
equivalent to [C] for some internal category C; this replaces the unsatisfactory 
notion of essential smallness which we discussed in 1.3.13. 

It is easy to see that an indexed category of the form [C] is locally small: 
if z: I + Co and y: J — Cp are two objects of it, then the object of S/I x J 
which indexes the morphisms from x to y in [C] is the left vertical arrow in the 
pullback 

P ———-3 C; 


(dı, do) 
TXY 
Ix J — > Co x Co 


(and the generic morphism itself is the top edge of this square). In the oppo- 
site direction, we have the following important method of constructing internal 
categories from locally small indexed categories, which helps to explain why the 
latter are also called locally internal categories. 


Definition 2.3.5 Let C be a locally small S-indexed category, and let A be an 
object of C? for some I. We define the internal full subcategory of C generated by 
A, which we denote by C[A], as follows. Its object of objects Co[A] is simply J, 
and (d1, do): Ca [A] + Co[A] x Co[A] is the object of S/I x I indexing morphisms 
from A to A. The morphism sg: Co[A] — C,[A] corresponds (modulo coherence 
isomorphisms) to the identity morphism A — A in C’, and dy: C2[A] > C,[A] 
similarly corresponds to the composite of the morphisms in C©?!4] obtained by 
pulling back the generic morphism along do and d2. It is straightforward to verify 
that the foregoing data does define an internal category. 


C[A] comes equipped with a canonical indexed functor (from the indexed 
category corresponding to it) to the original category C: this sends an object 
z: J —+I to z*(A) € ob C7, and a morphism y: J > C,[A] to the mor- 
phism (d,y)*(A) — (doy)*(A) in C7 which corresponds to it. It is easy to 
see that this indexed functor is full and faithful; hence the name ‘internal full 
subcategory’. 

We recall that, given a functor F': S — T, we have a ‘re-indexing’ functor F* 
from T-indexed categories to S-indexed categories; on the other hand, given an 
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internal category C in T, we require a (pullback-preserving) functor G: T +S 
to produce an internal category G(C) in S. The two constructions are linked in 
an obvious way: 


Lemma 2.3.6 Let F: S —T be a functor having a right adjoint G, and let C 
be an internal category in T. Then the S-indexed categories F*|C] and [G(C)] 
are equivalent. 


Proof For any object I of S, we have a bijection between morphisms F(T) > 
C; (i = 0 or 1) in T and morphisms I — G(C;) in S. This sets up an equivalence 
(which is in fact an isomorphism) between the two S-indexed categories. 0 


We shall most frequently use this result when S and T are toposes, and F' and 
G are the inverse and direct images of a geometric morphism. We shall also need 


Corollary 2.3.7 Let G:S—T be a functor having a left adjoint F which 
preserves pullbacks (for example, the direct image of a pre-geometric morphism, 
as defined in A4.1.13). If C is an internal category in S which is S-complete 
(resp. S-cocomplete) as an indexed category, then G(C) is T-complete (resp. 
T -cocomplete). 


Proof This follows from 2.3.6 and 1.4.9. 0 


There is a well-known result due to P. Freyd (cf. [487]) which says that a com- 
plete small category must be a preorder. The proof of this result requires the law 
of excluded middle, and so one might expect it to fail for indexed categories over 
a non-Boolean base topos S; we shall see that it does indeed fail in certain cases, 
in Chapter F2. However, in the main we shall be interested in applying 2.3.7 to 
internal posets, that is to internal categories C such that (do, d1): C1 —> Co x Co 


is monic and 
So 
Co ———————-> C1 


So (do, d1) 


(dı, do) 
Cı ———> Co x Co 


is a pullback. 


Examples 2.3.8 (a) One particular internal poset, in any topos, with which 
we are already familiar is the subobject classifier Q, equipped with the partial 
ordering 92, defined in A1.6.3. Of course, the indexed category corresponding to 
this internal category is (up to equivalence) the category Sub of 1.2.2(g), and 
we observed before 1.4.7 that Sub is complete and cocomplete over any Heyting 
category (in particular, over any topos); so we deduce that Q is a complete 
and cocomplete internal poset in any topos. Applying 2.3.7 with G = (—)4 and 
F = (—) x A, we deduce that any power object PA in a topos, made into a poset 
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PA with its canonical ordering defined in A2.3.2, is complete and cocomplete; 
applying it to the direct and inverse images of a geometric morphism f: S >T, 
we deduce that f,({2) (and more generally f,(IPA) for any object A of S) is a 
complete and cocomplete internal poset in T. 

(b) Another example of interest derives from the notion of local opera- 
tor which we studied in Sections A4.4 and A4.5. The external poset Lop(S) 
which we defined in A4.5.10 may be made into an indexed poset Lop(S), 
by setting Lop(S)! = Lop(S/J) (and, for a local operator j: 1*Q — I*Q 
on S/I, z*(j) is simply the pullback of j along z: K — I in S). It is easy 
to see that we may construct a subobject Lo — QÊ, as an intersection of 
three appropriate equalizers, such that a morphism J — QÊ? factors through 
Lo iff its transpose [*2 — I*Q is a local operator on S/I; hence we can 
construct an internal poset L such that [L] ~ Lop(S). Moreover, the construc- 
tion of the local operator generated by a subobject of Q, given in A4.5.13(i), 
commutes with pullback functors, so it defines an S-indexed (and hence inter- 
nal) left adjoint for the inclusion L — PQ, from which it follows that L 
is S-complete and cocomplete (cf. 2.3.9 below). We note in passing that 
the constructions of A4.5.19 also give rise to S-indexed adjunctions between 
L and Q. 


For internal posets in a topos, we have an alternative characterization of 
(co)completeness in terms of ‘taking sups of subsets’ rather than ‘taking sups 
of indexed families’. Given a poset A = (A; 3 A) in a topos S, let |: A— PA 
be the name (in the sense of A2.1.1) of the relation (do, d1): Ai — A x A (note 
that this is the opposite of the order-relation as we usually think of it). If S = 
Set, | is the mapping which sends a to the principal ideal {b € A | b < a}, 
which explains the notation. It is easy to verify that | is an order-preserving 
map A — PA. 


Lemma 2.3.9 For an internal poset A in a topos S, the following are 
equivalent: 


(i) A is cocomplete, i.e. [A] is an S-cocomplete indexed category. 
(ii) |: APA has a left adjoint \(¥: PA— A in the 2-category of internal 
posets in S. 
(iii) | has an order-preserving left inverse. 
(iv) A is a retract of a poset of the form PB. 


Proof (i) => (ii): We define V:PA—A to be &,(e), where 
(n,e):€a>— PA x A is as usual the generic relation PA % A. We must first 
verify that V is order-preserving. Suppose given maps z1, £2: J 3 PA satisfying 
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£1 < T2; this means that if we form the pullbacks 


qi 
Ji ——--> E4 


Pi n 


Ti 
I ———> PA 
(i = 1,2), then Jı < Jz as subobjects of I x A; that is, we have a monomor- 
phism m: Jı — J2 satisfying pom = pı and eqgm = eqı. Now we have 
V zi = 27 En (e) = pq? (e) by the Beck-Chevalley condition; but 


Ep, qi (e) =, Spo Umm" g3(e) < Dp, 92 (e) 


since (Sm 1 m*) and Xp, is order-preserving. 
Now we verify that V is left adjoint to |. We have a pullback diagram 


Z 
A, —— E4 
do n 


A> PA 


by the definition of |, where the composite ez is dı; so by applying the Beck- 
Chevalley condition to this square we obtain V |= Xadi. But since dı < do in 
the poset S (Ai, A), we have Xg,di < Eaodo = Eaodò(14) < 14. On the other 
hand, we have V n = n*En(e) > e; that is, the morphism (\/n,e):€4— Ax A 
factors through (do,di) (say by y:€4—> Ai). Thus we have a commutative 
diagram 


y z e 
Ea ———> A, ——> €, — > A 


n do n 


V 


PA ———> A — PA 


where the top composite is e, so we conclude that (n,e) is less than or equal 
to the relation named by | V in the poset of relations PA % A, i.e. lp, <} V 
in the poset of maps PA — PA. Thus we have constructed the unit and counit of 
the required adjunction; the triangular identities are trivial, since we are dealing 
with posets. 

(ii) = (iii): It follows easily from reflexivity and antisymmetry of the order- 
relation that | is monic; so the counit V |< 1, of the adjunction is necessarily 
an equality. And (iii) > (iv) is trivial. 
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(iv) = (i): A poset is trivially Cauchy-complete, since it has no nontrivial 
idempotents; so 1.1.10 enables us to lift the left adjoints for the transition maps 
of [PB] to left adjoints for those of [A]. The Beck-Chevalley condition for [A] 
also follows trivially from that for [PB]; so [A] has S-indexed coproducts. The 
fact that the individual posets [A]! = S (I, A) have finite coproducts (joins) may 
be deduced from 1.4.3 (or from the fact that the posets [PB]! have them); and 
equalizers are again trivial because we are dealing with posets. QO 


Note that the proof of (iv) = (i) in 2.3.9 shows that a cocomplete inter- 
nal poset in a topos is necessarily also complete, since [PB] is both complete 
and cocomplete. We shall prove the corresponding result for arbitrary internal 
categories in the next section. It is also possible to give a direct proof of (ii) > 
(i): given V, and a pair of morphisms g: J — I and u: J > A in S, we define 
E(u) to be the composite 


Ju V 


pn E 
FP oP A E A 


where r°, as in A3.1.3, denotes the relation J % J which is tabulated by 
(z,1y): J> Ix J. We omit the detailed verification that this has the required 
properties. 

For future reference, we note a couple of applications of 2.3.9: 


Corollary 2.3.10 
(i) A logical functor F: S + T between toposes preserves (co)completeness of 
internal posets. 


(ii) If j is a local operator on a topos S, then an internal poset in S is 
sh;(S)-cocomplete iff it is S-cocomplete and its underlying object is a 
j-sheaf. 


Proof (i) Since F preserves power objects, it clearly preserves any of the 
conditions (ii), (iii) and (iv) of 2.3.9. 

(ii) One direction is immediate from 2.3.7, since the inclusion sh;(S) — S is 
a direct image functor. Conversely, suppose that A is cocomplete as an internal 
poset in S, and that its underlying object A is a j-sheaf. Then its order-relation 
A; is also a j-sheaf (since it may be expressed as the equalizer of 


T2 
AxAZ SA 
V 


where V is the binary join map) and hence j-closed in A x A by A4.3.8; so the 
classifying map of A; — A x A factors through Q; — Q, and hence | fac- 
tors through the subobject P;A = (Q;)4 ~> PA which is the power object 
of A in sh;(S). So this factorization, and the restriction of V: PA — A to 
P,A, express A as a retract of a power object P;A in the category of internal 
posets in sh;(S). 0 
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Next, we turn to the study of indexed functors from internal categories 
to arbitrary indexed categories. Once again, the Yoneda lemma enables us to 
describe these in simpler terms: 


Definition 2.3.11 Let C be an internal category in a cartesian category S, 
and D an indexed category over S. By a diagram of shape C in D, we mean a 
pair (F',¢), where F is an object of DO? and ¢: di F — dF is a morphism in 
DC: , such that s§@ is (modulo coherence isomorphisms) the identity morphism 
F — F, and d}¢ is similarly the composite of d*¢ and d5¢ in D©?. A morphism of 
diagrams (F,¢) — (G, Y) is a morphism F — G in D® which commutes in the 
obvious sense with ¢ and 7. We shall write D© (or [C, D], according to context) 
for the category of diagrams of shape C in D and morphisms between them. 


Examples 2.3.12 (a) For any object I of S, the discrete internal category I 
corresponding to I has its objects of objects and morphisms both equal to I and 
all its structure morphisms equal to 17; it is easy to see that the corresponding 
indexed category is the one which we called I in 1.2.5. Moreover, if (F, ¢) is a 
diagram of shape I, then it is easy to see that ¢ must be the identity morphism 
on F; so the diagram category D! is isomorphic to D/. In fact we can regard 
the assignment J +> I as a (full and faithful) functor S — Cat(S); and, since an 
internal functor f:C — C’ induces a functor f*: DU — DE in an obvious way, 
we see that the assignment C++ D® yields a canonical extension of an arbitrary 
S-indexed category D to a Cat(S)-indexed one. Similarly, any S-indexed functor 
can be canonically extended to a Cat(S)-indexed functor. 

From now on, we shall tend to identify objects of S with discrete internal 
categories whenever we need to do so, without explicitly commenting on the fact. 
For example, we make D© into an S-indexed category D© by setting 


C\I _ a~rixC 
(D) =D 


(where the exponent on the right really means the product of the given cate- 
gory C and the discrete category I). Equivalently, (DC) may be identified with 
the category of diagrams of shape C in the indexed category D’, as defined in 
1.2.2(f). 

(b) If S is a positive coherent category, then any finite (external) category 
C may be identified with an internal category in S, via the coherent functor 
A: Sets — S of Al.4.7. If D is an S-indexed category which is a stack for the 
coherent coverage, so that D“! is an I-indexed product of copies of D! for each 
finite set I, then it is easy to see that a diagram of shape AC in D is essentially the 
same thing as an ordinary functor C > D!. More generally, if S is an oo-positive 
geometric category, and D is a stack for the coverage derived from arbitrary 
coproducts in S, we can similarly identify functors C — D! with diagrams whose 
shape is an internal category in S, for any small category C. 

(c) Of course, we say that an internal category C is a groupoid if there exists a 
morphism t: Ci — C satisfying commutative diagrams which correspond to the 
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assertion that t sends each morphism of C to its inverse. For future reference, we 
note that if C is a groupoid, then in addition to the pullback square appearing 
in Definition 2.3.1(a), we also have pullback squares 


d d 
C2 ene Sen Cı and C2 a Cy ; 
do do dı dı 
do dy 
Ci ————> Co Ci ———> Co 


for if f,g: I = Cı are morphisms satisfying dof = dog, then it is easily verified 
that the factorization of (f,di(tf,g)): I — Cı x Cı through Ch is the unique 
factorization of (f,g) through (do, di), and similarly for the right-hand square. 
Note also that an equivalence relation on an object of S, as defined in A1.3.6, is 
an internal groupoid: indeed, equivalence relations are exactly those groupoids G 
which are also preorders (i.e. such that (dg, di): Gi —> Go x Go is monic). If G is 
an effective equivalence relation (i.e. occurs as the kernel-pair of some morphism 
rz: Go — I), then the notion of a diagram of shape G in D is exactly that of an 
object of DC? equipped with descent data for the singleton family (x), as we 
defined it in Section B1.5. 


Lemma 2.3.13 Let C be an internal category in a cartesian category S, and 
D an indexed category over S. Then DE is equivalent to the category [C, D] of 
indexed functors C — D and indexed natural transformations between them, as 
defined in Section B1.2. 


Proof Given an indexed functor G:C — D, we define F = G@°(1g,) and 
¢ = G(1c¢,). Conversely, given a diagram (F,¢), we define G by setting 
G!(x: I —> Co) = x*F and G/(y: J + C1) = y*@: (diy)*F — (doy)*F. The 
remaining details are straightforward. o 


We note that Example 1.2.5 was a special case of the above lemma, in which 
the internal category C was taken to be discrete, as in 2.3.12(a). The lemma may 
be extended in an obvious way to an equivalence between D© and the indexed 
functor category [C, D} of Section B1.2. 

A corollary of 2.3.13, which will be of use in Section B3.2, is the following: 


Corollary 2.3.14 LetG:T — S be a functor preserving pullbacks; let C be an 
internal category in T, and D a S-indezed category. Then the category [G(C), D] 
is equivalent to [C, G*D}. 


Proof It is straightforward to verify that a diagram of shape G(C) in D, as 
defined in 2.3.11, is the same thing as a diagram of shape C in G*D. So this is 
immediate from 2.3.13. 0 
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We note that if G has a left adjoint F, then 2.3.14 becomes a corollary of 
2.3.6, since change of base along F is then left adjoint to change of base along G. 
The next result is a useful extension of 1.3.18. 


Lemma 2.3.15 Let S be a locally cartesian closed category, and C an internal 
category in S. 


(i) If D is a locally small S-indezed category, then so is DE. 

(ii) Jf D is an internal category in S, then [D]© is essentially small, and may be 
identified with [DF], where D© is the exponential in the (cartesian closed) 
category Cat(S). 


Proof (i) We shall construct an enrichment of DE over S; the extension of 
this to an indexed enrichment, i.e. a locally internal category structure on DË, 
is straightforward. Let (F,¢) and (G,~) be objects of D©, and let z: I > Co 
be the object of S/Co indexing morphisms F — G in D©, and y: J > C; the 
object indexing morphisms dt F — dG in D@:. If we form the pullback 


, T’ 
I ——— Ci 


dı 


I —5> O 


then z’ indexes morphisms d4 F — d4G in D“; if we compose the generic such 
morphism with % we get a morphism dj F — d§G, and this operation of composi- 
tion induces a morphism g’ — y in S/C\, or equivalently a morphism x — Ha, y 
in S/Co. Similarly, the operation of composition with ¢ yields a morphism 
z — Hay. If we apply Ic, to each of these, we get a parallel pair of mor- 
phisms Hoz 3 Ic, y in S; taking the equalizer of these produces the required 
object of morphisms from (F, ¢) to (G, 4%). 

(ii) Given (i), to show that [D]© is essentially small we need only construct an 
object of S indexing its objects (up to isomorphism). But, by 2.3.3, any indexed 
functor [C] — [D] is canonically isomorphic to one induced by an internal functor 
C — D, and we may index the latter by an appropriate subobject of D °x p 
(specifically, an intersection of equalizers corresponding to the equations which a 
pair (fo, f1) must satisfy in order to define an internal functor). The fact that this 
construction, combined with that of part (i) for defining the object of morphisms 
of DÜ, defines an exponential in Cat(S) is entirely straightforward. 0 


We remark that, for cartesian closedness of Cat(S), we need only the (proper) 
cartesian closedness of S, not local cartesian closedness. On the other hand, 
Cat(S) is not locally cartesian closed even if S is: we have already observed in 
Section A1.5 that this fails for S = Set. 
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Proposition 2.3.16 Let C be an internal category in S, and let D be an 
S-indezed category having S-indexed products (resp. S-indered coproducts), as 
defined in Section B1.4. Then the category D© is comonadic (resp. monadic) 
over D@°. Moreover, the indexed category DE has S-indexed products (resp. 
coproducts). 


We remark in passing that we have seen the first part of this result before, 
in a particular case: when C is the internal groupoid associated to an effective 
equivalence relation, as in 2.3.12(c), it is just Proposition 1.5.5(i). 


Proof We shall give the proof in the case when D has products; the other case 
is exactly similar. Consider the functor 


di H 
Do 0 > Do dı > D%. 
we claim that this has a comonad structure. The counit is the composite 
Ha, do —> Ha, Mso Sodo =] 


of which the first part is induced by the unit of (sj 4 Hs,) and the second is 
the coherence isomorphism arising from the fact that diso = doso = 1c,; the 
comultiplication is similarly obtained as 


Tg, ds —> Ma, Ma, did & Da, a, didi —> Da, dla, d 


where the final step uses the Beck-Chevalley condition for the pullback square 
displayed in Definition 2.3.1(a). The fact that these natural transformations 
satisfy the commutative diagrams for a comonad follows straightforwardly from 
the equations in the definition of an internal category. 

Now, given an object F of D®, it is also straightforward to verify that 
a morphism ¢:d*F — djF in DĪ! makes (F,@) into a diagram of shape C 
iff its transpose ¢: F — Ia, dF is a coalgebra structure for the comonad just 
described; and morphisms of diagrams similarly correspond to morphisms of 
coalgebras. Thus D© is isomorphic to the category of IMa, dj-coalgebras. 

For the second assertion, we note that for each object J of S we have a 
comonad on D@e*! whose functor part is Ha, xı(do x 1)*, and whose category 
of coalgebras is equivalent to D©X!. Moreover, the Beck-Chevalley condition 
ensures that these comonads fit together to form an indexed comonad, and that 
the functor part of this comonad preserves S-indexed products. So the result 
follows from (the dual of) 1.4.15(ii). o 


Of course, in the situation of 2.3.16, the indexed category D© also inherits 
S-indexed finite limits (resp. colimits) if these are present in D. 


Corollary 2.3.17 Let C be an internal category in S, and let D be an 
S-complete indexed category such that the fibre D©° is a topos (resp. a 
quasitopos). Then D© is a topos (resp. a quasitopos). 
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Proof The functor dj is cartesian because D is S-complete, and Ha, is carte- 
sian because it has a left adjoint. So this follows from A4.2.1. O 


In particular, we may now deduce the analogue of A2.1.3 where Set is 
replaced by an arbitrary (quasi)topos. 


Corollary 2.3.18 If S is a (quasi)topos, then so is S© for any internal 
category C in S. 


Proof Since S is locally cartesian closed, its canonical indexing over itself is 
S-complete by 1.4.7; and the fibres of this indexed category (the slice categories 
S/T) are (quasi)toposes by A2.3.2 (or its quasitopos analogue). So this is imme- 
diate from 2.3.17. 2 


Remark 2.3.19 In view of 2.3.12(b), Corollary 2.3.18 contains as a particular 
case the result, promised after A2.1.3, that [C, S] is a topos for any finite category 
C and any topos S. Further, if S is cocomplete, then [C, S] is a topos for any 
small category C. 


In another direction, 2.3.16 yields the result, promised in Section B1.4, that 
S-complete indexed categories have ‘all small limits’. 


Proposition 2.3.20 Let D be an S-complete (resp. S-cocomplete) indexed cat- 
egory. Then the canonical extension of D to a Cat(S)-indered category, defined 
in 2.3.12(a), has Cat(S)-indezed finite limits (resp. colimits), and its transition 
functors have right (resp. left) adjoints. 


Proof As previously, we do only the complete case; the cocomplete one is 
similar. If f: C — C is an internal functor in S, then we have a commutative 
diagram 
ae 
DE ——> DE 


Do Pash ears DCo 
where the vertical arrows are forgetful functors. Since the comonads on DC? 
and Do are cartesian, by the argument in the proof of Corollary 2.3.17, these 
forgetful functors create finite limits; so the categories D? and D© have finite 
limits, and f* preserves them since fë does. Also, since D© has equalizers, we 
may lift the right adjoint Ij, of fò to a right adjoint for f*. O 


We normally write lim ¢ and lim p for the right and left adjoints, respectively, 
of f*: D% — DC, when they exist; in the case when S is Set and D is the naive 
indexing of an ordinary complete (resp. cocomplete) category, they are familiar 
as the right and left Kan extension functors along f, and we shall use the same 
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names for them in this more general context. In particular, when C’ is the 
terminal object 1 of Cat(S), so that DT reduces to the underlying ordinary 
category D of D, then we write lime and lime for these adjoints, and think of 
them as the functors assigning limits and colimits to arbitrary diagrams of type 
C in D. 

We can interpret 2.3.20 as saying that an S-complete (resp. S-cocomplete) 
indexed category is A-complete (resp. A-cocomplete) in the sense of 1.1.16 as 
an object of CATs, where A is the class of all morphisms between (the canon- 
ical indexings of) internal categories in S. However, it does not assert that the 
extension of D is Cat(S)-complete or cocomplete: in general, the Beck—Chevalley 
conditions fail to hold for pullback squares in Cat(S), as we saw for the case 
D = S = Set in 1.4.4(6). But there is an important class of pullback squares for 
which they do hold: 


Scholium 2.3.21 Under the hypotheses of 2.3.20, the Beck-Chevalley condi- 
tion holds for pullback squares of the form 


fxi 
B x C ——>BxC 


1xg 1xg 
fxi 


B x C —>BxC 
in Cat(S). 


Proof Consider first the case when B and B’ are discrete categories, cor- 
responding to objects B and B’ of S. The construction of the functor 
lim; xg: DB*C — DBXT uses the functor I xg: DP*Co + DBXCo, the cofree 
and forgetful functors between these categories and the diagram categories, and 
the construction of an equalizer in D?*C’; but the functors (f x 1)* commute in 
a suitable sense with all of these, and so the diagram 


* 
DB’ xc (f x 1) > DBxc 
limi xg lim i xg 


DB xc (f x 1) > DBxCc 


commutes up to an isomorphism, which is easily seen to be the canonical natural 
transformation between the two composites. In other words, the functor lim, 
extends to an S-indexed right adjoint for g*: D% — DE. 

In the general case, an easy extension of 2.3.13 shows that we may identify 
the diagram category DB*C with the category [B,[C,D]] of indexed functors 
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from B (that is, from the indexed category [B]) to the S-indexed functor category 
[C, D] defined before 1.2.5. Moreover, this equivalence identifies (f x 1)* with the 
funetor induced by composition with f on the right, and (1 x g)* with [B, g*]. 
Since [C,—] is a 2-functor CALs — CAT, the right adjoint of the latter is 
given by [C, lim ,]; and this clearly commutes with the operation of composition 
with f. g 


We shall also see in 2.5.11 that, at least for the case when D is S itself, we 
do have the Beck-Chevalley condition for comma squares in Cat(S): thus we 
may say that S is pointwise A-cocomplete in the sense of 1.1.16, for the class A 
mentioned earlier. 

We conclude this section with the ‘relative’ version of A4.1.4, which is an 
easy consequence of 2.3.20. 


Corollary 2.3.22 Let S be a topos. Then, for any internal functor f: C — D 
in S, the functor f*: SP — SE is the inverse image of a geometric morphism; 
and in fact the assignment C ++ S© becomes a functor Cat(S) — Top/S. 


Proof The canonical indexing of S over itself is both S-complete and 
S-cocomplete, by 1.4.7; so f* has both left and right adjoints. The construction 
is clearly (pseudo-)functorial in f; and it maps the terminal object of Cat(S) to 
(a category isomorphic to) S itself. o 


We note that, if we identify objects of S with discrete internal categories, 
then the functor defined in 2.3.22 extends the functor S — Zop/S of A4.1.2. In 
fact we can also make it into a 2-functor Cat(S) — Yop/S: an internal natural 
transformation a: f — g between internal functors C — C’ induces a natural 
transformation f* — g*, whose value at an object (F, ¢) of S% is the morphism 
a*ġ: f* F S aF — a*d F = g*F. 


Suggestions for further reading: Betti [112, 113], Bunge [183], Pavlović [941], 
Street [1131]. 


B2.4 The Indexed Adjoint Functor Theorem 


We have now assembled all the machinery necessary to prove the Adjoint Functor 
Theorem in the context of S-indexed categories, where S is an arbitrary cartesian 
category. It is well known that there are two versions (‘General’ and ‘Special’) of 
this theorem, and both of them have indexed analogues (see [933]); but we shall 
prove only the ‘Special’ version here ~— it is this version that tends to be more 
useful in applications. 

We begin by introducing the indexed version of the notion of separating 
family (cf. A1.2.4). 


Definition 2.4.1 Let C be an S-indexed category. An object G of C! (for some 
object I of S) is called a separating family for C if, given any J € ob S and any 
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f,g: A= B in C” with f Æ g, there exists a span 


[2 =k ue dees J 
in S and a morphism h: x*(G) > y*(A) in C* with y*(f)h 4 y*(g)h. In the 
particular case when J is the terminal object 1 of S, we call G a separator. 


Examples 2.4.2 (a) For the naive indexing of an ordinary category C over 
Set, a separating family is exactly what we meant by that term in Section A1.2. 
For if (G;)ie; is a separating family in the usual sense, and we are given two 
J-indexed families (f;,9;: A; = B;);ey which are not equal, then there exists 
j € J with f; Æ gj, and there exists i € I and h: G; > A; with fih Æ g;h, so 
we have a span of the required form with K = 1. The converse is similar. 

(b) For the canonical indexing S of S over itself, the terminal object 1 of 
S = S! is always a separator. For if u,v: y 3 z are two unequal maps in S/J, 
then we take K = dom y and the span 


1 pan ee K aes Ja 
together with the diagonal map h: K — K xz K regarded as a morphism 
K*(1) > y* (y) in S/K. Then y*(u)h and y*(v)h are simply the graphs of u 
and v, so they are not equal. However, this example does not always have a 
coseparating family (though it does if S is a topos; cf. 3.1.13 below). 

(c) A small (that is, internal) category C always has a separating family, 
namely the family of all objects of C (that is, the identity morphism lc, in 
ob C°). Once again, this is easy to verify: we take K = J, y = lj and z to 
be the morphism J — Cy which is the domain of the given parallel pair in C’, 
together with the identity morphism z*(1c,) > =. 

(d) Let L: C — D be an indexed functor having an indexed right adjoint F 
which is faithful (i.e. such that F” is faithful for each 7). Then if G € ob C! is 
a separating family for C, it is easy to verify that L/(G) is a separating family 
for D. For if f,g: A = B are unequal maps in D’, we have F” (f) 4 F7 (g); so 
we can find a span (z,y) in S and a morphism k: z* (G) > y* F(A) & FX y*(A) 
having unequal composites with F*y*(f) and F* y*(g), and then the transpose 
h: 2* L! (G) & LË z*(G) — y*(A) is the morphism we seek. In particular, if C has 
a separating family, then any indexed reflective subcategory of C has a separating 
family. 


In constructing further examples of separating families, we shall find the 
following lemma useful. 


Lemma 2.4.3 Suppose C is S-cocomplete and locally small. Then G € ob C! 
is a separating family for C iff, for every J € ob S and every A € ob C7, there 
is a span 


fer a L LELE, 


in S and an epimorphism e: Syx*(G) > A in C7. 
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Proof Suppose the condition holds. Then, for any f,g: A 3 B in CY with 
f #4, we have fe 4 ge, so we obtain y*(f)h Æ y*(g)h where h: 2*(G) — y*(A) 
is the transpose of e across the adjunction (X, 4 y*). 

Conversely, suppose G is a separating family. Given A, let (x,y): K > Ix J 
be the object indexing morphisms G — A, and h: x*(G) — y*(A) the generic 
such morphism in C*. Now, given f,g: A 3 B in C? with f Æ g, the def- 
inition of separating family says that we can find (2’,y’): K'—I xJ and 
h’: *(G) > y’*(A) with y/*(f)h’ 4 y’*(g)h’. But hk’ corresponds to a morphism 
z: K' — K in S such that (modulo canonical isomorphisms) z*(h) = h’; so this 
means that y*(f)hk 4 y*(g)h for any such pair (f,g). Hence, if we transpose h 
across the adjunction (Xy 4 y*), we obtain e: L,z*(G) — A such that fe £ ge 
whenever f Æ g, i.e. such that e is an epimorphism. 0O 


The possession of a separating family is not in general preserved under change 
of base, even when the object indexing the separating family lies in the image 
of the change-of-base functor. (Counterexamples can be given using 2.4.2(b): we 
shall see in Section B3.1 that there are geometric morphisms p: € — S for which 
the indexing of € over S induced by p* does not have a separating family.) The 
following result, though rather special, will be needed in the proof of the Adjoint 
Functor Theorem. 


Lemma 2.4.4 Let S be a category with finite products, I an object of S. Then 
the change-of-base functor Xf: Cats — Cats;; preserves the property of having 
a separating family. 


Proof Let C be an S-indexed category having a separating family G € ob C”. 
We shall show that the object 1*(G) of C7*! = (£C) O is a separating 
family for XC. Given an object u: K — I of S/I and an unequal parallel pair 
fig: A B in CË, we have a span 


2 au SS K 
in S and a morphism h: z*(G) — y*(A) such that y*(f)h # y*(g)h. Then 


z=(z,uy): L—Jx{ is a morphism uy — I*(J) in S/I, and we have 
z*ni(G) = x* (G); so 


roe =i 
is the required span in S/I. C 


It is customary, in textbooks on category theory, to present the Adjoint 
Functor Theorem as a criterion for the existence of left adjoints. However, we 
shall probably make more use of it (insofar as we use it at all) in constructing 
right adjoints; and so we shall present the dual of the ‘usual’ form of the theorem. 
Like other authors, we shall begin by considering a special case — the criterion for 
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existence of a terminal object —- and then deduce the general case by consideration 
of comma categories. 


Proposition 2.4.5 (The Indexed Special Terminal-Object Theorem) 
Let S be a cartesian category, and let C be a cocomplete, locally small, well- 
copowered S-indexed category with a separating family. Then C has an S-indezed 
terminal object (that is, the categories C? have terminal objects, and the 
transition functors x* preserve them). 


Proof It suffices to prove that C! has a terminal object; since the transition 
functors have left adjoints, they will preserve the terminal object if it exists. 

Let G € ob C7 be a separating family for C, and write A for the object £;(G) 
of C}. Let Q be the object indexing the quotients of A in C, and e: Q*(A) —» A’ 
the generic quotient of A in C. In C°*@ we have a generic pair of quotients 
my} (e) and 13(e); we may form their cointersection (= pushout), and classify this 
by a third morphism z: Q x Q > Q in S. Let Q, — Q be the equalizer of z 
and 19; then (by the dual of an argument familiar for subobjects; cf. A1.6.3) 
Q = (Qı 3 Q) has the structure of an internal poset in S. Moreover, the 
object A’ of C? acquires the structure of an internal diagram (A', a) in CÊ, in 
an obvious way. And the epimorphism Q*(A) => A’ becomes a morphism from 
the constant diagram Q*(A) to (A’,a) — which is again epic, since the forgetful 
functor C2 — C9 is faithful. 

Let T be the object limg(A’, a) of C'. Since Q is connected (because it has 
an initial object, corresponding to the largest quotient 14: A —> A), we have 
lim gQ*(A) = A; and since limg, being a left adjoint, preserves epimorphisms, 
we have an epimorphism h: A —> T in C}. Moreover, the transpose of the canoni- 
cal epimorphism Ug(A’) —> T (the ‘colimiting cone’) is a morphism A’ — Q*(T) 
in C®, which is easily seen to make the diagram 


Q*(A) er A’ 


Q*(h) 
Q*(T) 
commute; so h: A — T is the unique smallest quotient of A. 
We claim that T is a terminal object of C!. Suppose B is any object of C!; then 


by 2.4.3 we have a morphism z: J — I in S and an epimorphism 1 yz*(G) > B 
in C!. But Dyz*(G) & Hy U,2*(G), so we may form the pushout 


uz" (G) B 


m(G) C 
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where the left vertical map is induced by the counit of (XZ, 4 z*). Since C is a 
quotient of ©;(G) = A, it admits a morphism to T; so B admits a morphism 
to T. But if we had two parallel morphisms f,g: B 3 T, then their coequalizer 
T —> T’ would be a quotient of T, and hence a quotient of A contained in T. So 
T -» T’ must be an isomorphism, and hence f = g. O 


Theorem 2.4.6 (The Indexed Special Adjoint Functor Theorem) Let 
S be a cartesian category; let C and D be S-indexed categories which are locally 
small and cocomplete, and suppose further that C is well-copowered and has 
a separating family. Then an indeed functor F:C — D has an indexed right 
adjoint iff it is cocontinuous. 


Proof One direction is just (the dual of) 1.4.14. Conversely, suppose F is 
cocontinuous. To construct an indexed right adjoint R for it, it suffices in fact 
to construct an ordinary right adjoint R’:D! —C! for each F‘; for the fact 
that the F7 commute up to isomorphism with the left adjoints ©, of the tran- 
sition functors will ensure that the R? form an indexed functor (cf. the proof 
of 1.4.14). 

Consider first an object B of C!. To construct R!(B), we need to find a 
terminal object of the (ordinary) comma category (F! |B), as is well known. 
But this comma category is the fibre over 1 of an ‘indexed comma category’ 
which we shall denote by (F | B), and whose fibre over J is the ordinary comma 
category (F7 | J*(B)); the transition functors are induced in the obvious way 
by those of C. We shall show that (F | B) satisfies the hypotheses of 2.4.5, 
and so has an indexed terminal object; in particular, (F'!| B) has a terminal 
object. 

First we must verify that (F | B) is cocomplete. The existence of finite colim- 
its in the fibres (F7 | J*(B)), and the fact that the transition functors preserve 
them, follows straightforwardly from the fact that the C’ have finite limits, and 
the F/ and the transition functors of C both preserve them. So we need only con- 
struct the left adjoints ©, for the transition functors of (F | B), and verify that 
they satisfy the Beck—Chevalley condition. Given x: I — J in S and an object 
(A,g: F'(A) > I*(B)) of (F7 | I*(B)), we define ©,(A,g) to be (E2(A), h), 
where h is the composite 


Xz(g9) ? 
F!3,(A) S E F(A) ———> Erl’ (B) S E,2*J*(B) —> J*(B) 


(the last arrow being the counit of (Zz 4 2*)). It is straightforward to verify 
that this object has the right universal property, and that the Beck—Chevalley 
condition holds. 

Next, we verify local smallness. Given two objects (A,g) and (A’,g’) of 
(F! | I*(B)) and (FY | J*(B)) respectively, we have an object (x,y): K > Ix J 
of S/I x J indexing morphisms from A to A’ in C. Let f: 2*(A) — y*(A’) be 
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the generic such morphism; then in D* we have a (non-commuting) triangle 


Fraa) ZO KB) 


FR 
n y*(9') 
F¥y*(A’) 
(where we have suppressed the names of canonical isomorphisms). But equality of 
morphisms is definable in D, by 1.3.15; so we have a monomorphism z: K’ = K 
which is universal among morphisms u: L — K such that u* maps the above 
diagram to a commuting triangle. Then it is easy to see that (rz, yz): K’ ~ Ix J 
indexes morphisms from (A, g) to (A’,g’) in (F | B). 

The argument for well-copoweredness is similar. For any J, the forgetful func- 
tor (F! |J*(B)) — C! preserves and reflects epimorphisms, since F’ preserves 
them; so, given an object (A, g) of (F! | I*(B)), we have simply to find an object 
indexing those quotients A —» A’ of A for which g factors through FA —> FA’. 
If x: J — I indexes quotients of A in C, this will (in general) not be the case for 
the generic quotient e: z*(A) = A’ in C7; but we can form the pushout 


F7z*(A) A J*(B) 


F! (e) h 


F/7(A') —— C 
and then, using the fact that invertibility is definable in D (1.3.15 again), form 
the subobject y: J’ => J which is universal among morphisms z such that z*(h) 
is an isomorphism. Since the transition functors of D preserve pushouts, it follows 
easily that zy: J’ — I indexes quotients of (A, g) in (F | B). 

Finally, we must verify that (F | B) has a separating family. Let G € ob C7 
be a separating family for C, and let (x, J): J — I x 1 be the object indexing 
morphisms from F/(G) to B in D. Let g: F’x*(G) & z* F1(G) — J*(B) be the 
generic such morphism; then we claim that (x*(G), g) is a separating family for 
(F | B). To see this, let (A, h) be an object of (F* | K*(B)) for some K; then 
by 2.4.3 we have a span 


| a RS 


in S and a morphism f: y*(G) — z*(A) whose transpose U,y*(G) — A is an 
epimorphism. Now the composite 


L * 
y* F1(G) & F¥y*(G) EAD F”z*(A) = z*F*(A) a z*K*(B) = L*(B) 
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defines a morphism w: L — J such that zw = y and the composite above is 
(isomorphic to) w* (g); so we have w*(z*(G), g) = (y* (G), w*(g)) and f becomes 
a morphism from this object to z*(A,h), whose transpose is epimorphic. 

Thus we may apply 2.4.5 to (F | B), to obtain a terminal object (R? (B), ez) 
of (F? | B). If we do this for all objects B of D}, it is well known that R! becomes 
the object-map of a functor D! — C1, such that e€ is the counit of an adjunction 
(F? 4 R!). To obtain the right adjoint R? of F7, for a general object J of S, 
we simply repeat the whole argument in the context of S/J-indexed categories, 
using the fact that the change-of-base functor along &£r: S/I — S preserves all 
the properties used in the hypotheses of the theorem, by 1.4.9, 2.2.6(ii), 2.2.7 
and 2.4.4. 0 


Remark 2.4.7 We have seen in 2.4.2(c) that a small category C always has a 
separating family. It is also locally small, as we observed before 2.3.5; it need not 
be well-copowered in the sense in which we defined this notion in 1.3.14, but it 
is easy to see that if C has S-indexed pushouts then the concept of an individual 
epimorphism in C with given domain (as opposed to an isomorphism class of 
such epimorphisms, i.e. a quotient object) is comprehensible, since a morphism 
is epic iff the two morphisms in its cokernel-pair are equal, and C has definable 
equality by 1.3.15. An examination of the proofs of 2.4.5 and 2.4.6 will show that 
this is sufficient to make them work; so we deduce that if C is an S-cocomplete 
small (i.e. internal) category in S, then every cocontinuous functor from C to a 
locally small cocomplete S-indexed category D has a right adjoint. 


As an application of 2.4.7, we prove a result promised after 2.3.9. 


Corollary 2.4.8 Suppose S is (properly) cartesian closed. Then an internal 
category C in S is S-complete iff it is S-cocomplete. 


Proof Suppose C is S-cocomplete. To prove that C is S-complete, it suffices 
to construct S-indexed right adjoints for the diagonal functors A: C — [D, C] 
for each finite category D, and for the indexed transition functors z*: C? — C/ 
induced by each morphism x: J — I of S. (The Beck-Chevalley condition for 
the right adjoints to the z* follows from that for their left adjoints, by 1.4.6.) 
It is easy to see that [D,C] is cocomplete if C is (the left adjoints to its transi- 
tion functors being defined ‘pointwise’) and that A is cocontinuous; also, [D, C] 
is small (and hence locally small) if C is, since we can construct its object of 
objects and object of morphisms using appropriate finite limits in S. Similarly, 
C! is S-cocomplete if C is, by 1.4.9; and the proof of this fact again makes it 
obvious that the indexed transition functors z* are cocontinuous. The point at 
which we use the cartesian closedness of S is to ensure that C7 is small if C 
is: of course, its object of objects and object of morphisms are the exponen- 
tials (Co)? and (C,)!. So the existence of the required right adjoints follows 
from 2.4.7. o 


Suggestion for further reading: Paré & Schumacher [933]. 
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B2.5 Discrete opfibrations 


The remaining three sections of this chapter are devoted to further study of 
diagram categories, as defined in 2.3.11, which will be needed for the proof of 
Diaconescu’s Theorem in Section B3.2. From now on, we shall tend to use the 
notation [C, D] rather than D© for diagram categories, since we are thinking 
of them mainly as ‘hom-categories’ in the 2-category Cats, rather than as the 
‘C-component’ of an indexing of D over Cat(S). 

In this section we investigate an alternative way of looking at the diagram 
categories [C, S], where C is an internal category in a cartesian category S. We 
begin with a useful notational convention regarding pullbacks over Co. Since C 
comes equipped with two morphisms to Co, the usual ‘fibre product’ notation 
Ci xc, A (where A — Co is another object of S/Co) is potentially ambiguous; 
but, if we regard dp and d; as proceeding respectively ‘leftwards’ and ‘rightwards’ 
from C4, as in the picture 


Ch 


do dı 


Co Co ) 


then it makes sense to agree that (unless explicitly stated otherwise) when Ci 
appears on the left of the symbol xc, it denotes the object dı of S/Co, and when 
it appears on the right it denotes the object dp. Thus, for example, Ci xc, Ci 
is another name for C2, since we have a pullback square 


dy 
C2 ——> Cı , 
do do 
dı 


Cı — Co 


and more generally the n-fold fibre product Cy xc, Ci XCy**: XCy Ci denotes Ch. 
(Accordingly, extending our convention, (—) xc, Cn signifies that the structure 
map Ca — Cp is to be taken as d}d?---d?, and Cn xc, (—) that it is to be 
taken as did? --.d". Of course, there are (n — 2) other morphisms from C, to Co 
built into the structure of C, but we shall rarely need to use these when forming 
pullbacks.) 
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By definition, a diagram of shape C in S consists of an object (f: F — Co) 
of S/Co, equipped with a morphism 


ETE, a ee 


NZ 


in S/C, which is unital and associative. Since S has S-indexed coproducts, we 
can equivalently regard the structure as a morphism ¢ = m1ġ: Ci xc, F > F 
making 


Ce ne 


Ti f 


do 
Ci ————> Co 


commute (we think of this as a left action of C on F); and, of course, the 
projection 72: C1 xc, F — F makes 


T2 
ar C SF 


Ti f 


d 
Cpe SC 


commute. Further, the unital condition on ¢ says that 


sof, 1 $ 
pep EE 


is the identity, and the associativity says that the two composites 


lx@ $ 
Ci Xc Ĉi M es Xo, E ———> F 
d x1 


are equal. It follows that if we define Fp = F and Fp = Cn xc, F for n > 1, then 
n> Fn has the structure of an internal category F, with d?: Fa — F,_, taken 
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to be d? x lp: C, xq, F —> Cnrn-1 XC F for 0 <i < n, d? taken to be 


1x@ 
Ch XCo Fo Ch-1 XCo Ci XCo F ane Cn-1 XCo F s 


and s? similarly taken to be s7 x lp. Moreover, if we define fo = f and 
fn = T1: Cn XC, F —> Chn for n > 1, we obtain an internal functor f: F > C. 

Of course, what is going on here is just the internal version of the 
Grothendieck construction which we met in A1.1.7, and again (in a more gen- 
eral setting) in Section B1.3. Accordingly, we now introduce the fundamental 
definition of this section: 


Definition 2.5.1 A discrete opfibration in a cartesian category S is an internal 
functor f: F — C for which 


dı 
Fi ——> Fo 
fi fo 
dı 


Cı ———> Co 


is a pullback. Similarly, we call f a discrete fibration if 


do 
Fı ———> Fo 


fi fo 


Ci ———> Co 
is a pullback. 
Internalizing 1.3.11(iii), we have 


Lemma 2.5.2 Let f:C—D and g: D— E be internal functors in S, such 
that g is a discrete opfibration. Then f is a discrete opfibration iff the composite 
gf is. 

Proof This follows from well-known properties of pullbacks. oO 


Proposition 2.5.3 Let doFib(S) denote the category of internal categories in 
S and discrete opfibrations between them. Then, for any C, the slice category 
doFib(S)/C is equivalent to [C, S]. 


Proof The discussion before Definition 2.5.1 shows how each diagram 
(F,¢) of type C in S gives rise to a discrete opfibration F — C. Con- 
versely, a discrete opfibration f: F — C may be reconstructed, up to canonical 
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isomorphism, from the object fo: Fo — Co of S/Cpo, together with the morphism 
do: C1 XC, Fo = Fi — Fo, and the latter is readily seen to be a (unital and asso- 
ciative) left action of C on Fp. It is further easy to see that morphisms of diagrams 
(F, ġ) — (G, Y) correspond under these constructions to internal functors F — G 
over C; but by 2.5.2 all such functors are discrete opfibrations. o 


Of course, discrete fibrations over C similarly correspond to diagrams of shape 
CP in S (which we may further identify with objects (f: F — Co) of S/Co 
equipped with a right action F xc, Cı — F). It is also possible to define the 
notion of (internal) split (op) fibration in an arbitrary cartesian category S — as we 
remarked in Section B1.3, the theory of split fibrations is essentially algebraic — 
and to show that split opfibrations over C correspond both to diagrams of shape 
C in Cat(S), and to internal categories in [C, S}. However, we shall not pursue 
the details here. 

Instead, we next consider some examples of discrete opfibrations which will 
be of importance in our subsequent discussions. 


Examples 2.5.4 (a) If 1 denotes the terminal object of Cat(S), it is easy 
to see that the unique internal functor F — 1 is a discrete opfibration iff F is 
a discrete category. Thus the equivalence doFib(S)/1 ~ S! of 2.5.3 identifies 
objects of S with discrete internal categories, as we did in 2.3.12(a). More gen- 
erally, for any internal category C and any object IJ of S (regarded as a discrete 
internal category), the projection C x J — C is both a discrete fibration and a 
discrete opfibration; it is easy to see that this corresponds to the constant dia- 
gram of shape C with value J, ie. the object CI of S/Co equipped with the 
structure map 


1 
HOt = OT =-s CH 00I: 


(b) For any internal category C in S (or, more generally, for any simplicial 
object), the shift or décalage Dec; (C) of C is the simplicial object obtained from 
C by shifting dimensions by 1 and dropping the zeroth face and degeneracy 
maps: that is, Deci(C)n is Cr41, d?: Deci(C)n — Deci(C)n—1 is defined to be 
dp: Cn+1 > Cp, and similarly for the s?. It is easy to check that the simplicial 
identities for Dec; (C) follow from those for C, and that Dec,(C) is an internal 
category if C is. Moreover, the omitted face maps dit!: Crs; — Cn form the 
components of a morphism of simplicial objects Dec,(C) — C, which is a discrete 
opfibration if C is an internal category, since 


dz 
C2 —————> C 


Ci — Co 
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is a pullback. The corresponding diagram of shape C in S is of course the object 
do: Cy — Co of S/Co, equipped with the left action dy: C1 xc, Ci = C2 > Ch. 
We shall also write Dec!(C) for the simplicial object with the same objects of 
n-simplices as Dec: (C), but with the last face and degeneracy maps thrown 
away; i.e. d?: Dec'(C)n — Dec'(C)n—1 is defined to be d?+?: C,41 — Cn, and 
so on. In this case, when C is an internal category, the omitted face maps 
EE Cn+1 —> Cn form a discrete fibration Dec! (C) — C. And we shall write 
Dec” (resp. Dec,) for the nth iterate of the functor Dec! (resp. Dec,) from 
[A°P, S] to itself, i.e. the operation which shifts dimensions by n and throws 
away the last, (resp. first) n face and degeneracy maps. 

(c) More generally than (b), if we are given any object z:I — Cp of 
S/Co, we may construct an internal category R(x) equipped with a dis- 
crete opfibration r: R(x) —C, by setting R(t)n = Cn41 Xc, I, with 
Tn = dgt*m: Cn41 XC I —> Cn, and the face and degeneracy maps of R(x) 
induced by those of Dec, (C). (Thus Dec; (C) itself is R(1c, ).) The letter R stands 
for ‘representable’: if S = Set and x: 1 — Co is a particular object of a small 
category C, then R(x) is (the discrete opfibration corresponding to) the repre- 
sentable functor C (x, —): C — Set. In general, recalling from 2.3.16 that S© is 
monadic over S/Cp (the functor part of the monad being Xa di = Ci xc, (—)), 
it is easy to see that R is the free functor S/Co — [C, S], i.e. the left adjoint 
to the forgetful functor. Note also that all the structure maps of R(x) are 
morphisms over I; thus we may regard it equivalently as a diagram of shape 
C in S/I, or as an J-indexed family of objects of the S-indexed category S© 
(ef. 2.3.12(a)). 


Now suppose that S has coequalizers of reflexive pairs (cf. A1.2.10). For 
any internal category C in S we define moC, the object of components of C, to 
be the coequalizer of (do, d,): C1 3 Co (note that this pair is reflexive, with 
common splitting given by so). It is clear that mo is a functor Cat(S) — S; 
and it is not hard to verify that it is left adjoint to the functor which sends an 
object of S to the corresponding discrete internal category. More interestingly, 
we have 


Lemma 2.5.5 Let S and C be as above. Then the left adjoint lime of 
C*: S — [C, S] may be described in terms of discrete opfibrations as the functor 
(f:F > C) mF. 


Proof By definition, C* I is the object (Co)*I of S/Co equipped with the iden- 
tity morphism on (C})*J as its structure map. But by 2.5.4(a) this corresponds 
to the discrete opfibration 12: I x C — C; that is, C* may be identified with 
the composite of the ‘discrete’ embedding S — Cat(S) and the pullback functor 
C*: Cat(S) — Cat(S)/C. Composing the left adjoints mo and Uc of these two 
functors yields the description above. QO 
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We note that 7Deci(C) & Co, since we have a split coequalizer diagram 


dy E 
eae Oe SG 
dz 


whose splittings are provided by the degeneracy maps s and sj omitted in the 
definition of Dec,(C). More generally, for any object x: I — Co of S/Co we have 
moR(x) = I, since the above diagram together with its splittings is preserved by 
the functor (—) xc, J. Of course, this latter fact could also have been deduced 
from the description of R as the left adjoint of the forgetful functor S? + S/Co; 
for its composite with the left adjoint of C* must yield the left adjoint Uc, of C5. 


Corollary 2.5.6 Suppose coequalizers of reflexive pairs are stable under pull- 
back in S. Then, for an internal category C in S, the functor C*: S — [C, S] is 
full and faithful iff C is connected, i.e. oC = 1. 


Proof The hypothesis on S implies that Cı x I = Co x I — mC x Iisa 
coequalizer for any object I of S, ie. limcC*I & moC x I. So toC = 1 implies 
that the counit of the adjunction (limc 4 C*) is an isomorphism, the converse 
is just the special case J = 1 of this isomorphism. O 


The following result is the ‘internal’ analogue of the result that every functor 
from a small category to Set is expressible as a colimit of representable functors. 


Lemma 2.5.7 Let S be a cartesian category with coequalizers of reflexive pairs 
which are stable under pullback. Let C be an internal category in S, and let F 
be a diagram of shape C in S. Then there exists a diagram G of shape F°P in 
[C, S] (where y: F — C is the discrete opfibration corresponding to F) such that 
lim por (G) S F, and such that the image of G under the forgetful functor 


IF”, [C, S] — [C, 51° 


is R(70). 


Proof In addition to its left C-action, the object Ci xc, Fo = Fi of S/Co x Fo 
comes equipped with a right action of F, given by the composition map 
dŒ: Fy Xp, Fi — Fi; the two actions commute with each other in the appro- 
priate sense (this is just the associativity of composition in C), and so we may 
regard it as a diagram of shape F? in [C, S], or equivalently as a diagram of 
shape C in [F°?, S]. (In the notation which we shall introduce in Section B2.7 
below, G is the profunctor ye: F + C.) Now the diagram 


dg d} 
Po O3 F, ———> h 
di 
is a split coequalizer (the splittings being given by s8 and sį) in S/Co; and the 
maps in it are all equivariant for the left actions of C on the three objects. Also, 
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the hypotheses on S ensure that the forgetful functor [C,S] — S/Co creates 
coequalizers of reflexive pairs; so the diagram remains a coequalizer in |C, S}. 
But it is precisely this coequalizer which we have to compute, by 2.5.5, in order 
to find limpo (G). o 


Classically, it is well known that the functor category [C°P, Set] is the ‘free 
cocompletion’ of a small category C. The next result is the analogue for internal 
categories. 


Corollary 2.5.8 Let S be as in 2.5.7, let C be an internal category in S and 
let D be a cocomplete S-indexed category. Then the category |C, D] is equivalent 
to the category of cocontinuous S-indexed functors [C°?,S] — D. 


Proof The object (do, di): Cı —> Co x Co of S/Co x Co, equipped with both 
left and right actions of C in the obvious way, may be regarded as a diagram of 
shape C in [C°P, S]. So, given any indexed functor F: [C°P, S] — D, we obtain a 
diagram of shape C in D by applying [C, F] to this diagram. Conversely, suppose 
given a diagram G of shape C in D, which we may think of as an indexed functor 
g: [C] — D via the equivalence of 2.3.13. We have to produce a cocontinuous 
functor F: [C°P,S] — D which agrees with g on representables; but since every 
object E of [C°P, S] is a colimit of representables by 2.5.7, there is only one 
possible way to define this functor. Specifically, we let y: E — C be the discrete 
fibration corresponding to E, and then define F(E) to be the object limg(y*G) 
of D. The extension of this definition to an S-indexed functor, the verification 
that it is cocontinuous, and the proof that the two constructions are inverse to 
each other up to natural isomorphism, are entirely straightforward. O 


It will be recalled that in 1.4.16 we gave a construction for freely adjoining 
S-indexed coproducts to an S-indexed category. If we apply this to a cate- 
gory of the form [C], we obtain a category whose -indexed families of objects 
are simply objects of S/I x Co, and whose -indexed families of morphisms 
(x: J +I x Co) > (y: K — I x Co) are pairs (f,g) where f: J — K is a mor- 
phism over J (though not necessarily over J x Co) and g: J + I x Cı satisfies 
(1 x di)g = x and (1 x do)g = yf. If we freely adjoin reflexive coequalizers to 
this category, then we should expect to obtain the free S-cocompletion of [C], 
by the remark after A1.2.10; it is not hard to verify directly that this process 
would yield an indexed category equivalent to [C°P, S], so giving an alternative 
proof of 2.5.8. 

Lemma 2.5.5 can be generalized to yield a description of the left adjoint lim + 
of f*: [D, S] — [C, S] for an internal functor f: C  D in S. First we show 


Lemma 2.5.9 If S is a cartesian category with reflexive coequalizers which 
are stable under pullback, then doFib(S)/C is reflective in Cat(S)/C, for any 
internal category C in S. 
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Proof Let f:D-—C be an internal functor, and form the coequalizer 


(dy (1 x fi), dima) q 
Cy Xo, Di LS Ci Xc Do ——> Eo 
1 x do 


dot 
dot, eo 


Co 


in S/Co. Note that the parallel pair in this diagram is reflexive, with common 
splitting given by 1 x so; in fact we can make the assignment 


n> (dom: Cy Xo, Dn > Co) 


into an internal category Cı » D in an obvious way, and so eg: Eo — Co is 
mo(C; x D). We claim first that e9 can be given the structure of a discrete 
opfibration over C. For the hypothesis on coequalizers in S ensures that 


Ci X Co Ci Xc Di z Ci XCo Ci XCo Do ——> C; XC Lo 


Cı 


is a coequalizer in S/C; now the morphisms d, x 1: Cy xc, Ci Xc Di > 
Cı Xc Dı and dy x 1: Cy Xe, Ci Xc, Do —> Ci Xc, Do induce a morphism 
€: Cy XCo Eo — Eo making 


Ci X Co Eo MaE Eo 


Wy €o 


do 
Cı —————> Co 


commute. Together with the second projection Ci xc, Eo — Eo, this morphism 
forms part of an internal category structure on (n ++ Cn Xc, Eo), which we 
denote by E, and eg becomes the object-map of a discrete opfibration E — C. It 
is clear, too, that 


(f:D>C) = (e:E >C) 
is a functor Cat(S)/C — doFib(S)/C. 
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Next, we observe that the composite 


S89 X 1 q 
Do = Co X Co Do ——> C1 XCo Do — > Ep 
is the object-map ro of an internal functor r: D — E over C, where r,, is given by 


fn XTo 
Dn = Dn XD, Do ———> Cn Xo, Eo = En. 


And r defines a natural transformation from the identity functor on Cat(S)/C 
to the functor just described. Finally, if f itself is a discrete opfibration, then 
the split coequalizer diagram 


dı 
DE s Dy zd Do 
do 


(whose splittings are given by s9: Do — Dı and sı: Dı — D2) is identified by 
the isomorphisms Dı = Cı xc, Do and D2 = Ci Xc, Di with the coequalizer 
defining Eo; thus we conclude that ro: Do — Eo is an isomorphism, and hence 
that r: D — E is an isomorphism. It follows that, for a general f: D — C, r is 
universal among morphisms from f to discrete opfibrations over C. 0 


Now, if f:C—D is an internal functor, it is readily seen that 
f*: [D,S] — [C,S] may be described in terms of discrete opfibrations as the 
pullback functor f*: Cat(S)/D — Cat(S)/C, restricted to the latter. (It is easy 
to verify directly that a pullback in Cat(S) of a discrete opfibration is a discrete 
opfibration.) Thus we have 


Corollary 2.5.10 Under the hypotheses of 2.5.9, the left Kan extension func- 
tor lim: [C, S] — [D, S] may be described in terms of discrete opfibrations as 
the composite 


5 
doFib(S)/C —> Cat(S)/C Ó> Cat(s)/D “> doFib(s)/D 
where L is the reflection functor of 2.5.9. g 


Remark 2.5.11 Given two internal functors f:C — D and g: B — D, it is 
clear that we may construct the comma category (f |g) in Cat(S) (its object 
of objects is Bo Xp, Di Xp, Co, and its object of morphisms is an appropriate 
subobject of Bı x Cı), and prove that it has the universal property of a comma 
object (cf. 1.1.4(c)) in Cat(S). Using the description of left Kan extensions pro- 
vided by 2.5.10, it is straightforward to verify that, if S satisfies the hypotheses 
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of 2.5.9, we get a commutative square 


[C, S| ———> D, 5] 


p* 
lim 


[(f Lg), 5] ——> B, sS] 


where p and q are the projections. In other words, S is pointwise A-cocomplete 
in CALs in the sense of 1.1.16(b), where A is as before the class of all functors 
between internal categories in S. If S is also locally cartesian closed, then by 
taking right adjoints of the functors in the above diagram we may also conclude 
that S is pointwise A-complete. 


We say an internal functor f : D — C is initial if its reflection in doFib(S)/C 
is the terminal object 1c; equivalently, if the diagram 


(d\(1 x fi), dim) doTi 
Cı xo, Di OLL Ci xe, Do ——> Co 


1 x do 


is a coequalizer. It is easily seen from the composition properties of discrete 
opfibrations (2.5.2) that the unit of the adjunction of 2.5.9 is always initial; in 
fact, we have shown that an arbitrary internal functor can be factorized uniquely 
(up to isomorphism) as an initial functor followed by a discrete opfibration. In 
the case S = Set, this factorization structure on Cat is sometimes known as the 
comprehensive factorization (cf. [1139]). 

There is also an important connection between final functors and discrete 
opfibrations (or equivalently, between initial functors and discrete fibrations). 
Of course, we say f:D — C is final if it is initial as a functor D°P — CP; 
. equivalently, if 


(dom, di( fi x 1)) d\72 
Dy xe, Cy ———————— 3 Dp xa, C1 — > Co 
dı X1 


is a coequalizer. (Traditionally, final functors were called ‘cofinal functors’; but 
this use of ‘co’ is potentially misleading as it has nothing to do with dualization — 
it is derived from the Latin ‘cum’ rather than ‘contra’ — and so it is now generally 
omitted.) 


Proposition 2.5.12 Suppose S is cartesian and has reflexive coequalizers 
which are stable under pullback. Then an internal functor f:D—C in S is 
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final iff the canonical natural transformation in the diagram 


f* 


IC, deme aei S] 


A 


(the ‘mate’ of the natural isomorphism D* = f*C*) is an isomorphism. 


Proof First suppose the diagram commutes up to isomorphism. Recall from 
2.5.4(b) that we have a particular discrete opfibration dp: Decı (C) — C, and that 
noDec:ı(C) = Co. But if we pull back Dec,(C) along f we obtain the discrete 
opfibration over D whose domain is the internal category 


(dom, di(fi x 1)) 
Dy xo, Cy SS Do XQ C1; 
dı X 1 


so the condition implies that the coequalizer of this pair must be 
dina: Do X Co Ci — Co. 

Conversely, suppose f is final. Then an easy diagram-chase shows that, for 
any discrete opfibration e: E — C, the functor q in the pullback 


F E 

D TE ees C 
is also final; for all the objects and morphisms in the coequalizer diagram 
for q are obtained simply by pulling back those in the diagram for f along 
eo: Eo — Co. It therefore suffices to show that if f is final then the canonical 
morphism moD — mpC induced by f is an isomorphism. Suppose u: Dp > A 
coequalizes (do, d1): Dı 3 Do; then um: Do Xc, Cı — A coequalizes the pair 


appearing in the definition of finality, and so we obtain a unique v: Co > A 
satisfying vdı Tə = un. Moreover, from the commutativity of the diagram 


(1, sofo) 


us 
Do ——————— Do Xo C1 ae Do 


ai es 


dt u 


e EERE | 
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we see that we necessarily have vfo = u; hence v coequalizes (do, di): Ci 3 Co 
and so factors uniquely through Cp — a C. Conversely, if w: Co — A coequalizes 
dg and dı, then we necessarily have wdın2 = wfom; thus we may show that 
the composite Do — Co —> moC has the universal property of a coequalizer of 
Dı 3 Do. g 


We remark that 2.5.6 is a special case of 2.5.12, since C — 1 is a final functor 
iff C is connected. 


B2.6 Filtered colimits 


In this and the next section, we shall develop two rather specialized tools of 
internal category theory, both of which will play crucial rôles in the proof of 
Diaconescu’s Theorem in Section B3.2: the calculus of filtered internal colimits, 
and the theory of internal profunctors. Both of these can be developed in a 
context considerably more general than that of a topos (which is why they belong 
here, rather than in Chapter B3): our standing hypothesis for the present section 
will be that our ambient category S is regular and has coequalizers of reflexive 
pairs. 

We saw in 2.3.22 that, for any internal functor f: C — D in a topos S, the 
functor f*:[D, S] — [C,S] is the inverse image of a geometric morphism. On 
the other hand, when S = Set it is well known that there are important special 
cases when f* is a direct image functor, i.e. when its left adjoint lim ș is cartesian 
(see, for example, A4.1.10). In this section, we shall consider the corresponding 
question over a general base category S. 

One case when f* is a direct image functor is when f has a right adjoint 
g: D — C in €at(S); for, since the assignment C ++ S© is a 2-functor €at(S) —> 
Yop/S, it preserves adjunctions, from which we deduce that f* is right adjoint 
to g* — equivalently, that g* is isomorphic to lim. In particular, if C has a 
terminal object (which may be defined, for an internal category C, as a morphism 
t: 1 — Co for which there exists a morphism u: Co — C, satisfying diu = lc, 
and making 

Co ————> 1 


do 
Cı ——> Co 

a pullback), then the unique internal functor C — 1 has a right adjoint, and so 
lime is cartesian. However, from our experience of the case S = Set, we know 
that this sufficient condition is not necessary; in the Set-based case, lime is 
cartesian iff C is a filtered category. 

Accordingly, our first task in this section is to formulate the notion of fil- 
teredness for an internal category in S. It turns out that the internal logic of a 
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topos is considerably more than we need in order to do this: the axioms defining 
filteredness all lie within the regular fragment of first-order logic (cf. D1.1.6), 
and so can be interpreted in any regular category. 

We begin by defining a number of objects associated with an internal 
category C. 


Definition 2.6.1 
(a) Pc, the object of pairs of morphisms of C with common codomain, is defined 
by the pullback 


Pp 
Pi 


P2 do 


do 
Ci ———> Co 


(Note that, because of our convention regarding the symbol xc, introduced 
in Section B2.5, we cannot denote Pe by Ci Xc C1.) 


(b) Qc, the object of pairs with common domain, is defined by the pullback 


qı 
Qc —— 0 


q2 dı 


dy 
Ci ——-—>> Co 


(c) Re, the object of parallel pairs in C, is the intersection of Pc and Qc in 
Sub(C, x C1), or equivalently the pullback 


ies SO; 
T2 (do, dı) 
(do, dı) 


Cı ———> Co x Co 
(d) Sc, the object of commutative squares in C, is defined by the pullback 


S 
E g 
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(e) Tc is defined by the pullback 


ty 
Te ———> C2 ; 


t2 (do, dı) 


(do, dı) 
Cz —— Cı x CY 


intuitively, Te is the object of diagrams of type T in C, where T is the 
finite category 


with fg = fh. 


We shall normally omit the subscript C from the names of the above objects 
when we are working with a single internal category C. 

If we now translate the usual axioms for filteredness of a small category (‘there 
exists an object’, ‘for any pair of objects, there exists a pair of morphisms from 
them with common codomain’, and ‘for any parallel pair of morphisms, there 
exists a morphism coequalizing them’) via the standard interpretation of regular 
logic in a regular category (cf. D1.2.6), we arrive at the first half of the following 
definition: 


Definition 2.6.2 
(a) An internal category C in a regular category S is called filtered if 
each of the three morphisms Co — 1, (dıpı,dıp2): P + Co X Co, and 
(dati, d2t2): T — R is a cover. 
(b) We say C is weakly filtered if the above morphism T — R and the morphism 
(d281,d282): S — Q are both covers. 


- (Note: in this definition we have adopted the custom of naming morphisms 
‘ into a pullback by the names of morphisms into the product of which the pullback 
is (canonically) a subobject. For example, (dt), d2t2) is really a morphism T — 
Cı x Cy, but it can easily be shown to factor through R — C x Ci.) 

We remark that if C is a preorder (i.e. if (do, di): Cy — Co x Co is monic), 
then the condition that T — R should be a cover is redundant, in both halves 
of Definition 2.6.2: for in this case the diagonal morphisms Cı — R and C2 —> 
T are isomorphisms, and (d2t1,d2t2) is identified with the split epimorphism 
d2: Cz — Cı. For preorders, we commonly use the term directed as a synonym 
for filtered. 

We also remark that the assertion that an internal category C is filtered does 
not imply the filteredness of the fibres of the corresponding indexed category 
[C]; however, it is equivalent to the assertion that, given any finite diagram D 
in some fibre [C]’ of this indexed category, there exists a cover x: J —> J and a 
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cone under the diagram x*(D) in [C]. (Weak filteredness is equivalent to the 
same assertion restricted to connected finite diagrams.) 

In the proofs of most of the following results, we shall argue informally in 
terms of elements; that is, we shall prove the assertions in regular logic, and rely 
on the soundness of the standard interpretation of this logic in a regular category 
(D1.3.2) to deduce that they hold in S. 


Lemma 2.6.3 A filtered category is weakly filtered. 


Proof Suppose given elements z, y of Ci satisfying dix = diy. By the second 
condition in the definition of filteredness, we can find u,v such that dou = dov, 
diu = doz and div = doy. Now the pairs (u, x) and (v, y) belong to C2, and the 
pair (dı (u, £), dı (v, y)) belongs to R, so we can find w with diw = dou = dov and 


dı (w, dı (u, £)) = dy(w, di(v, y)) . 


But the latter equation is equivalent to dı (dı (w, u), z) = di(di(w,v),y); so the 
elements (dı (w,u),z) and (di(w,v),y) of Cz define an element of S mapping 
onto the given element (z, y) of Q. g 


Lemma 2.6.4 IfC is filtered, then it is connected (i.e. noC S 1). 


Proof We know that 7oC — 1 is a cover, since Cg — 1 factors through it. But 
if x,y are any two elements of Co, we can find u,v in Ci with dyu = z, div = y 
and dou = dov; so if q: Co —> ToC is the quotient map we have 


qx = qdıu = qdou = qdov = qdiv = QY . 


Thus the kernel-pair of q is the whole of Co x Co, and hence 7C — 1 is monic. 
0 


Proposition 2.6.5 In an effective regular category S, an internal category C 
is filtered iff it is weakly filtered and connected. 


Proof One direction is immediate from the last two lemmas. Conversely, sup- 
pose C is weakly filtered and connected; then clearly Co — 1 is a cover, so it 
only remains to verify the second condition of 2.6.2(a). For this, let I — Co x Co 
be the image of the given morphism P — Cp x Co; we claim first that J is 
an equivalence relation on Co, as defined in A1.3.6. The reflexivity and sym- 
metry of J are clear from the definition of P; and transitivity follows from the 
weak filteredness of C, since if we are given elements x, y,z,w of Cı satisfying 
dix = dıy, doy = doz and d,z = dyw (so that the pairs (doz, doy) and (doy, dow) 
are both related by 7), then the pair (y,z) belongs to Q and we can find u,v 
with diu = div, dou = dıy and dov = dz (and d;(y,u) = dı (z, v), although we 
don’t need this). And now the pair (di(z,u),di(w,v)) belongs to P, so that the 
pair (doz, dow) are related by I. 

Now let f: Co — J be a morphism whose kernel-pair is J. Then f fac- 
tors through the coequalizer Co — 1 of dg and dj, since the pair (do, d;) 
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factors through P =3 Co (and hence through I = Cg) by the morphism 
(sodo, lo, ): C1 — P; hence J must be the whole of Co x Co, and so P + Co x Co 
is a cover. o 


For any internal category C, we observe that the (n+1) morphisms Cn —> Co 
are all coequalized by Co — aC; so we can regard each Cn unambiguously as 
an object of S/7oC, and all the structural morphisms of C are morphisms over 
noC. Moreover, since Eroe: S/7pC — S preserves and reflects pullbacks, we see 
that C itself can be regarded as an internal category in S/7 oC; and since the 
conditions in 2.6.2(b) involve only pullbacks (rather than products), C is weakly 
filtered in S iff it is weakly filtered in S/noC. Thus we may rephrase 2.6.5, as 
follows: 


Corollary 2.6.6 Let S be an effective regular category. Then an internal 
category C in S is weakly filtered iff it is filtered as an internal category 
in S/ToC. O 


Lemma 2.6.7 Let f:F — C be a discrete opfibration. If C is weakly filtered, 
then so is F. 


Proof A straightforward diagram-chase shows that the squares 


Sc ———> Qc Te ———> Rec 


are pullbacks, where the vertical arrows are induced by composition with f. So 
this is immediate from the definition of a regular category. o 


Theorem 2.6.8 Let S be an effective regular category with coequalizers of 
reflexive pairs. Then an internal category C in S is filtered iff the functor 
- lime: [C, S] + S is cartesian. 


Proof First suppose C is filtered. We regard lime, as in 2.5.5, as the functor 
sending a discrete opfibration f: F — C to 7F; so 2.6.4 tells us that lime pre- 
serves the terminal object. So it suffices to show that lime preserves pullbacks; 
but in fact we need only show that it preserves binary products, since it will then 
follow from 2.6.6 and 2.6.7 that it sends pullbacks over (f: F — C) to products 
in S/7oF, i.e. to pullbacks over 7F in S. Now let f: F — C and g: G — C be 
two discrete opfibrations; then the projections F xc G — F and F xc G —G 
induce a morphism ¢: 79(F xc G) —> 7oF x 7G, and we must show that this is 
an isomorphism. 

To show that ¢ is monic, it suffices to show that any two elements of Fy xc, Go 
having the same image in 79F x 7G also have the same image in 79(F xc G). 
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Let (x1,Y1) and (£2,Y2) be two such elements; then 2, and x2 have the same 
image in 7F. But F is weakly filtered by 2.6.7, so by the argument in the proof 
of 2.6.5 the pair (21,272) lies in the image of Py — Fy x Fo; i.e. we can find 
elements u1, u2 of Fı satisfying diu; = 2; (i = 1,2) and dou; = douz. Similarly, 
we can find elements v1, v2 of G; satisfying div; = y; and dovı = dovz. Now the 
elements fiu, five, gıvı and gi v2 of Cı form a diagram of shape 


e —— e 


e ————> e 


in C; so we can find elements w1, w2 in Ci with diwi = do fiui, diw = dogi vi 
and dow; = dow2. And by an argument similar to that in the proof of 2.6.3 we 
can also ensure that dı (w1, f1u1) = di(we,giv1) and dı (w1, fiue2) = di(we, giv2) 
as elements of C1. Now the pair (w1, dou1) is in C1 xc, Fo, so we may regard 
it as an element Ù, of F}, and similarly we may lift w2 to an element we 
of Gi. And the pairs (dı (1, u1), dı (te, v1)) and (dı (1, u2), dı (We, v2)) lie in 
Fı xc, G1; but their domains are the elements (11, y1) and (£2, y2) respectively, 
and their codomains are equal. Hence (x1, y1) and (x2, y2) have the same image 
in no(F XC G). 

To show that ¢ is a cover, it similarly suffices to show that, given any element 
(x,y) of Fo x Go, we can find (2’, y’) in Fo xc, Go such that x and z’ have the 
same image in 79F, and y and y’ have the same image in 7G. But we can find 
elements u, v of Cı such that diu = fox, div = goy and dgu = dov; and if we 
then lift u and v, as before, to elements ŭ and č of F; and Gi with domains z and 
y respectively, the elements x’ = dot and y’ = doŭ have the required properties. 

For the converse, suppose now that lime is cartesian. Since lime preserves 
the terminal object, we have 79C = 1, and so Cp — 1 is a cover. For the second 
condition of 2.6.2(a), consider the discrete opfibration Dec: (C) — C of 2.5.4(b): 
we observed after 2.5.5 that m(Dec;(C)) = Co, so if lime preserves binary 
products we have mo(Dec: (C) x¢ Dec; (C)) = Co x Co. But the object of objects of 
Dec; (C) xc Dec; (C) is easily seen to be the pullback Pe; so P — Cox Ch is a cover. 
The third condition follows similarly from the fact that lim ç preserves equalizers: 
the two possible morphisms dor, = dorz and dr; = dir2: Rc — Co give rise 
to discrete opfibrations R(djr1) — C, as in 2.5.4(c), and the two morphisms 
Rc =3 Cı define a pair of morphisms R(dgri) = R(dir1) over C. Applying 
the functor 79 to the latter, we obtain the identity morphism Rc — Re twice 
over; but if we form the equalizer of the morphisms in doFib(S) (equivalently, 
in Cat(S)), we obtain an internal category whose object of objects is readily 
seen to be isomorphic to Te. So, if lime preserves equalizers, we have a cover 
Tc — Rc; once again, it is easy to verify that this cover is the one appearing in 
Definition 2.6.2(a). 
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Remark 2.6.9 If C is a filtered internal category in a topos S, we shall some- 
times write coc (or simply oo, if C is obvious from the context) for the geometric 
morphism S — [C, S] whose direct image is C* and whose inverse image is limc. 
(Note that, by 2.6.4 and 2.5.6, this is a morphism in Yop/S.) The thinking 
behind this notation is that, for any object c of C (that is, for any morphism 
c: 1 — Co), ‘evaluation at c’ is an inverse image functor from [C,S] to S, and 
we think of lime as ‘evaluation at the ideal object at infinity’. Note also that, if 
we merely assume that C is weakly filtered, then the same two functors define 
a pre-geometric morphism S — [C,S] in the sense of A4.1.13, since the proof 
of 2.6.8 shows that lim ç preserves pullbacks in this case. (Equivalently, we may 
regard this as a geometric morphism S/mC — [C, S]; cf. 2.6.6.) 


Using the description of lim + provided by 2.5.10, it is now an easy matter to 
extend 2.6.8 to a characterization of those internal functors f: D— C for which 
lim z: [D, S] — [C, S] is cartesian. 


Proposition 2.6.10 Let S be an effective regular category in which coequaliz- 
ers of reflexive pairs exist and are stable under pullback, and let f : D — C be an 
internal functor in S. Then lim p is cartesian iff the internal category Cı » D 
defined in the proof of 2.5.9 is filtered in S/Co (equivalently, iff f is initial and 
Cı xD is weakly filtered in S). 


Proof The forgetful functor [C, S] — S/Co is monadic and hence creates finite 
limits; so lim + is cartesian iff its composite with this forgetful functor is. But it 
is easily verified that the functor Cat(S)/C — Cat(S/Co) which sends D — C 
to Ci x D is cartesian and preserves discrete opfibrations; so the above composite 
may be viewed as the composite 


Ci ™ (—) limo, xD 
doFib(S)/D ———> doFib(S/Co)/(C1 x D) ———> S/Cy 


which is cartesian iff its second factor is. This establishes the first characteriza- 
tion; the second follows at once from it and 2.6.6. oO 


Before leaving this section, we consider the interplay between filtered cat- 
egories and final functors, as defined at the end of the last section. First we 
note that, for functors between weakly filtered categories, the definition of final 
functor may be somewhat simplified; for we have an explicit description of the 
equivalence relation which is the kernel-pair of Co —> moọC, as the image of 
Pe > Co x Co (cf. the proof of 2.6.5). Thus we may show 


Lemma 2.6.11 Suppose S is an effective regular category, and let D be a fil- 
tered internal category in S. Then an internal functor f: D — C is final iff the 
morphisms dinz: Do Xco Cı ~ Co and ((dimi, na)ni, (dimi, najma): Ty —> Ry 
are covers, where Ry is the kernel-pair of (m1, 4172): Do Xco C1 —> Do X Co and 
Ti that of (mı, di (fi x 1)): Di xo Ci > Dı x C1. 
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Proof Note that, when f is the identity functor on C, the definitions of Ry and 
T; reduce to those of Rc and Tc given in 2.6.1. Intuitively, the second condition 
says that, given a parallel pair of morphisms (x,y): c =3 f(d) in C, we can find 
z:d—d' such that f(z) has equal composites with x and y. 

Since D is filtered we deduce, as in the proof of 2.6.5, that the equivalence 
relation generated by ((dom, di(fi x 1)), dı x 1): Di xo, Ch 3 Do xc, Ci is 
simply the image of (dı x 1) x (dı x 1): Ss — (Do Xq C1) x (Do Xc,C1), where 
S; is the kernel-pair of (dom, di(fi x 1)): Di xc, Ci — Dox c,C1. So we have 
to show that the second condition of the lemma is equivalent to saying that 
Ss — Qz is a cover, where Qy is the kernel-pair of dim2: Do Xc C1 > Co. But 
this may easily be proved by ‘elementwise’ arguments like those in the proofs of 
2.6.3 and 2.6.5. o 


If C is also known to be filtered and f is full, then it is easy to verify that 
the second condition of 2.6.11 is redundant. 
In a similar vein, we may establish 


Proposition 2.6.12 Let f:D—C be a final internal functor in an effective 
regular category S. If D is filtered, then so is C. 


Proof As a sample of the argument, we shall show how C inherits the second 
property of 2.6.2(a) from D; the other cases are similar. Consider the diagram 


d, Te x dinz 
WSS E E TRAE ET O = OOS 


Mi X Ti 


Py ————————_ Do x Do 


where the square is a pullback. The top composite is thus a cover; but it is easy 
to construct a factorization of it through Pe —> Co x Co. g 


Of course, given the result of 2.6.8, we could alternatively have deduced the 
result of 2.6.12 using 2.5.12. 

We conclude this section with an important result which, in conjunction with 
2.5.12, tells us that we can in practice usually replace arbitrary filtered colimits 
(that is, colimits over arbitrary filtered categories) by colimits over directed 
posets. However, for this result we shall need to assume rather more about our 
base category S. 


Theorem 2.6.13 Let S be a topos with a natural number object. For any fil- 
tered internal category C in S, there erists a final functor D — C where D is a 
directed poset. 
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Proof We shall give the proof explicitly only in the case S = Set: the 
techniques needed to ‘internalize’ this proof in an arbitrary topos with natu- 
ral number object are to be found in Part D. (In particular, we remark that the 
notion of finiteness required for the proof is the Kuratowskian one, studied in 
Section D5.4.) 

The objects of D are taken to be finite subcategories, not of C itself but of 
C x N, where N is the ordered set of natural numbers (with the ordering defined 
in Section A2.5); the reason for using this product will become apparent in the 
course of the proof. We say a subcategory F of C x N is suitable if 


(i) F is finite (that is, both Fp and F; are finite); 
(ii) F has a terminal object; and 


(iii) for any morphism (a, 8): (c,n) — (c, n’) of F, if 2 is an identity morphism 
then so is a. 


Note that one effect of the third condition is to ensure that the only isomorphisms 
in F are identity maps; in particular, this ensures that the terminal object of F is 
unique, and we denote it by t(F). We take D to be the poset of all suitable sub- 
categories of C x N, ordered by inclusion. We have an evident functor f: D— C, 
which sends F to the first component of t(F) and an inclusion F; C Fz to the 
first component of the unique morphism t(F1) — t(F2) in F2. We must show 
that D is directed and that f is final. 

D is clearly nonempty: if c is any object of C, then the subcategory whose 
only object is (c,0) and whose only morphism is the identity is clearly suitable. 
Let F; and Fz be suitable subcategories. The subcategory F; V Fz generated by 
their union is still finite, because of condition (iii) which ensures that we do not 
have to compose arbitrarily long strings of morphisms in the union; and it also 
clearly satisfies condition (iii). However, it lacks a terminal object in general. 
Let us write t(Fi) = (&, ni); let n3 = max {n1,n2} + 1, and choose a pair of 
morphisms (cı — ¢3, C2 —> c3) in C with common codomain. If we simply adjoin 
the morphisms (ci, ni) > (€3,73) (i = 1,2) to Fı V Fe, we still may not have 
a terminal object, since if an object (co, no) belongs to both F; and Fz then 
the composites cg — Cı — cg and co — C2 — c3 may not be equal. But there 
are only finitely many such pairs; so we can find a single morphism c3 — C4 
coequalizing all of them. Now if we adjoin the morphisms (c1, n1) — (ca, na) 
and (c2,72) — (c4, n3) (plus all necessary composites) to Fı V F2, we obtain a 
suitable subcategory F3 containing both F, and F2, as required. 

The proof that f is a final functor is very similar, and we omit it. 0 


Example 2.6.14 To show that the hypothesis that S has a natural num- 
ber object cannot be omitted from 2.6.13, consider the case S = Sety. Let 
M = {1,e} be the two-element monoid with e? = e, considered as a finite cate- 
gory (that is, an internal category in Set;): it is straightforward to verify that 
M is filtered. However, any finite directed poset P has a greatest member po, 
from which it is easy to see that no functor f: P — M can be final: if x denotes 
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the unique object of M, then (1: * — f(po)) and (e: * > f(pg)) necessarily lie 
in different connected components of the comma category (* | f). 


B2.7 Internal profunctors 


In this section we develop the basic properties of the bicategory of internal pro- 
functors in a category S; our standing hypothesis for the section is that S is 
cartesian and has coequalizers of reflexive pairs which are stable under pullback. 
By A1.3.5 and the remarks following it, this implies that S is regular; so this 
hypothesis is slightly stronger than that which we assumed in the previous sec- 
tion — although we shall not need to assume that S is effective regular, as we did 
in the proof of 2.6.5 above. 

Classically, a profunctor from a category C to a category D (also called a 
distributor, or a D-C bimodule) is a functor C°P x D — Set, thought of as a 
‘generalized functor’ from C to D. If we replace the ordinary categories C and D 
by internal categories C and D in S, then we may of course model this notion 
by that of a discrete opfibration over C°P x D (or by a discrete fibration over 
CxD°?); but it turns out to be more profitable to exploit the mixture of variances 
in the definition, producing the following more symmetrical notion: 


Definition 2.7.1 Let C and D be internal categories in a cartesian category S. 
By a profunctor T: C + D, we mean an object of S/Do x Co, or equivalently a 
span 


Dy <————— Ty ——> Co 


in S, equipped with a left action Av: Di xp, To — To and a right action 
pr: To Xc Ci — To (both maps over Do x Co), which are associative and unital, 
and such that 


1x pr 
Dı X Do To X Co Ci ——> D: X Do To 


Ar x1 AT 


Pr 
To Xc, Ci ——— > To 


commutes. A morphism of profunctors is a morphism over Do x Co commut- 
ing with the left and right actions. We write Prof(C,D) (or, if necessary, 
Profs (C, D)) for the category of profunctors C + D; by the remarks above, 
this category is isomorphic to [C°? x D, S]. 


We may clearly identify internal diagrams of shape C and of shape C°P with 
profunctors 1 9% C and C % 1 respectively, where 1 is the terminal object of 
Cat(S). 
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Example 2.7.2 The unit or Yoneda profunctor Y (C): C % C is defined to be 
the object 


do dı 
Co <—_—— Ci —— > o 


of S/Co x Co, with both left and right actions given by d? : C2 > C1. The con- 
dition that the two actions commute with each other reduces in this case to the 
associativity of composition in C. 


We may regard Y (C) as a diagram of shape C°? in [C, S], and hence as an 
indexed functor [C]°P — [C, S]. As such, it is a full embedding: it sends an object 
x: I — Co of [C]! to the diagram which we called R(z) in 2.5.4(c), regarded as 
an I-indexed family in virtue of the fact (which we observed after 2.5.5) that 
moR(xz) = I. And, using the description of R as a left adjoint, it is easy to see that 
morphisms R(x) — R(y) over C x I correspond to morphisms I — Cı making 


Co x Co 


commute, i.e. to morphisms y — x in [C]/. And since this correspondence is 
clearly natural in J, it induces an isomorphism 


[C] (y, £) = S (R(x), R(y)) 


in S/I. 

Now suppose we are given three internal categories B,C,D and two pro- 
functors S: B} C, T:C D. We define (T &c S)p to be the coequalizer of 
the pair 

pPrx1 
To X Co Ci X Co So D2 To X Co So 
1x Ag 


(note that this pair is reflexive, with common splitting induced by the inclusion- 
of-identities of C). We note also that (T @c.S)p comes equipped with a canonical 
morphism to Do x Bo, induced by the morphisms Tp — Do and Sp — Bo, since 
the two morphisms above commute with these in the obvious sense. Further, our 
standing hypothesis on S implies that the diagram 


1 X PT X 1 
Dı XD, To XC Ci XCp SO LIZZ Di Xd, To Xcy So —> Di Xpo (T Be S)o 
1x1xAg 


is also a coequalizer; and since the morphisms pr x 1 and 1 x Ag are both 
‘equivariant’ in an obvious sense for Az, the latter induces a left action 
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Di xD, (T @c S)o > (T &c S)o. Similarly, ps induces a right action of B on 
(T @c S)o, and it is straightforward to check that these actions give T @c S the 
structure of a profunctor B + D. And it is also clear that if we are given mor- 
phisms of profunctors S — S’ and T — T’, we obtain a morphism of profunctors 
(T Sc S) — (T’ @¢ S’) in the obvious way; so (—) @c (—) becomes a bifunctor 
Profs (C, D) x Profs (B,C) > Profs (B,D). 


Proposition 2.7.3 Composition of profunctors, as defined above, is associa- 
tive up to coherent natural isomorphism, and has the Yoneda profunctors Y (C) 
as two-sided units. Equivalently, profunctors in S are the 1-cells of a bicate- 
gory Profs, whose 0-cells are the internal categories in S and whose 2-cells are 
morphisms of profunctors. 


Proof Given profunctors S: B + C, T:C% D and U:D% E, we may form 
the diagram 


Uo X Do D1 X Do To XCo C1 XCo So TZ Vo X po To XCo C1 XCo So —> (U Bp T)o xco C1 X Cg So 


— 
Uo Xx Do Di X Dg To XCo So TTS Uo X Dg To XCo So —————> (U ®p T)o Xca Sa 


—— 
Uo X Do P1 X Do (T 8c $)o TS Uo X Do (T Bc S)o 


in which the first two rows and columns are coequalizers, by our assumption on 
S; so the coequalizer of the third row is canonically isomorphic to that of the 
third column. And a straightforward diagram-chase shows that this isomorphism 
is an isomorphism of profunctors U @p (T @c S) = (U p T) Sc S; moreover, 
the canonical nature of the isomorphism ensures that it satisfies the coherence 
conditions for composition in a bicategory. 

For the units, let us consider Y (D) p T, where T: C + D. We have to form 
the coequalizer of 


x1 
Dı Xp, Di X Do To = Dı X Do To; 
1x àr 


but the equations satisfied by D and T ensure that Ar: Di xp, To —> To is a 
split coequalizer for this pair, with splittings given by 


si x1 so x 1 
Dı X D Dı X Dy To «—— D; XD, To c T. 
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Further, Àr is equivariant for the left action of D and the right action of C, so 
it induces an isomorphism of profunctors Y(D) @p T & T. Similarly, we have 
T @c Y (C) ST; and it is easy to check that these unit isomorphisms satisfy the 
appropriate coherence conditions with the associativity isomorphisms described 
above. E 


In the case S = Set, the tensor product of two profunctors S: B°P x C — Set 
and T: C°? x D — Set is often written as a coend: 


C 
(T &c¢ S)(B, D) = f S(B,C) x T(C, D). 


We shall occasionally find it convenient to use this notation for internal 
profunctors, too. 

Now let f: C > D be an internal functor. We define a profunctor fe: C D 
by taking (f.)o to be the object (dom, m2): Dı X Da C1 > Do x Co of S/ Do x Co, 
with right C-action given by 


(R(x fı), drz) 


Dı X Do Co X co C1 S Di X Do Cy —————_ D; X Do Co 


and left D-action given by composition in D. Similarly, we may make 
(m7, 4172): Co Xp, Dı — Co x Do into a profunctor f°: D % C. Note that (1c). 
and (1¢)® are both canonically isomorphic to Y (C). 


Proposition 2.7.4 
(i) For any f: C — D in Cat(S), the diagram 


profs, r) LE eof, (C,E) 
(PP x 1) 


[DP x E, S] —————> [CC® x E, S] 


commutes up to natural isomorphism. (In a similar way, (f°) @p (—) may 
be identified with (1 x f)*: [B°P x D, S] — [B® x C, S].) 
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(ii) The assignment f > fe is (the effect on 1-cells of) a pseudofunctor 
€at(S)° — Profs. Moreover, this functor is locally full and faith- 
ful (i.e. for each pair of internal categories (C, D), the induced functor 
€at(S) (C, D)°? — Profs (C, D) is full and faithful). 


(iii) For any f, the profunctor f° is right adjoint to fe in Profs. 


Proof (i) This is similar to the verification of the unit law in the proof of 
2.7.3. Given a profunctor T: D % E, we have a split coequalizer diagram 


prx1xil 
pr xl 
Th hi De Xp, Di pes —— — > Thx E A a a, 
1xd@x1 


with splittings induced by the inclusion-of-identities of D. And it is straight- 
forward to verify that these morphisms are equivariant with respect to the left 
E-action and the right C-action, when To x p, Co is made into a profunctor C % E 
by regarding it as the pullback of Tọ along fo x 1. 

(ii) Given internal functors f:C —D and g:D—E, (i) tells us that 
(ge Sp fe)o is the pullback of (ge)o along (1 x fo), i.e. it is the object E1 xx, 
Dox Do Co = E1 xX E Co of S/EoxCpo. As usual, it is straightforward to extend this 
isomorphism to an isomorphism of profunctors ge ®p fe = (gf)e. The canonical 
nature of this isomorphism ensures that it, and the isomorphism Y(C) = (ic). 
which we noted earlier, satisfy the coherence conditions for a pseudofunctor. 

Now suppose we are given a 2-cell a: f — g of €at(S), that is an internal 
natural transformation between internal functors C = D. By definition, a is a 
morphism Cy — Dı such that dja = go and dja = fo, and so we obtain a 
morphism 


1 x (a, 1) d? x1 
Dı X Do.go Co —————> Dı X Do D; X Do, fo Co ——> D; X Do, fo Co 7 


where we have added extra subscripts to the product signs to indicate the 
morphisms by which Co is regarded as an object over Do. It is clear that 
this morphism is equivariant for the left action of D, and (using the nat- 
urality of a) for the right action of C: that is, it is a morphism of pro- 
functors a,: ge — fe. It is further straightforward to verify that a +» a, 
is a functor Cat(S)(C,D)°? — Prof, (C, D); and that these functors sat- 
isfy the coherence conditions (as C and D vary) needed to make (—), into a 
pseudofunctor. 

Conversely, suppose we have a morphism of profunctors h: 9. — fe. Then 
the composite 


(S090, 1) 0 m 
Co —————> Dı X Do,go Co —> Dı X Do, fo Co —— D; 
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is a morphism a: Co — Dy, satisfying doa = go and dia = fo; and it is not 
hard to see that the C-equivariance condition on h translates into the condition 
for a to be an internal natural transformation. Moreover, the D-equivariance 
condition tells us that k can be recovered from its restriction to the subobject 
(8090, 1): Co — Dy X Do,go Co of its domain, and hence that the construction just 
given is a two-sided inverse for the previous construction of profunctor morphisms 
from internal natural transformations. So (—).: Cat(S) (C, D) — Profs (C, D) 
is full and faithful. 

(iii) By (i), (f° @p fe)o is the object (m1, 73): Co X Do Di X Do Co —> Co x Co; 
and it is easy to see that (do, fı, d1): C1 — Co Xp, Di Xp, Co is a morphism 
of profunctors nz: Y(C) — f° @p fe. On the other hand, (fe @c f*)o is the 
coequalizer of 


(11, dom, di( fie, 73)) 
Dı X Do Cı X Do Di Di X Do Co X Co Dy; 
(dy(m™, fit), di 72, 73) 


and the morphism d,(m™,73): Dı xp, Co Xp, Di — Dı factors through this 
coequalizer to yield a morphism of profunctors es: fe @c f* — Y (D). It is again 
straightforward to verify that these morphisms satisfy the triangular identities 
for an adjunction (fe 4 f°). oO 


Corollary 2.7.5 For any f: C > D in Cat(S), the diagram 


protse, e) LEC geoj, (D,E) 


R 
I 


liM (for x1) 
[C x E, S| > [D® x E, S| 


commutes up to natural isomorphism. 


Proof This is immediate from (i) and (iii) of 2.7.4: since f® is right adjoint to 
fa, the functor (~) & f° is left adjoint to (~) @ fe. oO 


Parts (ii) and (iii) of 2.7.4 together say that the passage from Cat(S) to the 
bicategory Prof, is very similar to that from a regular category C to its (locally 
ordered) 2-category of relations Rel(C), which we studied in Section A3.1. (This 
explains why we have re-used for profunctors some of the notation that we used 
for categories of relations.) However, there is one respect in which the analogy 
fails to hold: not every morphism of Profs which has a right adjoint is necessarily 
of the form fe. 
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Example 2.7.6 Let S = Set; let C be a small category, C € ob C and let 
e: C — C be an idempotent endomorphism. Let F: C — Set be the functor D > 
{f:C — D | fe = f}, and let G: CP — Set be Br {g: BC | eg = g}. If 
we regard F and G as profunctors 1 % C and C % 1 respectively, where 1 is the 
terminal object of Cat, then it is easy to see that F is left adjoint to G. But F is 
isomorphic to a profunctor of the form f, iff it is representable, iff the idempotent 
e splits in C. So if we take C to be a category in which not all idempotents split, 
then we have adjoint pairs of profunctors between 1 and C which are not of the 
form (fe, f°). 

In fact it can be shown that this is essentially the only possible counterex- 
ample: if C is a Cauchy-complete category and T: B +} C is a profunctor having 
a right adjoint, then T = fe for some f: B — C (which is of course unique up to 
natural isomorphism, by 2.7.4(ii)). The same result holds, with an appropriate 
internal definition of Cauchy-completeness, for internal profunctors in any cate- 
gory S satisfying the standing hypothesis of this section. However, we shall not 
need this result, and so we omit the details of the proof. 


Another feature of Profs can be viewed as the analogue of the fact that 
Rel(C) is a tabular allegory: 


Proposition 2.7.7 For any profunctor T: C } D, there is a span 


f g 


C < T — D 


in Cat(S) such that T is isomorphic to the profunctor ge Sr f°. 


Proof We use a ‘mixed-variance’ version of the construction of a discrete 
opfibration from a diagram, given in 2.5.1. Specifically, we define an internal 
category T with object of objects To and object of morphisms given by the 
pullback 


k 
T, ———> Dı xo To , 
h ÀT 
PT 
To X Co Ci —> To 


with do, dı: Ti 3 To given by mh and mek respectively. The inclusion-of- 
identities of T is induced in the obvious way by those of C and D; to define 
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the composition, consider the diagram 


T 
T xmn Ti oD X Do Tı TOEN 
l ixh h 


x 
Th XCo C71 —-_> Dı X Do To X Co Ci — > To X Co Ci 
T23 


Wy 712 nı 


T: 
Ti ————> Dı Koc ST: 


in which all the squares are pullbacks. We have 
diroghr 1 = foprhml = foArkml = fomek7 1 = fomıhrom = dotghnom = dorzal 


and so the pair (m2h7 1, t21) maps T; x7, Ti into Cy xc, C1- Hence we can define 
a morphism 


(my hmyl, dy(w2hnyl, nal)) 
Ti xn D ———— To xo C. 


Similarly, we have 


(dı (nım, mıkram), rzkram) 
T xn D D1 Xd, To; 


and these two morphisms combine to yield dı: Ti x7, Tı —> Tı. The unit and 
associative laws for composition in T follow straightforwardly from those for C 
and D. 

The projections Do — Tp — Co become the object-maps of the internal 
functors g and f, which are given on morphisms by 72h and mik respectively. 
To establish the isomorphism T = ge Qr f°, we have to show that 


(71, dota, di( finz, T3)) Ar( x pr) 
Dı X Do Tı X Co Cı Dı X Do To X Co Cy ———> To 
(di (71, 9172), dy 72, 73) 


is a coequalizer. For this, note first that 1 x pr: Di Xp, To Xo, Ci —> Di X Do To 
is the (split) coequalizer of 1 x 1 xd, and 1xprx1: D1 xp, To XQ Ci xaCi 3 
Dy x D, To X co C1; and this pair factors through the pair in the displayed diagram 
above by 


1xsxl 
Di Xp, To XCo C1 XCo Ci ————> Di Xd, Ti Xc C1 , 
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where s is the splitting of h obtained by pulling back (sogo, 1): To > Di xp, To 
along pr. Hence the coequalizer (q, say) of the pair in the displayed diagram 
factors through 1 x pr. Similarly, q factors through Ay x 1; so it factors through 
the diagonal of the square 


1X pr 
Dı X Do To xC C1 ——— D, x Do To 


àr x1 ÀT 


PT 
To X Co Ci — To 


since this square is a pushout. But it is easy to verify directly that A7(1 x pr) 
coequalizes the given pair; so it is isomorphic to q. oO 


Corollary 2.7.8 Suppose S is a locally cartesian closed category with coequal- 
izers of reflexive pairs. Then the bicategory Profs is biclosed; that is, for each 
profunctor T:C% D in S, the functors (—) 8p T and T @c (—) have right 
adjoints. 


Proof By 2.7.4(i), 2.7.5 and 2.7.7, we can factor (—) @p T : Prof (D, E) — 
Prof (C, E), up to equivalence, as 


g lim x1 
[DP x E, S] r [TP x E, S] piace [C° x E, S]; 
but both of these functors have right adjoints (the first by 2.3.20, since S is 
S-complete by 1.4.7). The argument for T @c (—) is similar. oO 


Remark 2.7.9 We note that if T is a diagram of shape C, regarded as a 
profunctor 19> C, then the category T constructed in the proof of 2.7.7 is just 
the domain of the discrete opfibration y: T — C which corresponds to T under 
the equivalence of 2.5.3. In particular, it follows as in the proof of 2.7.8 that 
(—) @c T: [C°P, S] -+ S may be factored as the composite 


y* lim pop 
[CP S] Mes IT?, S —— S. 


This observation will be of importance in the proof of Diaconescu’s Theorem in 
Section B3.2. 


Suggestion for further reading: Kock & Wraith [649]. 


B3 
TOPOSES OVER A BASE 


B3.1 S-toposes as S-indexed categories 


We are interested in this chapter in studying the 2-category Yop/S of toposes 
over a fixed topos S, through the medium of indexed category theory. We recall 
the precise definition of this 2-category from Section A4.1: its objects (which 
we call toposes defined over S, or simply S-toposes) are geometric morphisms 
p: E > S with codomain S, its morphisms (q: F — S) — (p: E — S) are pairs 
(f,a) where f: F — € is a geometric morphism and a: q = pf a geometric 
transformation, and its 2-cells (f,a) — (g,@) are geometric transformations 
f — g compatible in the obvious sense with a and 3. However, we shall almost 
invariably abuse notation by suppressing any mention of the 2-isomorphism a 
when specifying 1-cells of Top/S; we shall also tend to suppress the structural 
morphism p when specifying objects of Top/S, and simply write ‘E is an S-topos’. 
(The justification for the latter abuse is contained in Theorem 3.1.2 below.) 

The main message we wish to convey is that, to a very large extent, the way in 
which Yop/S behaves is independent of any particular features of S: provided we 
remain within the context of S-indexed categories and functors, we can pretend 
that S is simply the topos of sets. (There are only two significant exceptions 
to this, namely that we must refrain from using the Law of Excluded Middle 
(resp. the Axiom of Infinity) in our set-theoretic arguments, unless S happens 
to be a Boolean topos (resp. to have a natural number object). However, both 
of these restrictions will be found quite easy to get used to in practice.) 

Of course, if p: E — S is an S-topos, we use the inverse image functor p* to 
make € into an S-indexed category, as in 1.2.2(d); that is, we set EF = E/p*l. 
(In fact we shall tend to simplify our notation, whenever possible, by identifying 
an object I of S with its image p* I in €.) As usual, we write E for the S-indexed 
category obtained from € in this way. 

Since p* is cartesian, it extends to an indexed (cartesian) functor S — E, by 
1.2.2(d); we shall again use p* to denote this indexed functor. Similarly, the right 
adjoint p, of p* extends to an indexed functor E — S, which is an indexed right 
adjoint for p*, by 1.2.3. So we have an ‘indexed geometric morphism’ from E to 
S. We note also that, by 1.4.8 and 2.2.6, E is cocomplete and locally small as an 
S-indexed category. Also, if f: F — E is any morphism of Vop/S, then we may 
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regard the adjunction (f* 4 f,.) as indexed over £, and hence (by change of base 
along the inverse image functor p*: S — E) as an S-indexed adjunction. 

However, we observed in 1.2.4 that the left adjoint of f*, if it has one, does 
not automatically extend to an S-indexed functor. (A counterexample may be 
constructed using the second example in 1.4.4(6): note that the functor 1: 1 — 2 
is a discrete opfibration, and so serves to identify [1, Set] with a slice category 
of [2, Set].) 


Definition 3.1.1 Extending the terminology which we introduced in A4.1.5, 
we shall say that a geometric morphism f: F — E over S is S-essential if f* 
has an S-indexed left adjoint fı. Equivalently, f is S-essential if f* has an 
ordinary left adjoint fi, and for each morphism x: J — J in S the canonical 
(Beck~-Chevalley) natural transformation in the diagram 


iyd 
FJJ o; E/p* J 


ge —> x 


(fi)? 
F/q*I ——> €/P*I 
(where (fi)! denotes the functor which sends an object (B — q*I = f*p*I) to 
its transpose (fıB — p*I), as in 1.2.4) is an isomorphism. 


Examples of S-essential morphisms include the geometric morphisms 
[C, S] — [D,S] induced by internal functors C — D in S, as in 2.3.22; the 
proof of this is contained in 2.3.21. 

In the case when S is Set, we have seen (in A4.1.9) that there is (up to 
isomorphism) at most one geometric morphism from any topos E£ to S, so that 
‘being defined over S’ can be regarded as a property of a topos rather than an 
extra structure. (In fact we saw that it is equivalent to the property of being 
locally small and having small copowers.) Our first aim in this section is to 
establish the corresponding result over an arbitrary base S. By an S-indexed 
topos, we mean an S-indexed category E such that each fibre €/ is a topos and 
each transition functor €7 — E7 is logical. (Clearly, the S-indexed category 
obtained from an S-topos p: E — S as above is an S-indexed topos, by A2.3.3.) 
An S-indered geometric morphism between S-indexed toposes is, of course, an 
S-indexed adjunction (f* 4 f,) such that f* is cartesian. With the obvious 
definition of S-indexed geometric transformation, we thus have a 2-category 
Tops of S-indexed toposes. The following result is due to J. L. Moens [820], 
though we give a later proof due to M. Jibladze [497]. 


Theorem 3.1.2 For an S-indexed topos E, the following are equivalent: 


(i) E is S-complete and locally small. 
(ii) E is S-cocomplete and locally small. 
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(iii) There exists a geometric morphism p: E! — S, such that E is equivalent 
to the indexed category I > E1/p*I. 


Moreover, if these conditions are satisfied, then the geometric morphism in (iii) 
is unique upto unique 2-isomorphism. 


Proof (i) © (ii) follows from 1.4.6, since by A2.2.10(ii) a logical functor has 
a left adjoint iff it has a right adjoint (and logical functors are cartesian and 
cocartesian). We have already noted that (iii) => (ii) follows from 1.4.8 and 2.2.6; 
so it suffices to prove that (ii) implies (iii). 

For this, we employ A2.3.8, which characterized pullback functors amongst 
logical functors having left adjoints; we shall show that each of the transition 
functors r* : €7 — E! satisfies this characterization. In the case when zx is monic, 
this is easy, since applying the Beck—Chevalley condition to the pullback square 


ae 


I I 
T 
I ——_—__> J 


yields the result that the unit 1 — z* o ©, is an isomorphism, and so ©, is (full 
and) faithful. For a general x, form the pullback square 


y 


K I 
I J 


| 


T 


| 


and the diagonal w: I — K. Given an object A of £7, we have a commutative 
diagram 
Z (€) 


Ly lyw*z*A ——> Lyz*A 


wie = ee A 
where 77 is the unit of (£, 4 x*) and e€ is the counit of (X,, 4 w*); the right vertical 
arrow is an isomorphism by Beck—Chevalley, and the left one is an isomorphism 
because yw = zw = 1;. Now ©,” is isomorphic to the functor (—) x U, where 
U =%,,(1), by A1.5.8; but since w is (split) monic we know from the particular 


B3.1 S-TOPOSES AS S-INDEXED CATEGORIES 371 


case already discussed that U must be a subterminal object in E*. So € is monic; 
but by A2.4.8 &,, preserves monomorphisms, and hence 7 is monic — equivalently, 
£, is faithful. So by A2.3.8 we can identify E7 with a slice of £7, and x* with 
the pullback functor. 

In particular, defining p*J, for I € ob S, to be ©; applied to the terminal 
object of E7, we obtain a functor p*: S > E! such that E is equivalent to the 
indexed category (I +» E!/p*I). Thus the latter S-indexed category is cocom- 
plete and locally small, whence p* is cartesian (since it preserves 1 by definition) 
and has a right adjoint by 1.4.8 and 2.2.5. So p* is the inverse image of a geometric 
morphism, as claimed. 

For the uniqueness of p, we observe that it extends to an indexed geometric 
morphism E — S, by the remarks at the beginning of this section; and the 
latter is unique up to isomorphism by the ‘S-indexed version’ of the argument 
used in A4.1.9. Specifically, p* preserves 1 and S-indexed coproducts; but every 
object x: I — J of S/J can be written as an S-indexed coproduct of copies of 1 
(specifically, x = £,(1)), and so p* is determined up to unique S-indexed natural 
isomorphism. Hence the ordinary geometric morphism p: E — S is determined 
up to canonical natural isomorphism (given the indexing of € over S), as the 
1-component of the indexed one. O 


Remark 3.1.3 Moens’ original proof of the implication (ii) > (iii) in 3.1.2 was 
based on 1.4.12; he showed that if an S-indexed topos has S-indexed coproducts 
then they are necessarily disjoint and stable (i.e. the ‘S-indexed analogue’ of 
A2.4.4). We omit the details. 


Remark 3.1.4 The assertion, for an S-indexed topos E, that there exists an 
S-indexed geometric morphism E — S, is strictly weaker than the equivalent 
conditions of 3.1.2. The reason is connected with the fact, noted at the end of 
A4.1.9, that a topos defined over Set need not be cocomplete (in the classical 
sense); if we take € to be an incomplete Set-topos such as the one described in 
A2.1.7, then the Set-indexing of E arising from the geometric morphism E — Set 
does not coincide with the naive indexing I +> (I-fold power of £) (in fact the 
latter is the stack completion of the former, for the coverage of 1.5.4); the former 
is (complete and) cocomplete as a Set-indexed category, but the latter is not. 
However, both indexed categories admit Set-indexed geometric morphisms to 
the canonical indexing of Set over itself. On the other hand, it can be shown 
that an S-indexed topos admitting an indexed geometric morphism to S is locally 
small, by the indexed version of the argument of A4.1.9; so we may if we wish 
add ‘E is S-cocomplete and admits an S-indexed geometric morphism to S’ to 
the equivalent conditions of 3.1.2. 


Theorem 3.1.2 tells us that we can regard the 2-category Yop/S as a sub- 
2-category of the 2-category Tops of S-indexed toposes. In fact it is a full 
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sub-2-category, in the appropriate sense: 


Proposition 3.1.5 Let p: E — S and q: F — S be S-toposes, and let E and 
F be the corresponding S-indexed toposes. Then there is an equivalence of 
categories 


Top/S (q, p) ~ Tops (F, E) . 


Proof Given a geometric morphism f: F — € over S, we extend it to an 
S-indexed geometric morphism in the obvious way: (f*)! is simply f* applied 
to objects and morphisms of €/p*I (followed by pullback along the isomor- 
phism g*I — f*p*I), and (f.)! is f, followed by pullback along the unit 
pl — f, f*p*l = f.q*I. Conversely, if f: F — E is any S-indexed geometric 
morphism, then it follows from the uniqueness clause of 3.1.2 that the composite 
of p with the restriction f! of f to the fibres over 1 is isomorphic to q, i.e. that 
f’ is a geometric morphism over S. Both constructions are clearly functorial on 
2-cells, so it remains to verify that the whole of f is (up to isomorphism) induced 
by f! in the manner indicated above. 
Let g: A— p*I be an object of E7. Since the square 


(eo) 


E€! > F! 


E; Xr 
(ey 


E ———_—_> F 
commutes up to natural isomorphism, we know that (f*} (g) is isomorphic to an 
object of the form h: (f*)!(A) — q*I. And on chasing the graph of g (regarded 
as a morphism g — I*(A) in E7) around this diagram, and using the fact that 
f* also commutes up to isomorphism with I*, we see that h must be (identified 
with) (f*)!(g), as required. o 


We have seen that S-toposes are complete, cocomplete and locally small as 
S-indexed ‘categories. They are also well-powered and well-copowered, by 1.3.14 
and 2.2.7; hence we shall be particularly interested in those which have S-indexed 
separating and/or coseparating families as defined in 2.4.1, so that we can apply 
the S-indexed adjoint functor theorem to them. It turns out that separating 
families for S-toposes can be described in more elementary terms: 


Lemma 3.1.6 Let p: E — S be an S-topos, and E the corresponding S-indexed 
topos. The following are equivalent: 
(i) E has an S-indered separating family. 


(ii) There exists an object B of E such that every object of E is a subquotient 
(cf. A4.6.1) of one of the form p*I x B. 
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(iii) There exists an object B of E such that, for every object A of E, the 
composite 


~ LE $ 
p*p.(A®) x B => A® x BY À 
is epic, where A is the partial-map representer for A (cf. A2.4.7) and € is 
the counit of (p* 4 p4). 
Proof (i)= (ii): Let g: B > p*J € ob E£” be a separating family for E, and let 
A be any object of €. By 2.4.3, there exists x: I — J in S and an epimorphism 
u,x2*(g) > A in £; but L;x*(g) is the top left vertex of the pullback square 


P ———>B 


g 


p*z 
p“ I ——> p* J 
and hence a subobject of p*I x B. So the object B has the property specified 
in (ii). ` 
(ii) = (iii): Apply (ii) to the object A; since all such objects are injective, it 
must be a quotient of some p*I x B, say h: p*I x B — A. Transposing succes- 
sively across the exponential adjunction and (p* 4 p,), we obtain a morphism 
h: I — p,(A®) such that h is the composite 
p*(h) x1 ~ Exil = = 
p*I x B ——_> p*p, (A?) x B ——> A® x B > À; 
since h is epic, so is ev(e x 1). : 7 
(iii) > (ii): For any A, set I = p,(A”); then since A is a subobject of A, it is 
a subquotient of p*I x B. 
(ii) > (i): Let B be as in (ii). Set J = p,(PB), and form the pullback 


G ————> EB ; 


g 


p*p.(PB) ——> PB 


we claim that g: G — p*J is an S-indexed separating family for E. Given an 
object f: A — p*I of E7, choose a subquotient representation 


S A ; 


pK xB 
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let s: p*K — PB be the name of S, regarded as a relation from p* K to B, and 
š: K > p,(PB) = J its transpose. The square 


S —————> EB 


xl 
pK x B-—-> PBxB 


is a pullback by the definition of s; but the bottom edge of this square factors 
through cpp x 1, so we also have a pullback square 


S ———>G 


Thus, for the span 


12 51 
I <— K x I — J, 
we obtain an epimorphism Er, (371)*(g) > f in E7, as required. o 


Definition 3.1.7 We say an S-topos p: E — S (or the geometric morphism p) 
is bounded if it satisfies the equivalent conditions of Lemma 3.1.6. An object B of 
E with the (equivalent) properties described in 3.1.6(ii) or (iii) is called a bound 
for E over S. 


Alternative names that have been used for the concept of bound, as just 
defined, include ‘object of generators’ and ‘progenitor’. 


Examples 3.1.8 (a) Localic morphisms, as defined in A4.6.1, are bounded; 
they are exactly those geometric morphisms such that 1 is a bound for the domain 
over the codomain. In passing, we note that if a bound B is a subquotient of an 
object C, then C is also a bound; thus localic morphisms may equivalently be 
characterized as those for which every well-supported object of the domain is a 
bound. 

(b) Interpreting the proof of 3.1.6 in the case S = Set, we see that a bound 
for a Set-topos € is simply an object B whose subobjects form a separating 
set (equivalently, since € is balanced, a generating set) for £. In particular, 
if a Set-topos £ has a generating set {G; | i € I} such that the coproduct 
Licey Gi exists in E, then the latter is a bound for E, since each G; occurs 
as a subobject of it (cf. A2.4.4). For example, in a functor category [C, Set] 
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where C is small, the coproduct of all the representable functors C(A,—), A € 
ob C, is a bound. On the other hand, for a topological group G, the topos 
Unif(G) of uniformly continuous G-sets defined in A2.1.7 need not be bounded 
over Set; it has a generating set (namely the set of discrete quotients of G), 
but these do not have a coproduct in Unif(G) unless G has a smallest open 
subgroup. 


Remark 3.1.9 For an object B of a Set-topos E, the assertion that B is a 
bound for € over Set is easily seen to be equivalent to saying that B is a strong 
separator for the allegory Rel(€), as defined in A3.4.10. Thus one may deduce 
that the tabular allegories satisfying the hypotheses of A3.4.11 are exactly the 
categories of relations of bounded Set-toposes. Moreover, if E is a bounded Set- 
topos and B is a bound for E over Set, then we may define an allegory Ag whose 
objects are sets and whose morphisms J % J are relations in € from the J-fold 
copower of B to the J-fold copower; this allegory is positive geometric and has 
l as a strong separator, and if we split its symmetric idempotents we obtain the 
whole of Rel(€) (since every object of € is a subquotient of a copower of B}. So 
E may be recovered from Ag in the manner described in A3.4.12. (However, Ag 
will not be pre-tabular unless B x B happens to be isomorphic to a copower of B; 
if we want to make it so, we must take its objects to be all set-indexed copowers 
of finite powers of B.) The point of this observation is that it is often easier to 
give an explicit construction of the allegory Ag, and to prove that it satisfies the 
hypotheses of A3.4.12, than to construct E£ explicitly and prove directly that it 
is a topos. 

More generally, if S is any topos whatever, it can be shown that all bounded 
S-toposes can be constructed from allegories whose objects are those of S, by first 
splitting symmetric idempotents and then cutting down to the subcategory of 
maps. To prove this, it is necessary to recast the (set-theoretic) proof of A3.4.11 
in terms of S-indexed categories; we shall not go into details here. 


Bounded morphisms have properties which are reminiscent of localic mor- 
phisms in several ways. For example, given an arbitrary geometric morphism 
p: E — S and a well-supported object B of €, the full subcategory of objects 
of E which are subquotients of some p*I x B is coreflective — by essentially the 
same proof as in A4.6.3: to construct the coreflection of an object A, form 
the image factorization of the morphism obtained by pulling back the mor- 
phism displayed in 3.1.6(iii) along A — A — and is closed under finite limits 
provided it contains B x B. It is thus a topos, the largest (hyperconnected) 
quotient of E€ for which B is a bound. However, p does not in general have 
a best possible factorization through a bounded morphism; different choices 
of B will yield different coreflective subcategories of E€, and there may be no 
largest one. 

The next lemma should be compared with A4.6.2(e) and (f), of which it is 
a direct generalization. 
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Lemma 3.1.10 Let 


plost 
4 P 
S 


be a commutative triangle (up to isomorphism) of geometric morphisms. 


(i) If f and p are both bounded, then q is bounded. 
(ii) If q is bounded, then f is bounded. 


Proof (i) Since f* preserves finite products, monomorphisms and epimor- 
phisms, it follows easily from 3.1.6(ii) that if B is a bound for E over S and C is 
a bound for F over E, then f*B x C is a bound for F over S. 

(ii) It is even easier to see that any bound for F over S is also a bound for 
F over E. o 


Corollary 3.1.11 Let p: E +S be a bounded S-topos, and I an object of S. 
Then the induced geometric morphism p/I: E/p*I > S/I of A4.1.3 is also 
bounded. In fact, if B is a bound for E over S, then I*B is a bound for E/p*I 
over S/I. 


Proof Apply both halves of 3.1.10 to the commutative square 


E/I — E 
p/I p 


S/I ———> S 
where the horizontal morphisms are localic (A4.6.2(b)) and hence bounded. O 


We write BTop/S for the full sub-2-category of Top/S whose objects are 
bounded S-toposes; thanks to 3.1.10(ii), this is not an abuse of notation. How- 
ever, in the situation of 3.1.10, the boundedness of the morphism q does not 
imply that of p, even if f is surjective: 


Example 3.1.12 Let F: Set — Set be a cartesian functor, and form the topos 
GI(F) as in A2.1.12. GI(F) is cocomplete (the forgetful functor GI(F) — Set x 
Set, being comonadic by A4.2.4(c), creates arbitrary colimits) and locally small; 
so it is defined over Set (and its indexing over Set is the ‘naive’ one), and there 
is a surjection from the (clearly bounded) Set-topos Set x Set to it. We claim 
that Gl(F’) has a generating set iff F is accessible, i.e. there is a cardinal « such 
that F preserves «-filtered colimits. 
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To prove this, first suppose F preserves «-filtered colimits. Let R be a set 
containing at least one set of each cardinality less than « (for example, we could 
take R to be the power set of a set of cardinality x), and let G be the set 


{(1,0,0 > 1)}U{(A, 1,6) | A E€ R,£: 1 — F(A)} 


where 0 denotes the empty set and 1 a standard singleton set. We claim that G 
is a generating set for GI(F). For if 


(f,9): (A, B,a) —> (A’, B’,a’) 


is a monomorphism in GI(F) which is not an isomorphism, there are two cases 
to consider: 


(a) There exists a € A’ not in the image of f; then we can find a morphism 
(1,0,0 — 1) > (A’, B’, a’) not factoring through (f, 9). 

(b) There exists b € B’ not in the image of g, then a’(b) € F(A’) must be 
in the image of F(A”) — F(A’) for some subset A” C A’ of cardinality 
< «K, and if we choose A € R of the same cardinality as A”, we then have 
a morphism (A, 1,€) — (A’, B’, a’) (for a suitable £) not factoring through 


Conversely, suppose GIF} has a generating set G, and let x be a strict 
upper bound for the cardinalities of the sets which occur as the first components 
of members of G. Reversing the above argument, we find that if A is any set and 
x € F(A), then in order for us to detect the fact that 


(14,0 > 1): (A,0,0 > F(A)) —> (A,1,z) 


is not an isomorphism, x must lie in the image of F(A’) — F(A) for some subset 
A’ C A of cardinality less than «; from which it follows easily that F preserves 
«-filtered colimits. 


The question whether every cartesian functor Set — Set is accessible is 
a nontrivial one, and appears to be independent of the usual axioms of set 
theory. However, it can be shown (see [121]} that if there is a proper class 
of measurable cardinals then inaccessible cartesian functors exist. Under this 
hypothesis, therefore, we have an unbounded Set-topos which is a surjective 
image of a bounded one. 

We conclude this section by considering when the indexed topos E corre- 
sponding to an S-topos € has a coseparating family. We remark first that if E 
has a coseparating family, then it has a single coseparator (that is, a coseparating 
family living in the fibre over 1); for if G € ob £7 is a coseparating family then so 
is any object of €/ containing G as a subobject, in particular the power object 
PG; and injectivity of the latter ensures, by a standard argument, that the 
object Hy PrG (= P(X=;G)) of the fibre £} also coseparates E. Further, we have 
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Proposition 3.1.13 Let p:E — S be an S-topos. If B is a bound for E over 
S, then PB is a coseparator for the corresponding S-indezed topos E. 


Proof Consider first an object A of E 1. By assumption, we have an 
epimorphism e: p*J x B —> A for some I € ob S; and then the composite 


{} Pe 


4 ~ ~ * 
A —^ > 4 — > PA — 5> P(p*I x B) © PB” S My I*(PB) 


is monic by A2.4.7, A2.2.3(i) and A2.3.6(iii). To deal with an object of a general 
fibre £7, we argue similarly using 3.1.11. g 


The converse of 3.1.13 is false. Let C be the finite category represented 


diagrammatically by 
‘se 


then [C, Set] is not localic over Set (cf. A4.6.9); that is, 1 is not a bound for it. 
But an easy calculation (cf. [139]) shows that Q is a coseparator for |C, Set]. Sim- 
ilarly, the topos G1( F) of 3.1.12, though not (in general) bounded over Set, does 
have a coseparator, namely (2 x F(2), 2, m2: 2 x F(2) — F(2)) where 2 denotes 
a two-element set; this follows by another easy calculation from the fact that 2 
is a coseparator for Set. 


Remark 3.1.14 It should be emphasized, before we leave this topic, that the 
vast majority of geometric morphisms encountered in practice are bounded; in 
restricting our attention (as we shall do frequently hereafter) to BLop/S rather 
than Top/S, we are not really sacrificing very much. In the case S = Set, there 
are essentially only three classes of unbounded S-toposes known to us: those of 
the form [G, Set] where G is a large groupoid (but with only a set of isomorphism 
classes of objects, to ensure local smallness of the functor category), or more 
generally a large category satisfying the condition of A2.1.5; those of the form 
Unif(G), where G is a topological group having no smallest open subgroup, as in 
A2.1.7; and those of the form Gl(F’) where F is an inaccessible cartesian functor 
Set — Set, as in 3.1.12 (or, more generally, those of the form Eg where € is a 
Set-topos and G is a cartesian comonad on € whose functor part is inaccessible). 


Suggestions for further reading: Adelman & Blass [30], Blass [121], 
Borceux [139], Jibladze [497], Moens [820]. 


B3.2 Diaconescu’s Theorem 


Among the toposes defined over a given base topos S, the diagram categories 
[C, S] defined in Section B2.3 play an important rôle. In this section we investi- 
gate them in more detail: our main objective is to prove the important theorem 
of R. Diaconescu [280] characterizing the geometric morphisms over S whose 
codomain is a topos of this form. 
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First we need the analogue, over an arbitrary base, of a result noted for Set 
in 3.1.8(): 


Lemma 3.2.1 For any C in Cat(S), [C, S] is bounded as a topos over S. 


Proof Intuitively, a bound for [C, S] over S should be obtained by taking ‘the 
disjoint union of the representable functors C —> S’: by 2.5.4(c), we can identify 
the latter as the discrete opfibration Decı (C) — C of 2.5.4(6). To prove that this 
is indeed a bound, let f: F — C be an arbitrary discrete opfibration; then we 
have a diagram 


Dec; (F) F 


Dec, (C) x C* (Fo) 


of discrete opfibrations over C, where the horizontal arrow is an epimorphism 
because it is split epic in each dimension, and the vertical arrow is induced in 
dimension n by the canonical monomorphism 


Fanı = n+1 XCo Fo >> Cn+1 x Fo é 


(Note that these monomorphisms commute with all the face and degeneracy 
maps of F and C except for the face maps dy which are dropped in the passage 
from F to Dec, (F).) o 


Remark 3.2.2 IfC is an internal preorder (that is, if (dọ, d1): C1 —> Co x Co 
is monic), then we may alternatively take the terminal object of [C,S] as a 
bound, since the canonical morphism Dec: (C) — C x Co is monic. Thus, in this 
case, the geometric morphism [C,S] — S is localic — as we had already noted, 
in the case S = Set, in A4.6.2(d). 


In theory, we have two ways of indexing [C, S] over S: that defined in 2.3.12(a) 
from the canonical indexing of S over itself, and that arising from the geometric 
morphism [C, S] — S. Fortunately, it is not hard to show that these two indexed 
categories are isomorphic, since we have 


[C, S]/C*I = doFib(S)/C x I = S&*! 


for any object I of S. 

We now introduce the concept of torsor, which will play a fundamental rôle 
in the main theorem of this section. We recall that if C is an internal cate- 
gory in a topos S, and p: E — S is an S-topos, then diagrams of shape C in 
the S-indexed topos E correspond by 2.3.14 to diagrams of shape p*(C) in the 
canonical indexing of E over itself, and hence to discrete opfibrations over p*(C) 
in cat(€). 
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Definition 3.2.3 Let C be an internal category in a topos S, and p: E + S an 
S-topos. By a torsor over C (or C-torsor) in E, we mean an S-indexed functor 
F: C? — E (that is, a diagram of shape C°P in E) such that the domain of the 
corresponding discrete fibration F — p*(C) is filtered (as defined in 2.6.2). We 
write Tors(C,€) for the full subcategory of [C°?, €] whose objects are torsors 
over C. 


The term ‘torsor’ is commonly used for the special case of this concept where 
the underlying category C is a group (see 3.2.4(6) below); over a general C, the 
usual name for the concept is ‘flat presheaf’. However, there seems to be no good 
reason for perpetuating this distinction. 


Examples 3.2.4 (a) Let C be a discrete internal category. Then the domain 
F of any discrete fibration F — p*(C) is again discrete (cf. 2.5.4(a)); hence it 
is filtered iff the unique morphism Fo — 1 is an isomorphism. Thus Tors(C, £) 
is isomorphic to the discrete category whose objects are the morphisms 1 — 
p*(Cpo) in £. 

(b) Let G be an internal group in S, regarded in the usual way as (the object 
of morphisms of) an internal groupoid G with object of objects 1. Then a diagram 
of shape G°P in € is a right G-object, i.e. an object F equipped with a unitary, 
associative action map ¢: F x p*G — F. Using the fact that the morphisms of 
G are all invertible, it is not hard to show that the corresponding total category 
F satisfies the second (resp. third) condition of 2.6.2(a) iff the G-action on F 
is transitive (resp. effective), i.e. iff the morphism (71, ¢): F x p*G — F x F is 
epic (resp. monic.) Thus a G-torsor is a right G-object F for which F — 1 
is epic and (71,@) is an isomorphism. Equivalently, it is a G-object F locally 
isomorphic to p*G itself: that is, one for which there exists A — 1 in E and 
an isomorphism A*(F) = A*(p*G) of right G-objects in €/A. For if F is a torsor 
as defined above, then we can take A to be F itself; conversely, if F satisfies 
the second definition, then A*(F — 1) is epic and A*(7,,¢) is an isomorphism 
(since both these assertions are true for A*(p*G)) and A* reflects epimorphisms 
and isomorphisms. 

G-torsors arise naturally in cohomology theory, as ‘obstructions’ to the split- 
ting of epimorphisms; we shall see a good deal more of them in Part E. Note, 
incidentally, that if F and F’ are G-torsors in €, then any G-equivariant map 
F — F' is an isomorphism; this may be proved directly from our original de$ 
inition, but it also follows easily from the second one and the fact that the 
G-equivariant maps p*G — p*G (the ‘left translations’ by elements of p*G) are 
all isomorphisms. Thus Tors(G, E) is a groupoid. 

(c) For an example of torsors over a monoid which is not a group, we consider 
the two-element monoid M = {1,e} with e? = e. We may identify M with an 
internal monoid M in any topos, by the method of 2.3.12(6); for simplicity, 
we shall discuss its torsors only in the case € = S = Set, but readers who are 
already familiar with the interpretation of coherent logic in toposes, as described 
in Section D1.3, will be able to verify that the description which we give is valid 
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in any topos. An M-set is, of course, just a set A equipped with an idempotent 
endomorphism e: A — A (there is no need to distinguish between left and right 
M-sets, since M is commutative; but we shall write the action on the right, as 
at+a-e, for consistency); what does it mean to say that (A,e) is an M-torsor? 
The first condition of 2.6.2(a) simply says that A is nonempty. The second 
says that, given any two elements a,b of A, there exists an element c and a pair 
of morphisms f,g of M such that a = c- f and b = c - g. But there are only four 
possibilities for the pair (f,g), two of which have f = g and so imply a = b. Also, 
since ef = e for all morphisms f of M, we see that if both a and b satisfy a = a-e 
and 6 = b- e then we must have a = b; thus there is at most one element in the 
image of e, and since A is nonempty there must be exactly one such element, 
ao say. Now, for a general pair of elements (a,b), the two possibilities for the 
pair (f,g) which have f 4 g yield either a = ap or b = ao. It turns out that the 
third condition of 2.6.2(a) is automatically satisfied by any M-set; so we deduce 
that an M-torsor may be identified with a set A equipped with a distinguished 
element ao (the image of the idempotent) and satisfying the coherent axiom 


(T Fz,y (£ = ao) V (y = ao) V (£ = y)). 


A morphism of M-torsors is simply a map preserving the distinguished elements. 
As mentioned earlier, the description just given is valid for M-torsors in any 
topos £; we remark that the displayed axiom implies that the classifying map 
A — Q of ap: 1 >> A is monic, and so we may further identify M-torsors in 
E with subobjects of Q which contain the generic subobject, T and satisfy the 
displayed axiom (with ag taken to be T). (We shall meet an important example 
of an M-torsor in D4.5.12.) 

(d) Suppose C is an internal preorder in S. Then the second condition of 
2.6.2(a) implies that, if F is any C-torsor in £, the canonical map Fo —> p* (Co) 
is monic. Using this, it is easy to see that C-torsors may be identified with what 
are commonly called ideals of p*(C), that is subobjects of p*(Co) which are 
downwards closed in the ordering and (upwards) directed. 

(e) For any internal category C in S, the (contravariant) décalage 
Dec!(C) introduced in 2.5.4(b) comes equipped with a discrete fibration 
Dec!(C) — Co x C, which makes it into a C-torsor in S/Co; for, when regarded 
as an internal category in S/Cp, Dec!(C) has a terminal object as defined at the 
beginning of Section B2.6, given by the pullback square 


d ———> lo, 


in S/Co.- 
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(f) Let f: (F,q) — (E€, p) be a geometric morphism over S. Since f* is a reg- 
ular functor, it preserves discrete fibrations and filtered categories; so it induces 
a functor Tors(C,&) — Tors(C, F) for any internal category C in S. There is 
a converse result if f is a surjection (i.e. f* is conservative): if y: F — p*(C) is 
any discrete fibration in € such that f* (y) corresponds to a C-torsor in F, then 
~y corresponds to a C-torsor in E. As a particular case of this last observation, 
we note that the Yoneda profunctor Y(C) of 2.7.2, when regarded as a diagram 
of shape CP in [C, S], is always a C-torsor; for its image under the forgetful 
functor [C, S] + S/Cp is the torsor corresponding to Dec! (C). 


In the case E = S = Set, the definition of a torsor may be made more explicit: 
a functor F: C°P — Set is a torsor iff, given any finite diagram D: J — C and 
any family of elements z; € F(D(j)) (j € ob J) which are compatible in the sense 
that F(D(a))(z;) = z; for alla: j — j’ in J, there exists a cone (f;: D(j) +C | 
j € ob J) under D and an element z € F(C) such that F(f;)(z) = z; for all j. 
(Of course, it suffices to verify this condition for the special cases when J is the 
empty category, the discrete category with two objects and the ‘parallel-pair’ 
category.) Clearly, in the case when C is cocartesian, this condition will hold 
provided F maps finite colimits in C to limits in Set. But the latter condition 
is also necessary, and in a more general context than Set: 


Lemma 3.2.5 Let C be a cocartesian internal category in a topos S (i.e. 
one such that the corresponding S-indexed category [C] is cocartesian), and let 
p: E — S be an S-topos. Then an S-indezed functor F: [C] — E is a C-torsor 
iff it is cartesian. 


Proof We observe first that if F is a (contravariant!) diagram of the form R(z) 
for a morphism x: 1 — p* (Co), then for any object J of S and any y: I + Co € 
ob [C], F! (y) = p.(J), where J is the object of €/p*I indexing morphisms 
p*(y) — z in the locally small fibration [p*(C)], as we defined it in 1.3.12. It is 
thus immediate from the definition of colimits in [C] that F transforms them to 
limits. (Note that diagrams of the form R(x) are indeed C-torsors, since they are 
images of Dec!(C) under inverse image functors S/Co > E.) Now 2.5.7 tells us 
that a general diagram F of shape p*(C°?) can be expressed as limg(G), where 
+: F — p*(C). is the discrete fibration corresponding to F and G is a diagram of 
shape F in [p*(C)°?, E} (equivalently, a diagram of shape p*(C)°? in [F, €]) whose 
underlying diagram in €/Fo is of the form R(7o). It follows that the S-indexed 
functor corresponding to F can be factored as 


G limp 
[c]P° ——> [F, E —— E; 


and the first factor is cartesian since the forgetful functor to E/Fp creates limits, 
and the second is cartesian by 2.6.8. 

For the converse, we first consider the case when € = S and p is the iden- 
tity. Let y: F — C be the discrete fibration corresponding to F: by definition, a 
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morphism I — Fp corresponds to a morphism c: I — Cp together with a mor- 
phism z: 1; — F/(c) in S/I. Taking c to be the initial object of C and z to 
be the unique morphism 1 — F!(c) & 1, we deduce that there exists a mor- 
phism 1 — Fo, so Fy — 1 is (split) epic. Next, given two morphisms I 3 Fo 
corresponding to c,d: I = Co and z: 1; — F! (c), y: Lz — F! (d), we have a mor- 
phism c+ d: I — Co corresponding to the coproduct of c and d in {C]/ and a 
morphism (x,y): lr — F’ (c +d) & F! (c) x F? (d), yielding a third morphism 
I — Fo, which is clearly the codomain in [F]? of a pair of morphisms whose 
domains are the two objects of [F]/ from which we started. Applying this argu- 
ment to the two projections when I = Fo x Fp yields a splitting for the morphism 
Pp — Fo x Fo of 2.6.2(a). And a similar argument using the existence of coequal- 
izers in [C] and the fact that F maps them to equalizers will yield a splitting 
for the morphism Ty — Rr of 2.6.2(a). So all three of the morphisms in the 
definition of filteredness for F are split epic. 

Now consider the general case. The assertion ‘C is cocartesian’ is clearly 
equivalent to the existence of certain morphisms satisfying certain equations 
between objects constructed from C using finite limits (for example, the existence 
of binary coproducts is equivalent to the existence of a morphism Co x Co —> Pe 
which constructs the coproduct cones, together with a morphism with domain 
Pe which constructs the factorization of an arbitrary cone through the colimiting 
one); so it is preserved by cartesian functors, and in particular by the functor p*. 
Thus the proof already given, if interpreted in € rather than in S, shows that 
F is a C-torsor if the €-indexed functor F’: [p*(C)|°? — E which corresponds to 
F under the equivalence of 2.3.14 is cartesian. At first sight, this condition is 
stronger than saying ‘F is cartesian’: the latter is tantamount to saying that F'7 
is a cartesian functor for all objects J of € which happen to be of the form p*(J), 
but not for arbitrary objects of €. However, because the ‘generic’ cocartesian 
structure of p*(C) lies in fibres of this kind (for example, any pair of objects of 
[p*(C)]” is obtained by applying an appropriate transition functor of [p*(C)] to 
the generic pair (1,72) in [p*(C)]? (Co*°o), and an indexed functor which sends 
the coproduct of this pair to a product will map arbitrary binary coproducts in 
{p*(C)] to products), the two conditions (on the S-indexed functor F and the 
€-indexed functor F’) are in fact equivalent. E 


Thus we may think of torsors as ‘generalized cartesian functors’; informally, 
they are functors [C]°P — E which ‘preserve all finite limits, even when they 
don’t exist in [C]°?’. 


Remark 3.2.6 If our base topos S has a natural number object, then for any 
internal category C in S we may construct the free cocartesian category generated 
by C, as an internal category. We saw how to construct the free category with 
finite coproducts generated by C, as an indexed category, in 1.4.17, and in 2.2.4 
we saw that this construction preserves local smallness; but it also preserves 
essential smallness, since if the objects of C are indexed by Cp it is easy to see that 
those of Fam;(C) may be indexed by the list object L(Co) (cf. A2.5.17). Having 
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thus adjoined finite coproducts to our category, it suffices to give a construction 
for freely adjoining reflexive coequalizers to a category with finite coproducts; 
but there is a standard construction (due to A. M. Pitts, cf. [193]) for doing 
this in the ‘external’ context, by forming a category whose objects are reflexive 
pairs in the given category, and it is easy to see that this may be carried out in the 
internal logic of a topos. Thus we have a functor C ++ CoCart(C) from Cat(S) 
to itself, which sends C to the free cocartesian category which it generates. 

If now C ~ CoCart(D) for some internal category D, then 3.2.5, plus the 
freeness of the construction, tell us that C-torsors in an S-topos € correspond 
to arbitrary S-indexed functors [D°P] — E, i.e. to diagrams of shape D? in E. 


We are now ready to prove the main theorem of this section. 


Theorem 3.2.7 (Diaconescu’s Theorem) Let S be a topos, C an internal 
category in S and p: E — S a geometric morphism. Then there is an equivalence 
of categories 


Top/S (E, [C, S]) ~ Tors(C, £), 


which is natural in E in the sense that, if g: F —E is a geometric morphism 
over S, then the diagram 


Top/5 (E, 1C, S) ES Top/s (F, 1C, 8) 


^ 
? 


Tors(C, £) aeo Tors(C, F) 


commutes up to (coherent) natural isomorphism. 


Proof One direction is immediate from 3.2.4( f): given a geometric morphism 
f: E — |C, S] over S, we define ®(f) to be the C-torsor f*(Y(C)) in E. It is 
clear that this yields a functor ©: Top/S (E, [C, S]) — Tors(C, €), and that © is 
natural in € in the sense defined above. 

Conversely, suppose given a C-torsor F in €. Consider the composite 


(p°)* y limp 
[C, S] ——> [p*(C), €] —> F, £] —— €, 


where (p©)* denotes p* applied to discrete opfibrations over C, and y: F — p* (C) 
is the discrete fibration corresponding to F. We note that (p©)* is cartesian and 
has a right adjoint (p©),, the latter being obtained by applying p, to discrete 
opfibrations over p*(C) and then pulling back along the unit C —> p,p*(C). So it 
is the inverse image of a geometric morphism; and so are the other two factors of 
the composite above, by 2.3.22 and 2.6.9 respectively. Moreover, they are both 
inverse images of geometric morphisms over E; and since it is easy to verify that 
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the diagram 
pe 
[p*(C), €] FAF [C, S] 


fa Ss 


commutes, it follows that the composite geometric morphism V(F') whose inverse 

image is displayed above is a morphism over S. We note further that the com- 

posite limpy*: [p*(C),E] > E may also be written as F @,-(c) (—), where we 

regard F as a profunctor p*(C) % 1 in €, by 2.7.9; from this it follows easily 

that the construction F + W(F) is a functor Tors(C,€) — Top/S (E, [C, S}). 
Now, given a torsor F, we have 


OV(F) = F pcc) (W°) (Y (C)) = F@Y(p*(C)) = F 


by 2.7.3. Similarly, given a geometric morphism f: E — |C, S] over S and an 
object G of [C, S], we have 


(WO(F))*(G) = F* (Y (€)) 870) (P®)* (G) 
= f*(Y(C)) 8r (F (E (G) 
= f*(Y (€) @c G) = f*(G) 


where we have temporarily written q for the name of the geometric morphism 
(C, S] — S, and used the fact that f* preserves tensor products of profunctors. 
Clearly, these isomorphisms are natural in G and in f; hence W is a two-sided 
inverse for ® up to natural isomorphism. 0 


Remarks 3.2.8 (a) We recall that any S-topos is S-cocomplete, locally small 
and well-copowered, and that [C,S] also has an S-indexed separating family 
(3.2.1), so that any S-cocontinuous indexed functor [C, S] — E has an indexed 
right adjoint by 2.4.6. Thus, if C is cocartesian, we may regard Diaconescu’s 
Theorem as expressing the fact that [C, S] is the ‘free S-cocompletion’ of {C]°?, 
not only in the 2-category Cats of S-indexed categories (cf. 2.5.8), but also in the 
2-category Carts of cartesian S-indexed categories and functors: that is, compo- 
sition with the Yoneda embedding C°? — fC, S] induces an equivalence between 
cocontinuous cartesian S-indexed functors [C,S] — E and cartesian functors 
CoP — E. We shall explore this point of view further in Section B4.5 below. 

(b) Combining 3.2.7 and 3.2.4(a), we deduce that for any object I of S and 
any S-topos p: E > S, Top/S(E,S/I) is equivalent to the discrete category 
whose objects are the morphisms 1 — p*I in £. Specializing to the case when € 
is itself of the form S/J for some J, we obtain the information that the functor 
S — Zop/S of A4.1.2, which sends I to S/I, is a (2-categorical) full embedding. 

(c) On the other hand, the functor Cat(S) — Top/S of 2.3.22, which sends C 
to [C, S], is not full. Using 3.2.7, we may characterize the geometric morphisms 
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[C, S] — [D,S] over S: they correspond to D-torsors in [C,S], or equivalently 
to profunctors T: D + C which are ‘torsors for the right action of D’, i.e. such 
that the functor T p (—): Profs(E,D) — Profs(E, C) is cartesian for any E. 
(This condition may of course be expressed in elementary terms, as the assertion 
that the internal category in S/Co derived by the Grothendieck construction 
from the right action of D on (To — Co x Do) is filtered.) Clearly, the class of 
such profunctors is closed under composition, so the right torsors are the 1-cells 
of a locally full sub-bicategory r'Torss of Profs, which is dually equivalent to 
the full sub-2-category DTop/S of Top/S whose objects are the toposes [C, S]. 
And this equivalence identifies the functor Cat(S) — DTop/S of 2.3.22 with 
the functor Cat(S) — rTorsY which sends an internal category C to itself and 
an internal functor f to f° (cf. 2.7.4; it follows from the dual of 2.7.4(i) that 
profunctors of the form f° are right torsors). We note that this functor is locally 
full and faithful, by 2.7.4(ii), and hence so is the functor Cat(S) — DTop/S; 
that is, for any two internal categories C and D, the functor 


€at(S) (C, D) —> Top/S ([C, S), [D, S)) 
is full and faithful. 


We next discuss two important examples of the application of Diaconescu’s 
Theorem, in which we shall presume familiarity with material from Chapter D5 
(specifically, from Section D5.2 for the first example and from D5.4 for the 
second). The reason for placing these examples here is that they are both of fun- 
damental importance in the theory of classifying toposes which we shall expound 
in Section B4.2. 


Example 3.2.9 For the first one, we suppose our base topos S has a natural 
number object N: let C denote the generic finite cardinal in S/N (cf. A2.5.14) 
and let Ss be the internal full subcategory S[C] as defined in 2.3.5. Thus the 
object of objects of Sy is N, and its object of morphisms is (the domain of) 
the exponential 73(C)™1(©) in S/N x N. Since inverse image functors preserve 
natural number objects (A2.5.6(i)) and commute with exponentiation to the 
power of a finite cardinal (D5.2.11), we note that we have p*(S;) = Ey for any 
S-topos p: € ~> S. Thus, once we have identified the S-torsors in S, for an 
arbitrary topos S, we shall have identified the category Top/S (E,[S;,S]) for an 
arbitrary S-topos €. Now the indexed category corresponding to Sy is the full 
subcategory of S whose J-indexed families of objects are the finite cardinals in 
S/I; it follows from D5.2.7 that this category is cocartesian, and so by 3.2.5 
S;-torsors correspond to cartesian S-indexed functors SP — S. 

Now S; is the free category with finite S-indexed coproducts generated by 
the terminal category 1 (cf. 1.4.17); but it is in fact the free cocartesian category 
generated by 1 — this can be shown using the facts that it already has coequalizers 
(D5.2.5) and its objects are (internally) projective (D5.2.9(ii)). Hence cartesian 
functors SP — S are equivalent to arbitrary functors 1 — S, i.e. to objects of S. 
We may also derive this equivalence more explicitly, as follows. 
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If A is any object of S, then it is clear that the S-indexed functor which sends 
a cardinal [p] in S/I to (I* A)! is cartesian. On the other hand, if B > N is 
the object of S/N underlying an S;-torsor, then preservation of finite products 
forces 0*(B) = 1 and s*(B) = A x B, where A = (so)*(B); so by the uniqueness 
theorem D5.1.9 we have B = LA, where LA is the list object over A (cf. A2.5.17). 
Hence every cartesian functor S? — S is of the form [p] +> (I*A) Pl, Moreover, 
an S-indexed natural transformation between two functors of this kind, corre- 
sponding to objects A and A’, is necessarily induced by a (unique) morphism 
A — A’ in S, as may be seen by applying it to the element of (A*A)!**! which 
corresponds to the diagonal map A — A x A. So we deduce that Tors(S;, S) is 
equivalent to S itself, and hence for any S-topos E€, we have 


Top/S (E, (Ss, S]) ~ E; 


that is, (Sz, S] is a representing object for the forgetful functor Top/S°? — CAT 
which sends a topos to tts underlying category, and a geometric morphism to its 
inverse image functor. 


Example 3.2.10 Let I be an object of S, and let K(J) denote the object of 
K-finite subobjects of I, as constructed in D5.4.1. Since K (JI) is a subobject of 
the power object PI, it carries a canonical partial ordering, i.e. we may regard 
it as the object of objects of an internal poset K(I). Once again, it follows 
from D5.4.12 that we have p*(K(I)) ~ K(p*(J)) for any geometric morphism 
p: E — S, and so to identify the geometric morphisms over S into [K(J), S] it 
suffices to identify the K(J)-torsors in S itself. Also, K(J) is cocartesian (since 
it is a join-semilattice), so these torsors correspond to cartesian S-indexed func- 
tors K(I)°? — S. Since K(J) is a poset, any such functor must take values in 
the subfibration Sub of subterminal objects of S (cf. 1.2.2(g)); but since S is a 
topos, the latter is equivalent to [Q] where © is the internal poset (Q1 3 Q) 
(cf. 2.3.8(a)). Thus, by 2.3.3, K(J)-torsors correspond to internal cartesian func- 
tors (that is, meet-semilattice homomorphisms) K(I)°? — Q; but K(J) is the 
free join-semilattice generated by I (D5.4.9), and so these in turn correspond 
to arbitrary morphisms I — Q, that is to subobjects of J. It is also straight- 
forward to verify, by chasing through the above equivalences, that morphisms 
of K(J)-torsors correspond to inclusions between subobjects of J. Thus we have 
shown that for any object I of S, we have an equivalence 


Zop/S (E, |IK(I), S]) ~ Sube (p* (T)) 


which is natural in E. 


Remark 3.2.11 As a particular case of 3.2.10, we note that the functor cat- 
egory [2, S] (where 2 denotes the finite category (e — e), identified with an 
internal poset in S by the method of 2.3.12(b)) is a classifying topos for subter- 
minal objects in S-toposes, since K(1) = 1 H 1 by D5.4.4(i). This category is 
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often called the Sierpiński topos over S: in the case S = Set, it may be identi- 
fied with the topos of sheaves on the Sierpiński space S, i.e. the two-point space 
{0,1} with {1} (but not {0}) open. 

In fact, in this case the equivalence Top/S (E,[2,S]) ~ Sube(1) may easily 
be verified without appealing to Diaconescu’s Theorem. For [2,5] is the topos 
obtained by glueing along the identity functor S — S (cf. A2.1.12), and so by 
A4.5.6 it has complementary open and closed subtoposes, both isomorphic to S. 
Hence a geometric morphism E — [2, S] induces, by pullback, a corresponding 
decomposition of € into complementary open and closed subtoposes. Conversely, 
given an S-topos p: E — S equipped with a distinguished subterminal object U, 
we may identify € with a topos obtained by glueing as in A4.5.6, and then ‘glue 
together’ the morphisms shau) (£) > E > S and sh,y)(E) > E > S into a 
single morphism € — [2,5]. We leave to the reader the verification that these 
two constructions are inverse to each other up to isomorphism. 


An important application of Diaconescu’s Theorem is the result that geomet- 
ric morphisms of the form [C, S] — S have pullbacks in Yop. At first sight, this 
appears to be a straightforward consequence of the theorem: 


Corollary 3.2.12 LetC be an internal category in a topos S, andp:E—-S a 
geometric morphism. Then the diagram 


pe 
[p*(C), €] — [C, s] 


E ——— S 
constructed in the proof of 3.2.7 is a product diagram in Yop/S. 


Proof For any S-topos q: F — S, pairs of geometric morphisms f: F > €, 
g: F — [C, S] over S correspond, up to equivalence of categories, to pairs (f, G) 
where G is a torsor over C (that is, over g*(C) = f*p*(C)) in F. But these in 
turn correspond to pairs (f, h), where h is a geometric morphism F —> [p*(C), €] 
over E. 


As a special case of 3.2.12, we note 


Corollary 3.2.13 The functor Cat(S) —> Top/S of 2.3.22 preserves finite 
products. 


Proof It clearly preserves the terminal object; to prove that it preserves binary 
products, take € to be a topos of the form [D, S] in 3.2.12. Then [p*(C), €] is 
equivalent to [C x D, S], since the objects of both categories may be identified 
with discrete opfibrations over C x D by 2.5.2 and 2.5.3. 0 
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However, if we wish to prove that the square in the statement of 3.2.12 is a 
(2-categorical) pullback in Yop, and not merely in Zop/S, we have to consider 
non-invertible geometric transformations between geometric morphisms into S, 
and such 2-cells do not live in the 2-category Yop/S. To get round this difficulty, 
we need to borrow a result from Section B3.4 (see 3.4.2 below): namely, that the 
2-category Top admits tensors with 2. That is, for any topos E there is a topos 
2@€ (in fact it is the Sierpiński topos [2, E], cf. 3.2.11) and a natural equivalence 


Top (2@E,F) ~ (2, Top (E£, F)| 


for all toposes F. 


Corollary 3.2.14 For any geometric morphism p: E —> S and any internal 
category C in S, the square in the statement of 3.2.12 is a pullback in Yop. 


Proof Given a topos F and a pair of geometric morphisms f: F — €, 
g: F —|C,S] satisfying pf = ‘Cg, we obtain a geometric morphism 
h: F — [p*(C), E] as in 3.2.12. Given two such pairs (fo, go) and (f1, 91) together 
with geometric transformations a: fọ —> fı and 3: go — gı satisfying p o a = 
"Co, we may regard the data as a compatible pair of morphisms from 2 ® F 
to the diagram, and hence obtain a morphism 2@F — [p* (C), E], or equivalently 
a geometric transformation between the two morphisms ho, hi: F 3 [p*(C), €]. 
It is now straightforward to verify that these constructions yield a functor 


Top (F £) X Top (F,S) Top (F, [C, S]) => Top (F, [p*(C), €]) 


which is inverse up to natural isomorphism to the functor obtained by composing 
with the left and top edges of the square of 3.2.12. 0 


Suggestion for further reading: Diaconescu [280]. 


B3.3 Giraud’s Theorem 


In this section we shall prove the fundamental representation theorem for 
bounded S-toposes (again due to R. Diaconescu {280]), which says that a 
geometric morphism p: E — S is bounded iff it can be factored as a composite 


622 S10 Ss sg 


where C is an internal category in S and i is a geometric inclusion. Bearing in 
mind the identification of inclusions E — F with local operators on F, estab- 
lished in Section A4.4, this is tantamount to saying that any bounded S-topos 
E is equivalent to the topos of ‘S-valued sheaves’ on an ‘internal site’ in S. 
The representation theorem is commonly known as Giraud’s Theorem, because 
J. Giraud [405] proved a representation theorem of this kind for the case S = Set; 
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but it is not really the S-indexed analogue of Giraud’s original theorem, because 
Giraud did not have the elementary concept of topos available, and therefore 
sought a characterization of categories of sheaves on sites amongst arbitrary 
categories, rather than amongst toposes defined over Set. (In modern termi- 
nology, his theorem is essentially the equivalence of (ii) and (vii) in Theorem 
C2.2.8 — whereas the theorem we are about to prove is (the analogue over an 
arbitrary base of) the equivalence of (ii) and (v).) We shall return to this topic 
at the end of the section, and show how a theorem more like Giraud’s original 
may be extracted from the one we have proved here. 

For now, let p:E€— S be a bounded geometric morphism, and let 
(g: G — p*I) € ob E! (for some I € ob S) be a separating family for the 
S-indexed category E. We take our internal category C to be the opposite 
of the internal full subcategory E[gj, defined as in 2.3.5: explicitly, Co = J, 
and the object of morphisms and domain and codomain maps of C are given by 
the pullback 


Ci —————> p, E 
(do, di) (DG, pxb) 


n 
Co x Co —> pap" I x pąp™I 


where (a,b): E — p*I x p*I is the exponential 73(g)"19 in £/p*I x p*I, and ņ 
is the unit of (p* 4 p,). Then, by 2.3.13, the S-indexed full embedding [C]°? — E 
corresponds to an internal diagram F of shape p* (C)? in E: explicitly, (Fo — Co) 
is g: G— p*J, and the right action of p*C on Fo is given as follows. We have a 
commutative diagram 


* * * € 
Fo Xpo PCi ——> p*C, ———> p*p, E ——> E 


do p*p.a a 


G ———> p* I ——> p*p,p" I sius pl 


(where € is the counit of (p* 4 p,)), and hence a morphism Fo Xp*cy p*C1 > 
G xp-; E. But the codomain of this morphism is (the domain of) the product 
m*(g) x 13(g)™ 9 in E/p*I x p*I, and so we have an evaluation map from here to 
73(g), which we compose with the projection Lp+7xp-1(73(g)) = p*I x GG. 


Lemma 3.3.1 The diagram F just defined is a C-torsor in E. 


Proof Applying 2.4.3 to the terminal object of €!, we obtain a morphism 
xz: J — I in S and an epimorphism L ;x*(g) > 1 in E; but the latter factors 
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through G — 1, and so it too must be epic. Thus the first condition of 2.6.2(a) 
is verified for F. 

Next, we apply 2.4.3 to the object (g x g: G x G ~ p*I x p*I) of ETSI, We 
obtain a commutative diagram of the form 


154 
ge 8g OO) exe 
9 h 9X9 
p*Yy3 (p* 41, P* y2) 


yle ak ep Pel 


where the left-hand square is a pullback. We shall show that the epimorphism 
(q1, 92) factors through the morphism Py — Fo x Fo = G x G of 2.6.2(a), so that 
the latter is also epic. 
Now qı induces a morphism y3(g) — y*(g) in €*, and hence a morphism 
z: K — Cı in S satisfying dọz = y3 and d,z = yı. And this in turn induces a 
morphism 
rı = (q3, p” (z)h): H —> Fo Xp co Va = F 


in € satisfying dor, = q3 and dir) = qı. Similarly, we obtain rọ: H — F; satis- 
fying dor2 = q3 and dir2 = qo; so the pair (rı, r2) is the required factorization 
of (91,92) through Py. 

For the third condition of 2.6.2(a), we similarly start by applying 2.4.3 to 
the object (Rr — p*(Rc)) of €%, where (as in 2.6.1(c)) R denotes the ‘object 
of parallel pairs’ functor. We obtain a diagram 


83 
G «——— B 


Rf 


g b 
w3 u š 
p I —— p L — > Re 


where, once again, the left-hand square is a pullback. We shall write wı and 
we for the composites of u with the ‘domain’ and ‘codomain’ morphisms Re 3 
Co = I, and similarly sı and s2 for the composites of t with Re 3 Fo = G. 
Now u corresponds to a parallel pair of morphisms w3(g) = wł(g) in E}, and 
t corresponds to a morphism from w3(g) to the equalizer of this pair. Thus we 
have a diagram 


w3(9) ——> 3(9) ——S wi (9) 
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of shape T°? (where T is the finite category displayed in 2.6.1(e)) in E}, which 
corresponds to a morphism v: L — Te such that 


ps2 SF 


Re 


commutes. Now the pair (s3, p*(v)b): B — Fo Xp*co p*Tc = Tr similarly yields 
a factorization of t through Ty — Rg; so the latter must be epic. 


Remark 3.3.2 The proof of 3.3.1 may be somewhat simplified if we assume 
that our separating family is ‘closed under subobjects’ in the sense that, given 
any x: J — I and any subobject S — x*G, there exists a morphism y: J > I 


such that 
S ———> G 


pd PEE Ae pI 
is a pullback. We note that this condition is always satisfied by the separating 
family constructed from a bound for € over S as in the proof of 3.1.6(ii) > (i). 
(For an account of the proof in this particular case, see [504].) 

Still further simplification is possible when the base topos S has a natural 
number object: for in this case we may assume that our separating family is 
closed under finite products as well as subobjects, by the device of replacing 
a separating family (G — p*I) closed under subobjects by the family (LG — 
L(p*I) = p*(LI)) where LG is the list object over G as constructed in A2.5.17. 
Then we see that our internal full subcategory C°P has S-indexed finite limits 
and the inclusion functor C°? — E is cartesian; so the result of 3.3.1 follows from 
3.2.5. The same argument may be employed, even if S does not have a natural 
number object, in the case when p is localic (cf. 3.3.5(iii) below). 


However, for the next step in the argument there seems to be no substitute 
for hard calculation. 


Lemma 3.3.3 The geometric morphism i: E — |C, S] which corresponds to the 
torsor F of 3.3.1 is an inclusion. 


Proof We first observe that the direct image functor i, may be described as 
follows: given an object A of £, we form the exponential (I*A)? in €/, apply p» 
to it and pull it back along the unit J — p,p*J to obtain an object (Bo — J) of 
S/I. It is easy to see that this object admits a left action of C induced by the 
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composition map 13g™19 x ((I x I)*A)™9 — ((I x I) A)™9 in €7*!, yielding 
a diagram B of shape C which we take to be 7,(A). For, given an arbitrary 
diagram D of shape C, morphisms Do — Bo over J correspond to morphisms 
G Xp*1 p*Do — A in E, and it is straightforward to verify that a morphism 
Do — Bo is equivariant for the left C-actions iff the corresponding morphism 
G Xp*1 p* Do — A factors through the coequalizer G xp; p* Do —> F @ +c p* D 
=c" D: 

We now show that i, is faithful. Suppose given a parallel pair f, f’: A= A’ 
in € with f # f’. Then we can find x: J — I in S and a morphism h: z*g — J*A 
in E7 with (J*f)h Z (J* f')h. But morphisms z*g — J*A in E£” correspond to 
morphisms J — Bg over J (where Bo is defined as above), and so we have such 
a morphism whose composites with i, f and i, f’ are different; thus i, f Æ if’. 

To show that i, is full as well as faithful, we observe that a morphism 
f:A— A in [C, S] induces, for every x: J — I and every u: Xyx*(g) > A, a 
morphism f(u): £,a*(g) — A’, in a manner which is ‘natural’ with respect to 
morphisms between pullbacks of g. Now we may use 2.4.3 to construct a diagram 


a 
Me Ro ar OE N. 


b 
m h 
pz Ty 
pK —> p* J x pJ Ss pJ 
T2 
p*y p*r 
pl pvt 


where (a,b) is the kernel-pair of u, and h and m are the pullbacks of g along 
p*x and p*y respectively. The argument above yields a morphism f(u): H — A’, 
and the naturality condition tells us that this morphism has equal composites 
with av and bv, so it factors through their coequalizer, which is u. Thus we 
obtain a morphism f: A — A’; and a further application of the same technique 
shows that, for any u’: Dy2'*(g) > A, f(u’) is the composite fu’, and hence 


l= f. o 
Combining 3.3.1 and 3.3.3, we have proved 


Theorem 3.3.4 (Giraud’s Theorem) For a geometric morphism p: E > S, 
the following are equivalent: 

(i) p is bounded. 

(ii) There exists an internal category C in S and an inclusion E — |C, S] in 


Top/S. 
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Proof (i) > (ii) follows directly from 3.3.1 and 3.3.3. For the converse, we use 
the fact that [C, S] is bounded over S by 3.2.1, together with 2.4.2(d). o 


It is of course possible to introduce a notion of ‘internal coverage’ on an 
internal category C, corresponding to the external notion introduced in A2.1.9, 
and to say what it means for a diagram of shape C°? to be a sheaf for such a 
coverage. Then one may prove that every subtopos of a diagram topos [C°?, S] is 
the category of T-sheaves for a suitable coverage T, and thus deduce from 3.3.4 
that every bounded S-topos is of this form. However, we shall find it convenient 
to postpone consideration of the notion of internal coverage until Section C2.4. 

As a particular case of Giraud’s Theorem, we note 


Corollary 3.3.5 For a geometric morphism p:E—S, the following are 
equivalent: 


(i) p is localic. 
(ii) There exists an internal preorder P in S and an inclusion E — [P,S] of 
S-toposes. 


(iii) There exists an internal complete Heyting algebra H in S and an inclusion 
E — [H°?, S] of S-toposes. 


Proof (ii) > (i) follows from 3.2.2 and the facts, noted in A4.6.2(a) and (e), 
that inclusions are localic and composites of localic morphisms are localic. 

(iii) > (ii) is trivial. 

(i) = (iii): If 1 is a bound for E over S, then by 3.1.6 we obtain a separating 
family for E by taking the left vertical map in the pullback 


H ———_> 1 
T 


* ER 
ppe (N) ——> 2 


considered as an object of €?-(@). It is then easy to see that the internal category 
C constructed from this separating family as above is simply p,({2)°?, where Q is 
the internal poset constructed from the subobject classifier of € as in A1.6.3. But 
the latter is an internal Heyting algebra by A1.6.3, and €-complete by 2.3.8(a); 
and both these properties are preserved by the functor p,. O 


We note that the particular case of 3.3.5 where S = Set appears, with a 
different proof, as C1.4.7. 

One of the most important applications of Giraud’s Theorem is to the 
construction of pullbacks in Yop: 


Proposition 3.3.6 Letp:E— S andq: F —S be geometric morphisms with 
common codomain, such that p is bounded (resp. localic). Then there erists a 
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pullback square 


G EAS E 
q 
F — s 


in Top, in which g is bounded (resp. localic). 
Proof By 3.3.4 (resp. 3.3.5) we may factor p as 


EEE o E 


where C is an internal category (resp. an internal preorder) in S and i is an 
inclusion. By 3.2.14 the pullback of [C,S] along q exists and is of the form 
la* (C), S]; and clearly g*(C) is a preorder if C is. So it suffices to show that the 
inclusion i can be pulled back along qÏ, and that its pullback is an inclusion; 
but this was done in A4.5.14(e). o 


As at the end of Section B3.1, we should emphasize that although the bound- 
edness restriction in 3.3.6 is a real one, it is not in practice very irksome, since 
we know so few examples of unbounded geometric morphisms. Indeed, we do not 
know any example of a pair of unbounded morphisms whose pullback is definitely 
known not to exist in Yop. 

For future reference, we note 


Lemma 3.3.7 Let q: F — S be a hyperconnected geometric morphism. Then 
the pullback of q along any bounded geometric morphism p:E—-S is 
hyperconnected. 


Proof As before, it suffices to consider separately the cases when p is of the 
form |C,S] — S, and when p is an inclusion. In the former case, since the inverse 
image of the pullback qÏ of q is simply q* applied to diagrams of shape C in S, 
it is easy to see that it is full and faithful and that its image is closed under 
subobjects. The latter case was dealt with in A4.6.11. D 


In particular, it follows from 3.3.6 and 3.3.7 that the hyperconnected-localic 
factorization of an arbitrary geometric morphism, as constructed in A4.6.5, is 
stable under pullback along arbitrary bounded morphisms. In C2.4.11 we give 
an entirely different proof that bounded hyperconnected morphisms are stable 
under pullback along arbitrary geometric morphisms. 

Since any bounded geometric morphism p: E — S can be pulled back along 
itself, we can form the diagonal map Ap: E — E xg E. In Part C we shall often 
have occasion to consider such diagonal morphisms, and the following result will 
be of use. 
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Proposition 3.3.8 
(i) For any bounded geometric morphism p, the diagonal A, is localic. 


(ii) If p is localic, then A, is an inclusion. 
(iii) If p is an inclusion, then A, is an equivalence. 


Proof It is convenient to prove the third assertion first. But (iii) is immediate 
from the construction of pullbacks of inclusions given in A4.5.14(e); for the 
pullback of any inclusion along itself is clearly an equivalence. (Alternatively, we 
could appeal to the fact that inclusions are fully monic in Top, in the sense of 
1.1.19.) 

(i) We may factor p, as usual, as E — [C, S] — S, where the first factor is 
an inclusion. For the second factor, we know by 3.2.13 that [C, S] xs [C, S] ~ 
[C x C, S], and it is clear that the diagonal map of this factor is induced by the 
diagonal functor C — Cx C. But the latter functor is faithful, so by (the internal 
version of) A4.6.2(c) it induces a localic geometric morphism. Now we have a 
pullback square 

E X [C,5] E ————> (C, S] 


E xs E — > [C,S] xs [C,S] 


where the right vertical arrow is the diagonal of [C, S] — S, so the left vertical 
arrow is also localic by 3.3.6. But the diagonal of p is the composite of this 
morphism with the diagonal of E€ — [C, S], and we have already seen that the 
latter is an equivalence. 

(ii) Similarly, if p is localic, we may factor it as € — |P, S] — S where P is an 
internal preorder in S, by 3.3.5. In this case, the diagonal functor P — P x P is 
full as well as faithful, so it follows from the internal version of A4.2.12(6) that 
the diagonal of [P, S] — S is an inclusion. The rest of the proof is similar to that 
of (i). oO 


We shall eventually see (in C2.4.14) that the converse of 3.3.8(ii) is true. 
The converse of 3.3.8(iii) is, however, false: a counterexample is given by the 
morphism Set/X — Sh(X) of A4.2.4(e), for any non-discrete space X. When S 
has a natural number object, alternative proofs of both parts (i) and (ii) of 3.3.8 
may be given using the notion of classifying topos developed in Section B4.2 
below (and also in chapter D3). If p: E — S classifies a geometric theory T over 
S, then E xs € classifies the theory of pairs of T-models, and as a topos over it the 
diagonal may be viewed as classifying the theory of isomorphisms between the 
generic pair of T-models — which is a propositional theory, and hence classified 
by a localic topos (4.2.12). Similarly, if p itself is localic, then we may take T to 
be a propositional theory, in which case freely adjoining an isomorphism between 
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two T-models is equivalent to forcing the models to be the same — which, as we 
shall see in Section B4.2, can be done by an inclusion. 

One might be tempted to conjecture after 3.3.8 that if p is (bounded and) 
hyperconnected than its diagonal should be a surjection. But this is false: con- 
sider the topos [C,Set] where C is the free category generated by the directed 
graph 

e aT ° . 
Then it follows easily from A4.6.9 that [C, Set] — Set is hyperconnected. How- 
ever, neither of the two objects of C is a retract of the other, so it follows from 
A4.2.7(b) that the diagonal morphism [C, Set] — [C x C, Set] is not surjective. 


We now return to the problem, mentioned at the beginning of this sec- 
tion, of formulating an S-indexed version of Giraud’s original theorem. In order 
to do this, we shall need to introduce the S-indexed version of the notion of 
oo-pretopos, which we met originally in Section A1.4. 


Definition 3.3.9 Let S be a topos, and E an S-indexed category. We say E is 
an S-indexed oo-pretopos if it satisfies the following conditions. 


(i) Each fibre £" is a pretopos, and the transition functors z*: E! — £7 are 
coherent functors. 


(ii) Each €" has coequalizers (of arbitrary pairs) which are stable under 
pullback and preserved by the functors x*. Moreover, if 


f' 
A ef 
g' 
J 
A PEN C 
g 


is a commutative diagram in some £7 where the bottom row is a coequalizer 
and both the squares are pullbacks, then B’ — B is the pullback along h 
of the image of the composite B’ — B — C. 


(iii) E is well-powered as an S-indexed category. 


(iv) E has S-indexed coproducts which are disjoint and stable, in the sense 
defined in Section B1.4. 


In the case S = Set, it is not hard to verify that the naive indexing of 
an ordinary category E is a Set-indexed oo-pretopos iff € is an co-pretopos as 
defined in Section Al.4. The only surprising element of the definition is the 
condition on arbitrary coequalizers in (ii), which was not present in our defini- 
tion of an oo-pretopos (though it follows from it, by A1.4.19; note that if the 
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pair (f,g) in the second condition of (ii) is an equivalence relation, then we 
have B’ = 4¢9*(B’), from which B’ = h*d,(B’) follows by Beck—Chevalley). 
Condition (ii) was also not present in Giraud’s original theorem (he required 
merely that coequalizers of equivalence relations should exist and be stable under 
pullback — which is true in any pretopos). The reason why we have had to include 
it here is that we are not assuming that our base topos S has a natural number 
object, and so we cannot hope to construct the equivalence relation generated by 
an arbitrary parallel pair as we did in A1.4.19. (However, it would be sufficient 
to restrict the condition to coequalizers of reflexive pairs, since we already have 
well-behaved finite coproducts given by (i).) We note also that if p: E > S isa 
geometric morphism, then the indexing of E over S induced by p* satisfies the 
conditions of 3.3.9: to verify that the second part of (ii) holds in any topos, note 
that the hypotheses imply that the classifying map x: B — 2 of B’ — B factors 
through h. Conversely, we have: 


Lemma 3.3.10 Let E be an S-indexed co-pretopos with a separating family. 
Then E is an S-indexed topos as defined in Section B3.1. 


Proof We note first that, by 1.4.12, E is equivalent to the S-indexed category 
(I + E'/p*I) for some cartesian functor p*: S — €'; moreover, we shall see 
shortly that E is locally small, so that p* has a right adjoint by 2.2.5. Thus it 
is sufficient to show that the fibre £! is a topos. However, our method for doing 
this will easily yield the extra information that the other fibres are also toposes, 
and the transition functors between them are logical; so we do not in fact need 
to appeal to 1.4.12. 

To show that E is locally small, we use the S-indexed version of the argument 
of A1.4.17: given objects A and B of fibres €! and E7 respectively, the object of 
S/I x J indexing morphisms A — B is a subobject of the object indexing sub- 
objects of 7] A x 73.B, corresponding to those subobjects (a,b): S — af A x 73B 
which are graphs of morphisms, i.e. such that a is an isomorphism. To construct 
this subobject, we need to know that invertibility is definable in E; and we can- 
not use 1.3.15 for this purpose, since we do not yet know that E is locally small. 
However, it is easy to see that in a well-powered indexed category the prop- 
erty of being invertible is definable for monomorphisms, since z*(m: A’ — A) is 
invertible iff x factors through the equalizer of the morphisms in S indexing m 
and 14; and an arbitrary morphism f: A — B in €! is monic iff the diagonal 
A >= A Xp A is an isomorphism, so we may first construct the subobject I’ — I 
which forces this to be true, and then construct I” — I’ forcing f itself to be 
an isomorphism. 

E is also well-copowered, by an argument which we have seen before (for 
toposes) in 1.3.14: quotients of an object A of E£! correspond to subobjects of 
A x A which are equivalence relations, and we can construct an object of S/I 
which indexes the latter. Since it is also S-cocomplete, we can therefore use 
the S-indexed adjoint functor theorem to construct right adjoints for indexed 
functors defined on E. 
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Thus, to show that £! is cartesian closed, it suffices to show that for each 
object A of €! the indexed functor which sends B € ob £! to B x I*A is 
S-cocontinuous. But conditions (i) and (ii) of 3.3.9 ensure that this functor 
preserves (fibrewise) finite colimits, so we need only check that it preserves 
S-indexed coproducts. For this, we note that, given x: J — I and objects B,C 
of E1, E! respectively, we have a pullback square 


T 
B xz*C ———>B 
nB 


z* (£B x C) —> 2*,B 


since x* preserves finite products; hence by 1.4.10 the canonical morphism 
E(B x 2*C) > E.B x C is an isomorphism. Specializing to the case when 
C = I*A, we deduce that the indexed functor (—) x A preserves S-indexed 
coproducts. 

Thus £} is cartesian closed; and repeating the argument in the context of 
S/I-indexed categories will show that £7 is cartesian closed for every I. More- 
over, we have verified the Frobenius reciprocity condition for the transition 
functors x* and their left adjoints, so by A1.5.8 the x* are cartesian closed 
functors. 

To verify that the €/ have subobject classifiers, we consider the indexed func- 
tor Subg: E°P — S which sends an object A of €/ to the object of S/I indexing 
subobjects of A. We shall show that this functor is continuous, and so has an 
indexed left adjoint L; it is then straightforward to verify that L7(1,) is a subob- 
ject classifier for £7, and since the L/ form an indexed functor it follows that the 
transition functors of E preserve the subobject classifiers, and are thus logical. 

Once again, the fact that Subg sends finite (fibrewise) colimits to limits is 
straightforward from conditions (i) and (ii) of 3.3.9, so we have only to verify 
that, given z: J — I in S and A € ob EY, we have Sub (£A) = I, (Subz(A)). 
For this, we note first that the functors 5, are conservative, by an easy applica- 
tion of 1.4.11. We claim next that they also preserve monomorphisms: for this, 
suppose we have m: A’ — A in £7, and form the image factorization 


g f 


ELA — B > 5, A 


of &am. Now form the diagram 


A’ eee C ae A 
NA’ A 
x* he 
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where the right-hand square is a pullback, and h is the unique factorization of 
(m, (*g)na‘) through it. Then 1.4.10 tells us that B = XC, by an isomorphism 
which identifies g with X,h. Since £, is faithful, it reflects epimorphisms; so h 
is epic; but it is also monic since kh = m is, and so it is an isomorphism. Thus 
g is an isomorphism, and ©,m is monic. 

Now we claim that we have a bijection between subobjects of A in £7 and 
subobjects of £, A in £7: the mapping in one direction is obtained by applying 
x, to subobjects of A, and in the other direction we apply x* to subobjects of 
E, A and pull back along 74. Lemma 1.4.10 implies that the composite one way 
round is the identity, and the argument in the previous paragraph verifies that 
the composite the other way round is also the identity. 

Finally, given an object K of S and a morphism y: K — I, let us form the 
pullback 


| el ee 
| i 
y 


K ——> I 
then morphisms y —> Il, (Subġ (A)) in S/I correspond to morphisms z — 
Subz(A) in S/J, and hence to subobjects of z*A in £}. By the argument 
just given, these correspond bijectively to subobjects of E „z*A = y*,A in 
EK, and hence to morphisms y — Subj(=,A) in S/I. It is straightforward to 
verify that these bijections are natural in y; and so we have an isomorphism 
Il, (Subz(A)) & Subg(=,A), as required. Oo 


We may now prove a result (the equivalence of (i) and (iv) below) which, 
in the case S = Set, reduces to something much more like Giraud’s original 
theorem. 


Theorem 3.3.11 Let S be a topos, and E an S-indered category. The following 
conditions are equivalent: 

(i) E is an S-indexed o0-pretopos with a separating family. 

(ii) E is a locally small, cocomplete S-indexed topos with a separating family. 


(iii) There exists a bounded geometric morphism p:E€—S such that E is 
equivalent to the indexing of E over S induced by p*. 

(iv) There exists an internal category C in S, such that E is equivalent to a 
reflective indexed subcategory of the indexed functor category [C,S] with 
cartesian reflector. 


Proof Combine 3.3.10, 3.1.2, 3.3.4 and A4.3.9. O 


Suggestions for further reading: Chapman & Rowbottom [238], 
Diaconescu [280], Giraud [405]. 
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B3.4 Colimits in Top 


There is a sense in which this section does not belong in the present chapter. 
Half of it belongs in Chapter A4, since it consists of elementary results about 
geometric morphisms; and the other half is restricted to bounded toposes over 
a base, and so belongs in Chapter B4 (or even in Part C). However, the logical 
flow of ideas seems to point to this stage in the text as the correct one at which 
to gather together the results which follow. 

We shall see in the cases which follow that, whenever a diagram D in Yop has 
a (possibly weighted) colimit, the underlying category of the colimit is always 
the (weighted) limit in CAT of the corresponding diagram of toposes and inverse 
image functors. There is of course a reason for this: we have seen in 3.2.9 
above that, for a topos S with a natural number object, the forgetful functor 
U: (Yop/S)°? — GAY, which sends an S-topos p: E — S to the underlying cate- 
gory of € and a geometric morphism to its inverse image functor, is representable, 
and so it must transform arbitrary weighted colimits to weighted limits. Hence, 
given a weighted diagram (D, W) in Top, provided there exists some topos S with 
a natural number object such that the diagram D can be regarded as living in 
Zop/S, then the weighted colimit of (D, W) if it exists will be the weighted limit 
of (UD°?, W) in CAT. (Note that if W is the trivial weighting, so that we are 
seeking an ordinary ‘conical’ colimit, then the colimit topos £ itself, if it exists, 
will be a topos such that the diagram D lives in Top/ZL; but this is not the case for 
more general weightings.) Since weighted limits are generally easy to construct 
in CAT, the major part of the work in showing that particular types of weighted 
colimits exist in Top is thus to show that these weighted limits are in fact toposes. 

We begin with the simplest case, that of finite coproducts. 


Lemma 3.4.1 The 2-category Yop has finite coproducts. 


Proof The terminal object of CAT, the category 1 with one object and one 
(identity) morphism, is initial in Lop: for any topos £, the unique functor E — 1 
is the inverse image of a geometric morphism, whose direct image sends the 
unique object of 1 to the terminal object of €, and this is clearly the only 
geometric morphism 1 — E up to unique 2-isomorphism. Similarly, if € and F are 
toposes, then the cartesian product € x F is a topos (we can regard it as a special 
case of the glueing construction of A2.1.12, where F is the constant functor with 
value 1), and the projections from it to E and F are inverse image functors (their 
right adjoints send objects A and B to (A, 1) and (1, B) respectively). And, given 
an arbitrary cospan 


E S G TE eee F 
in Yop, we have an induced morphism h: Ex F— G given by h*(C) = 
(f*(C),9*(C)) and h,(A,B) = f(A) x g,(B); it is clear that the construc- 
tion (f, g) + h is functorial, and that, together with the operation of composing 
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with the morphisms whose inverse images are the projections, it defines an equiv- 
alence of categories between Lop (E x F,G) and Top (E,G) x Lop (F,G). O 


We note that the initial object of Top is strict, since if we are given a geo- 
metric morphism f: € — 1 then f* preserves the isomorphism 0 = 1, and so E 
is degenerate. Also, coproducts are disjoint and stable under pullback; for we 
can identify E and F with the complementary open subtoposes (E€ x F)/(1,0) 
and (E x F)/(0,1) of the coproduct, and any geometric morphism f: G —> E x F 
induces a corresponding decomposition G ~ G/f*(1,0)xG/f*(0, 1) of its domain. 

The only reason for restricting to finite coproducts in 3.4.1 is the need to form 
a product inside G in the definition of h,; if we are working in a context where 
the toposes concerned all have arbitrary set-indexed products, for example in the 
2-category ‘BLop/Set, then we may conclude by exactly similar arguments that 
an arbitrary set-indexed family of toposes has a coproduct, which is simply the 
product of their underlying categories. We shall generalize this observation in 
3.4.7(i) below. 

The glueing construction, or some variant of it, is used in most of the cases 
where colimits can be constructed in Top by elementary means. A good example 
is the following: 


Lemma 3.4.2 The 2-category Top has cocomma objects. 


Proof Let 
f g 


F <—— E — G 


be a span in Top. To form the cocomma object (f Tg), we need to consider the 
comma object (f*}g*) in CAT; but this is the usual comma category, whose 
objects are triples (B,C,h) where B € ob F, C € ob G and h: f*(B) > g*(C). 
And since morphisms f*(B) — g*(C) correspond bijectively to morphisms B — 
f.«g*(C), this category is isomorphic to the comma category (17 | f,g*), i.e. to 
Gl(f.g*). Moreover, f,g* is cartesian, so by A2.1.12 GI(f,g*) is a topos, and 
by A4.1.12 we have geometric morphisms 


Ree S Giggs =e 


and the canonical isomorphism qg*p, = f,g* of A4.1.12 transposes to yield a 

natural transformation f*g* — g*p*, i.e. a geometric transformation gf — pg. 

So we have constructed a weighted cone of the required form under (f, g) in Top. 
Given any other such weighted cone 
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there exists (up to unique isomorphism) a unique functor 1*: H — GI(f.g*) 
with p*l* = h* and qg*l* = k*, namely (D) = (h*(D),k*(D),@p) where 
@p: k*(D) — f.g*h*(D) is the transpose of ap across the adjunction (f* 4 f,). 
Moreover, l* is cartesian since h* and k* are, and it has a right adjoint I, sending 
(C,B,u: B — f,9*(C)) to the pullback of 


k,(B) 


k,(u) 


pie) ngage) "2 Rhee ©) 


where 77 is the unit of (g* 4 gą) and & is the mate of a. So we have a geometric 
morphism l: Gl(f.g*) — H; the construction of l is clearly functorial in the 
data (h, k, a), and defines (the other half of) the required equivalence between 
Top (GL f.9*), H) and the appropriate category of weighted cones. o 


If we take f = g = le in 3.4.2, we obtain the result mentioned before 3.2.14 
that the functor category [2, €] (the Sierpiński topos over £, cf. 3.2.11) serves 
as the tensor 2 @ € in Yop. More generally, if C is any finite category, then 
the functor category [C, E] (which is of course the cotensor C h E in CAT — 
cf. 1.1.4(b)) can similarly be shown to have the universal property of a tensor 
C & E in Top. (Recall that we showed that [C, E] was a topos in 2.3.18.) 

The next example uses a generalization of the glueing construction. 


Lemma 3.4.3 Pushouts of pairs of inclusions exist in Top. 


Proof Let 


re e—l7”>g 


be a pair of inclusions with common domain. Consider the functor 
G:F xG —F xG which sends (B,C) to (B x f.g*(C),C x g.f*(B)); this 
functor is clearly cartesian, and it has a comonad structure whose counit is 
induced by the projections B x f,g*(C) — B and C x g, f*(B) — C, and whose 
comultiplication is induced by 


(T1, n2, n2, 071) 

Bx fag*(C) ——————_> B x fag*(C) x fag* (C) xX feo" Ge f*(B) 
(where @ is the composite of the unit ng: B — f, f*(B) and f, applied to the 
inverse of the counit of (g* 4 g,)) and the corresponding morphism with f 
and g interchanged. (The verification of the comonad identities is tedious but 
straightforward.) So the category of coalgebras (F x G)g is a topos, by A4.2.1; 
but the objects of this topos can be identified with quadruples (B,C,a, 3) 
where a: B — f,g*(C) and 8: C — g,f*(B) are morphisms satisfying coher- 
ence conditions equivalent to saying that their transposes f*(B) — g*(C) and 
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g*(C) — f*(B) are mutually inverse isomorphisms. So we may identify the topos 
(F xG)g with the (pseudo-)pullback of f* and g* in CAT. As in 3.4.1 and 3.4.2, 
the verification that it has the universal property of a pushout in Top is now 
straightforward. O 


From the proof of 3.4.3, we may obtain a result reminiscent of A2.4.3: note 
that it includes the result, already mentioned, that coproducts in Top are 
disjoint. 


Scholium 3.4.4 Let 


ae) 


2<— 5 


f 
pe oes 
g 

k 
qe es 


Q 


be a pushout square in Lop where f and g are inclusions. Then h and k are also 
inclusions, and the square is also a pullback. 


Proof By 3.4.3, we may identify H with the category whose objects are triples 
(B,C, a) where B € ob F, C € ob G and a is an isomorphism f* B — g*C in £. 
The inverse images of h and k are the two projections; and an easy calculation 
shows that h,(B) may be taken to be the triple (B, 9. f* B, (ey-3)~1), where € 
is the counit of (g* 4 g+). Hence the counit of (h* 4h,) is an isomorphism; and 
similarly for k. To show that the square is a pullback, we may identify £, F and 
G with full subcategories of H, consisting respectively of those objects (B,C, a) 
for which B and C are sheaves for the appropriate local operators on F and G 
(resp. C is a sheaf, B is a sheaf). It is clear that E is simply the intersection 
of F and G; so the fact that the square is a pullback in Top is immediate 
from the criterion of A4.3.11(iii) for a geometric morphism to factor through an 
inclusion. O 


Corollary 3.4.5 Any inclusion is a regular monomorphism (that is, the 
pseudo-limit of a coreflexive pair) in Top. 


Proof Set f = g in 3.4.4, and combine it with A1.2.10. O 


The reader should be warned not to read into 3.4.5 more than it says: it 
does not assert that if (h, k) is the cokernel-pair of an inclusion f, then f is the 
(pseudo-)equalizer of the pair (h, k), but rather that f is the limit of the diagram 
formed by h, k and the ‘codiagonal’ morphism d which is a common splitting 
for g and h. The point is that, when we are working in a 2-category, these two 
limits do not necessarily coincide: a cone over the diagram formed by h and k 
is a morphism g equipped with a 2-isomorphism a: hg — kg, whereas a cone 
over the diagram formed by h, k and d must satisfy the additional condition 
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that doa is (modulo the coherence isomorphisms) the identity 2-cell on g. If the 
identity functor on the domain € of f admits nontrivial automorphisms (as is the 
case, for example, if E = [G, Set] for some group G with nontrivial centre), then 
f will not have the required universal property among cones of the first type; 
but it will be universal among cones of the second type. We shall encounter a 
similar phenomenon when we discuss the difference between 1-dimensional and 
2-dimensional colimits of simplicial objects in 3.4.11 below. 

We do not know of any other cases where ‘conical’ colimits can be constructed 
in Top by elementary means. However, as noted by G. C. Wraith [1235], a gen- 
eralized version of the glueing construction suffices to construct (op)lax colimits, 
as defined in 1.1.6, for arbitrary finite (op)lax diagrams in Top. Recall that a 
lax diagram D of shape J in Top assigns a topos D(j) to each object j of J, 
a geometric morphism D(a): D(j) — D(j’) to each morphism a: j > j’ of J, a 
geometric transformation ¢;: 1p; ~ D(1;) to each object j, and a geometric 
transformation ġa, 8: D(@)D{a) — D(Ga) to each composable pair (œ, 3), sub- 
ject to appropriate coherence conditions. (In Section B1.1 we actually considered 
lax diagrams over 2-categories, but for simplicity we shall assume here that J 
has no 2-cells other than identities.) 


Lemma 3.4.6 Any finite lax (resp. oplax) diagram in Top has a lax (resp. 
oplax) colimit. 


Proof We consider the lax case in detail. Suppose given a lax diagram 
D: J — Top as above; we form the topos € = ILe z D(J) (that is, the coproduct 
of the D(j) in Lop), and define a functor G: E — E by 


GA; FEI) =( J] Dla).(4y) 15 € J). 
aj! gj 


G is clearly a cartesian functor, since each D(a), is cartesian. We claim that 
G has a comonad structure: the counit €: G(A; | j € J) — (A; | j € J) is the 
morphism whose jth component is the composite 


Hoei — S50) .4) 2S 4, 


where ¢; is the mate of ¢;. Similarly, the comultiplication 6 is the unique 
morphism making 


[I 2o). (4;) > J] 08). Dla). (A) 
a a,ß 


da, 
D(Ba).(Aj") —2> D()-D(a) (Aj) 
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commute for each composable pair (a, 8). The comonad identities for € and 6 
follow straightforwardly from the coherence conditions on the @’s. 

Hence by A4.2.1 we have a topos Eg of coalgebras for the comonad G = 
(G,«¢,6). We claim that this is (the vertex of) a lax colimit for the given lax 
diagram in Top. For giving an object (A; | 7 € J) of E a G-coalgebra struc- 
ture is easily verified to be equivalent to specifying a family of morphisms 
ĝa: A; — D(a).(Aj;-) for each a: 7’ — j in J, satisfying appropriate compati- 
bility conditions with the $3 and ¢a,g; hence it is also equivalent to specifying 
a family of morphisms ĝa : D(a)*(A;) — A; satisfying compatibility conditions 
with the ¢; and ¢,,g. Thus we see that the projections Eg — D(j) are the legs 
of a lax limit cone over the lax diagram in CAT formed by the D(a)*; but they 
are inverse image functors, and as usual it is easy to see that the corresponding 
lax cone under the diagram in Top is a lax colimit. 

The oplax case is similar, except that we use the inverse image functors D(a)* 
to define our comonad G. a 


Of course, an ordinary finite diagram may be considered as a lax (or oplax) 
diagram in which the ¢; and ġa,g are isomorphisms. In particular, considering 
a single geometric morphism f: F — E as a diagram of shape 2 in Top, we note 
that its lax and oplax colimits are the cocomma objects (f Tl) and (iff) 
respectively. It is easy to see that the construction of 3.4.6 reduces to that of 
3.4.2 in these two particular cases. 

Now we turn from Top to Top/S. We note that the latter is itself enriched, 
not just over CAT, but over the 2-category CATs of S-indexed categories: given 
S-toposes p: E — S and q: F — S and an object I of S, we define an ‘J-indexed 
family of geometric morphisms F — E over S’ to be a geometric morphism 
F/q*I — E/p*I over S/I, with transition functors induced by pulling back 
along geometric morphisms of the form S/J — S/I. (In fact BLop/S is even 
enriched over Cats, the 2-category of locally internal categories over S: as we 
shall see in 4.1.6 below, if E is bounded over S, then there exists an object E of E 
such that any geometric transformation a: f — g between geometric morphisms 
f,g: F 3 E over S is uniquely determined by its component at E, and hence we 
can construct an object of S indexing the geometric transformations between f 
and g. However, we shall not need this extra structure.) 

In considering colimits in BYop/S, it therefore makes sense to ask for 
weighted colimits of diagrams indexed by internal categories in S, where the 
weighting functor also takes values in Cat(S). We shall not go into the general 
theory of such weighted colimits, though in fact it is very similar to the case 
S = Set; in particular, the analogue of Theorem 1.1.5 still holds. But we note a 
couple of special cases of the notion. 


Lemma 3.4.7 Let S be a topos. 


(i) Zop/S and BLop/S both have S-indexed coproducts. 
(ii) Lop/S and BTLop/S both have tensors with internal categories in S. 
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Proof (i) Clearly, an J-indexed family of S-toposes is just a topos q: F — S/I 
over S/I. To form its coproduct, we simply form the composite F — S/I — S; 
for, given any p: E — S, morphisms F — E over S correspond (by a particular 
case of 3.2.12) to morphisms F — €/p*I over S/I. And the fact that BTop/S 
is closed under S-indexed coproducts is a particular case of 3.1.10(i). 

(ii) In the same spirit, given q: F —> S and an internal category C in S, we 
define the tensor C Q F in Top/S to be the diagram category [q*C, F]. Clearly, 
this is the cotensor C h F in the 2-category of S-indexed categories, by 2.3.13; 
so it is easy to see that if we are given a diagram of shape C in Yop/S (Ff, €) 
(that is, a geometric morphism f : F/q*Co — E/p*Cp over S/Co, equipped with 
an action transformation dj(f) — d§(f) in Zop/(S/C1) satisfying the appro- 
priate equations), then we obtain an indexed functor g*: E — [C, F]: given an 
object A of €, we define g* A to be the object f*(Cj(A)) of F/Co, equipped 
with the component at A of the given geometric transformation. Defining the 
direct image functor g, is a little more tricky: we need to introduce an auxiliary 
internal category D, as follows. Its object of objects Do is C1, and its object of 
morphisms D; is the colimit of the diagram 


Co 


S 8 
0 So 0 


Ci Cı Cı 


with the domain map dı: Dı — Do being induced by the triple (sod1, 1c, , Sodo) 
of morphisms Cı — C and the codomain map induced by (1c,,1c,,1c,); the 
inclusion of identities Do — Dy, is the middle leg of the colimit cone defining D4. 
(Composition is easy to define, since a straightforward diagram-chase shows 
that at least one member of any composable pair in D is an identity morphism.) 
Now, given an object (F — Co, ¢: dj F — d§F) of [C, F] (we shall suppress all 
mention of the functors p* and q* from now on, to simplify the notation), we 
may construct a diagram of shape D in E: we form the object (djg).(d§F) of 
E/C, = E/Do, equipped with the D,-action in which elements of D, belonging 
to the first copy of Cı in D, act via (djg).(¢) and those in the third copy of 
Cı act via the C-action on f, (which is of course contravariant, since C acts 
covariantly on f*). Now we define g*(F, ¢) to be the limit in E of this internal 
diagram. Checking that this does indeed define a right adjoint to g* requires a 
lot of tedious verification, but it should be clear enough if one decodes the above 
definition in the case S = Set (and compares it with the construction we gave 
for the direct image functor J, in the proof of 3.4.2). 

In the opposite direction, we have a canonical geometric morphism F/Co — 
[C, F] over S (indeed, over F) whose inverse image is the forgetful functor; as in 
part (i), we may equivalently regard this as a morphism F /Co — [C, F|/Co over 
S/Co, ie. a Co-indexed family of geometric morphisms F — [C, F] over S; and 
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this clearly admits a C-action map, i.e. we can regard it as a dagram of shape 
C in Lop/S (F, [C, F}). So composition with this diagram yields a functor from 
Vop/S ([C, F], E) to [C, Lop/S (F, EJ; it is straightforward to verify that this is 
inverse (up to natural isomorphism) to the construction defined above. 

As in part (i), the ‘bounded’ case of the result follows immediately from 
the case already done, plus the fact that [q*C, F] is bounded over F and hence 
over S. o 


Of course, in the bounded case of (ii), we do not have to go through the 
hard work of defining gą: we may simply observe that g* is cocontinuous as an 
S-indexed functor, and then use 2.4.6 to deduce that it has a right adjoint. 

Whilst we are on the subject of Bop/S (F,€) as an S-indexed category, 
we note the following result, which can be viewed as an extension of A4.1.13. 
(Recall that splittings of idempotents in a category are essentially the same thing 
as colimits indexed by finite filtered categories. ) 


Lemma 3.4.8 For any two bounded S-toposes E and F, the S-indexed category 
BLop/S(F,E) has colimits indexed by filtered S-indexed categories. 


Proof Given a diagram of geometric morphisms fe: F — E indexed by the 
objects of a filtered category C in S, we may form the ‘pointwise’ colimit f, 
of the functors f* in the S-indexed category of S-indexed functors E — F. The 
latter is cartesian by 2.6.8; but it is also clearly S-cocontinuous; so it has a right 
adjoint foo. by 2.4.6. It is now routine to verify that the adjoint pair (fž 4 foo») 
defines a geometric morphism F — € over S, and that it is the colimit of the fe 
in BVop/S (F, €). O 


We now revert to the problem of constructing colimits in BTop/S itself, 
rather than in its hom-categories. In order to do this in more complicated 
cases, we shall have to rely on Giraud’s theorem: specifically, on the form of 
it proved in 3.3.11, where we saw that bounded S-toposes are the same thing as 
S-indexed oo-pretoposes with separating families. The point is that the inverse 
image of a geometric morphism over S is a morphism of S-indexed co-pretoposes 
(in the sense that it preserves all the structure involved in the definition of 
such categories), and it is easy to see that any S-indexed weighted limit of 
S-indexed oo-pretoposes (and functors preserving this structure) is an S-indexed 
oo-pretopos. Since (as already indicated in this section) we expect weighted col- 
imits of diagrams of toposes to be weighted limits of their underlying categories 
and inverse image functors, it therefore only remains to show that the limit in 
question has a separating family in order to deduce that it is a bounded S-topos, 
and hence is the desired colimit in BYop/S. 

However, even for the simple case of coequalizers, showing that the equalizer 
in Cats has a separating family is by no means simple. From now on, we shall 
need to assume that S has a natural number object; in fact, for notational con- 
venience, we shall treat it as if it were Set, but the reader who has followed the 
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material of Part B thus far should have little conceptual difficulty (as opposed to 
the practical difficulty of actually writing it down!) in translating the argument 
which follows into ‘S-indexed language’. The argument which we give is essen- 
tially that of I. Moerdijk [831], but similar arguments were given independently 
by M. Tierney and P. Freyd. 

So let us suppose given a parallel pair f,g: F 3 E of geometric morphisms 
over S. Of course, when we refer to the equalizer of f* and g* in Cats, we 
mean their pseudo-equalizer; that is, the category G whose objects are pairs 
(A,a@) where A € ob € and a is an isomorphism f*A — g*A in F. Morphisms 
(A,a) — (B, 8) in G are morphisms A — B in € commuting with a and £8 in the 
obvious sense; note in particular that the forgetful functor G — € is faithful (so 
that, once we have proved that G is a topos, we shall know that the coequalizer 
E€ — G is a surjection). Note also that the inverse image functor p*: € — E has a 
canonical lifting to a functor r*: S — G, since we are given natural isomorphisms 
f*p* = q* = g*p*; and the indexing of G over S which it acquires via this 
functor coincides with that which it acquires as the (pseudo-)equalizer of a pair 
of S-indexed functors. 

Now let C be an internal (to S) full subcategory of E containing a separating 
family for the latter, so that € may be represented as a subtopos of [C°P, S}. For 
convenience, we shall suppose that C is closed under finite limits in E (we may 
clearly do this, since S has a natural number object). Borrowing the notation to 
be introduced formally in Section C2.4, we shall write € ~ Shg(C, J), the topos 
of ‘S-valued sheaves’ on the internal site (C, J). We similarly choose an internal 
site (D, K) for F over S; but we require that, in addition to a separating family 
for F over S, the objects of D should include also the images under both f* and 
g* of the objects of C. Thus f* and g* restrict to internal functors F,G: C = D, 
which are cartesian and cover-preserving (that is, they are morphisms of sites 
in the sense of C2.3.1); and they induce the geometric morphisms f and g in 
the manner of C2.3.4. Explicitly, this means that f* may be computed in terms 
of sheaves by first forming the left Kan extension along F (which yields only 
a functor D°P — S, not a sheaf in general) and then applying the associated 
K-sheaf functor (cf. C2.2.6). 

Next we proceed to enlarge C: let C = C and, for n > 1, define C™ to 
be the internal full subcategory of E whose objects are coproducts [],<7+ Ai of 
objects of C), where J+ = I II 1 is the set obtained by adding a singleton to 
the index set of some cover (B; — B |i € I) in the coverage K on D. Let C(~) 
similarly consist of all countable coproducts [[,,59 An, where A, is an object 
of C™ for each n, and let C be the full subcategory on all quotients in € of 
objects of C(). It is clear that all these families of objects can be indexed by 
appropriate objects of the base topos S, so C is indeed an internal category. 

We claim that the indexed family of all pairs (A, œ), where A is an object of 
Č and a is an isomorphism f*(A) S g*(A) in F, is a separating family for G. To 
prove this, let u,v: (C, Y) 3 (D,6) be two morphisms in G such that uw = vw 
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for every w: (A,a) — (C, y) with A an object of C; we must show that u = v. 
Thinking of C and D as sheaves on the site (C, J), let x be an element of C(A) 
for some A € ob C (which we may equivalently consider as a morphism A — C 
in £). We shall show that there is a subobject C — C containing the element z, 
which is the image of a morphism A — C with A € ob Cl) (and is therefore an 
object of C), such that y restricts to an isomorphism f*(C) — g*(C); then we 
can take w to be the inclusion C — C, and deduce that u and v agree on the 
element z. 

Now z clearly determines a morphism f*(x): f* A — f*C, or equivalently an 
element of f*(C)(F'A), corresponding to the element of the left Kan extension of 
C determined by the element z and the identity morphism FA — FA. Applying 
yra to this, we get an element of g*(C)(F'A): this may be represented by a 
K-covering family (B; > FA | i € I) and a family of elements y; of the left Kan 
extension of C along G; and the y; may in turn be represented by morphisms 
(B; > GA;) and elements z; € C(A;,). Let A, be the coproduct of the family 
obtained by adjoining A itself to the A;, i € I, and z1: Ay; — C the morphism 
induced by the z;; then the construction ensures that the composite 

*(x 
ra LB precy 2s ro 
factors through the image of g* (z1): 9*(A1) — g*C. 

We now proceed recursively to define a sequence of objects A, of C(™ and 
morphisms Tn: A, — C in E, such that each An+1ı contains An as a subobject, 
and zn is the restriction of zn+1 to this subobject. For even values of n we choose 
XIn+1, as above, so that the composite yc - f* (£n) factors through the image of 
g*(2n41), and for odd n we do the same thing with f and g interchanged (and 
y replaced by its inverse). Let A% be the coproduct of the An, and C — C the 
image of the morphism A. — C induced by the zn. Then C is the union of the 
increasing sequence of subobjects Cn which are the images of the zn; and since 
Yo maps f*(C2n) into g*(Con+1) for all n, and its inverse maps g*(C2n+1) into 
f*(Con+2), it is clear that yc must restrict to an isomorphism f*(C) — g*(C), 
as required. 

We have thus done all the hard work in the proof of 


Theorem 3.4.9 The 2-category BYop/S has coequalizers, for any topos S 
with a natural number object. 


Proof As above, given f,g: F 3 E in BLop/S, we form the (pseudo-)equalizer 
G of f* and g* in Cats. The verification that G is a S-indexed oo-pretopos, as 
defined in 3.3.9, is straightforward, since E is one, and the functors f* and g* 
preserve all the structure involved in the definition. The argument above shows 
that G has an S-indexed separating family; so by 3.3.10 it is an S-indexed 
topos, i.e. it corresponds to a (bounded) S-topos (r: G—S). Moreover, the 
forgetful functor G — E preserves all S-indexed colimits; so it has a right 
adjoint by 2.4.6, and is thus (the indexed version of) the inverse image of a 
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geometric morphism h: € — G over S. The fact that hf & hg is clear from the 
construction; but if we are given any geometric morphism k: € — H over S sat- 
isfying kf = kg, then k* certainly factors through h*, and since we observed 
earlier that h is surjective we can use the adjoint lifting theorem A1.1.3 to fac- 
tor the right adjoint k, through h., and thus obtain a geometric morphism 
G — H over S. So h: E — G has the universal property of a coequalizer in 
BTop/S. 0 


As suggested earlier, we may combine 3.4.7 and 3.4.9 to obtain arbitrary 
S-indexed weighted colimits of bounded S-toposes, where S has a natural num- 
ber object. We shall not consider these in full generality, but we pause to note 
one particular case: 


Remark 3.4.10 We may construct coinserters in BYop/S (cf. 1.1.4(d)) by a 
construction very like that of coequalizers. That is, to form the coinserter of 
f,9g: F = E, we form the inserter of their inverse image functors (that is, the 
category whose objects are pairs (A,a) where A € ob € anda: f*A — g*Ain F 
is a not-necessarily-invertible morphism), and we may prove as in 3.4.9 that this 
category is an S-indexed oo-pretopos with a separating family. As previously, the 
only hard part is constructing a separating family; and we may do this in the 
same way as we did for coequalizers — except that the proof that the construction 
works is slightly easier, because we do not have to ‘work from both sides’ at the 
inductive stage of defining the objects An- 


Without going into further technicalities about S-indexed weighted colim- 
its, we note that 3.4.7(i) and 3.4.9 do suffice to construct arbitrary ‘S-indexed 
conical colimits’, that is colimits of functors C + SZop/S, where C is an inter- 
nal category in S, using the usual construction of colimits via coproducts and 
coequalizers. 

Next, we need to say a bit about the particular case of colimits of simpli- 
cial toposes, in preparation for our work on descent morphisms in BTop/S in 
Chapter C5. Of course, by a simplicial topos we simply mean a (pseudo-)functor 
AP — Yop, where A is the category of nonzero finite ordinals and order- 
preserving maps between them, as in Section B2.3 above. We shall use the 
standard notation for simplicial objects: if Ee is a simplicial topos, €, will 
denote the image of the ordinal n + 1 under the functor, and d?: En — €,- 
(resp. sot : En—1 — En) the images of the standard injections n — (n+1) (resp. 
surjections (n + 1) — n). If S has a natural number object, then it is clear that 
A may be identified with an internal category in S, either by the method of 
2.3.12(b) or by explicitly constructing it as a subcategory of the internal cate- 
gory S; of 3.2.9. So if we are given a diagram of shape A°? in BTop/S we may 
compute its colimit as suggested above. 

When we are working in an ordinary I-dimensional category, the colimit of a 
simplicial object A. is just the coequalizer of do, dı: A; 3 Ag, since the inclusion 
A, — A is initial (cf. 2.5.12), where A, is the subcategory of A consisting 
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of the ordinals 1 and 2 and the two injections 1 = 2. But for diagrams in a 
2-category, where we have to take the coherence 2-isomorphisms into account, 
we need to retain a bit more of the structure of A. Let Az be the subcategory 
of A consisting of the ordinals 1, 2 and 3, the injections 1 = 2, the surjection 
2 — 1, the three injections 2 — 3 and all composites of these morphisms. We 
shall refer to a diagram of shape A$? as a ‘2-truncated simplicial object’. 


Lemma 3.4.11 The inclusion Az — A is 2-initial; that is, for any simplicial 
object Ag: A°? — KR in a 2-category R (admitting the appropriate colimits), the 
canonical morphism from the colimit of the restriction of Ae to Ao, to the colimit 
of Ae, is an equivalence. 


Proof Let us write A, for the restriction of A, to A2. We shall show that any 
(pseudo-)cone under A, in & may be extended, uniquely up to unique isomor- 
phism, to a cone under A,. By definition, a cone under A, (with vertex B, say) 
consists of three 1-cells Ag — B, A, — B and Az — B, plus six 2-isomorphisms 
corresponding to the six generating morphisms of Ay. But since the coher- 
ence isomorphisms tell us that fı S fod} and f2 S fodid?, we may reduce 
these data to a single 1-cell f : Ag — B plus a 2-isomorphism a: fd} — fd} sat- 
isfying the ‘unit condition’ that a o s? is the composite of the isomorphisms 
fdis§ = f = fdis® arising from the equations disg = 1 = ds? in A°?, and 
the ‘cocycle condition’ that a o d? is (modulo the appropriate coherence isomor- 
phisms) the vertical composite of a o d3 and ao d2. (These conditions are of 
course reminiscent of those in the definition of descent data for a sieve generated 
by a single morphism; cf. 1.5.5. However, it will be noted that we cannot omit 
the requirement that a be invertible here, as we did in 1.5.5, since A contains 
only order-preserving maps and we do not have the ‘twist isomorphism’ A; — A, 
available. See 3.4.14(c) below for further discussion of this point.) 

Given such data, we may define a pseudo-cone under Ae, as follows: 
fn: An —> B is the composite fdjd?--.d?. For a morphism d?, the coherence 
isomorphism f,-,|d? = fn is the identity if i = 0, the coherence isomor- 
phism arising from the equality d}---dj~'d? = dj---dj~'dm in A” if 1 < 
i < n— 1, and the composite of the coherence isomorphism arising from 
d} - --d?~>d? = did2-..dS—'d" with ao d2---d? if i = n. Similarly, the isomor- 
phism fn418? = fn is just a coherence isomorphism arising from an equality in 
A°P, for all 2. It is straightforward but tedious to verify that these isomorphisms 
satisfy the appropriate compatibility conditions for all the simplicial identities, 
so that we do indeed have a pseudo-cone. Conversely, given any pseudo-cone 
(Qn: An — B |n € N) under Ag, the coherence isomorphisms yield a natural iso- 
morphism from it to the cone constructed as above by extending its restriction 
to A,. And the operations of extension and restriction are also well-defined on 
2-cells between cones. So we conclude that Cone(A,,—) and Cone(A,,—) are 
naturally equivalent CAT-valued pseudofunctors on K; hence their representing 
objects, if they exist, are equivalent. a 
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Corollary 3.4.12 Let S be a topos with a natural number object. Given a 
simplicial bounded S-topos Ee, its colimit in BYop/S may be identified with 
the category Desc(E,) whose objects are pairs (A,a) where A € ob & and 
a: di A — djA is an isomorphism in E, satisfying the unit condition that s(a) 
is (modulo coherence isomorphisms) the identity isomorphism A — A, and the 
cocycle condition that dj(a) is (isomorphic to) the composite of d3(a) and dj(a) 
in E2. In particular, Desc(€,) is a bounded S-topos. 


Proof If we construct the colimit of the restriction E, of Ee to A>", and 
simplify its description in the same way that we simplified the description of 
cones under AȘ? above, we arrive at the description of the colimit given in the 
statement. So the result follows from 3.4.11. o 


Remark 3.4.13 A direct proof that Desc(E.) is a bounded S-topos may be 
given along exactly the same lines as 3.4.9: if we consider the full subcategory of 
the coequalizer of dp and d; consisting of those (A, a) for which a satisfies the unit 
and cocycle conditions, it is still an S-indexed co-pretopos (being closed in the 
coequalizer under the appropriate operations), and the argument we gave to show 
that the coequalizer has a separating family goes through without change for the 
descent category. (Note that the descent category is closed under subobjects as 
a subcategory of the coequalizer; in fact the inclusion is the inverse image of a 
hyperconnected geometric morphism.) 


Examples 3.4.14 (a) Let C be an internal category in a topos €. As we saw 
in Section B2.3, we may regard C as a simplicial object Ce in £; and since 
the assignment A ++ €/A is functorial by A4.1.2, this in turn gives rise to a 
simplicial bounded €-topos €/C,. If we form the topos Desc(E/C,), we do not, 
as might have been expected, simply get the diagram topos [C, €E], because of 
the requirement, in the definition of an object (A,a) of this topos, that the 
structure map a should be invertible. Thus what we actually obtain is the topos 
[G, E] where G is the groupoid reflection of C, i.e. the category obtained by freely 
adjoining inverses for all morphisms of C. Of course, if C is already a groupoid, 
this is the same as [C, £]; however, if we wish to obtain [C, €] in the case when C 
is not a groupoid, then we must form the ‘lax descent topos’, defined as in 3.4.11 
but omitting the requirement that a be invertible. A direct proof that this is a 
topos may be given in the same way as for coinserters (cf. 3.4.10 above). 

(b) Slightly less trivially, let G = (G, =3 Go) be a topological groupoid 
(that is, an internal groupoid in the cartesian category Sp). As in Section B2.3, 
we may extend G to a simplicial object G, in Sp; and since the construction 
X ++ Sh(X) is functorial by A4.1.11, we obtain a simplicial object Sh(G,) in 
BTop/Set. Thus we obtain a topos Desc(Sh(G,)), whose objects are sheaves E 
on Go equipped with a (necessarily invertible) action map dj E — då E in Sh(G)) 
satisfying the unit and cocycle conditions; given the equivalence between Sh(X ) 
and LH/X, to be established in C1.3.11, we may alternatively regard this as the 
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full subcategory of the internal diagram category [G, Sp] (defined as in 2.3.11) 
whose objects are local homeomorphisms E — Gp. 

In particular, if G is a topological group (that is, if Go is the one-point 
space, so that sheaves on Go are simply (discrete) sets), then Desc(Sh(G,)) 
may be identified with the category Cont(G,) of continuous G,-sets which we 
defined (and showed to be a topos) in A2.1.6. We shall continue to use the 
notation Cont(G) for Desc(Sh(G,)) when G is a general groupoid in Sp, and 
call it the topos of continuous actions of the groupoid G. Note, however, that 
the proof given in A2.1.6 that Cont(G) is a topos cannot be extended to the 
case of a groupoid G whose space of objects Go is not discrete, since we do 
not have a comonadic forgetful functor from Cont(G) to the topos [UG, Set} of 
‘not-necessarily-continuous actions of G’. (Here U denotes the forgetful functor 
Sp — Set.) And when we pass from topological groupoids to localic groupoids 
(that is, internal groupoids in the category Loc of locales, to be introduced in 
Chapter C1), we cannot even employ the argument of A2.1.6 in the case of a 
group. However, the argument via categories of descent data, given here, goes 
through without change for localic groupoids. 

(c) In practice, we are often interested in studying Desc(€,) in the case 
when the simplicial topos € arises from the iterated pullbacks of a (bounded) 
morphism f: & — F; that is, we have €; = Eo XF Eo, E2 = Eo XF Eo XF £, 
and so on, with the d? and s} induced by the product projections and diagonal 
maps. In this case (as in (b), and the sub-case of (a) where C is a groupoid), 
we do have a twist equivalence t: €; — E1, obtained by interchanging the factors 
in the pullback; that is, we may extend E, to a pseudofunctor defined on the 
opposite of the category Set} of nonempty finite sets and all maps between 
them. Accordingly, in defining the category Desc(&.), we may leave out the 
requirement that the structure map a be invertible, since its image under ¢* will 
be (modulo coherence isomorphisms) a two-sided inverse for it. 


Suggestions for further reading: Moerdijk [831], Wraith {1235}. 


B4 
BLop/S AS A 2-CATEGORY 


B4.1 Finite weighted limits 


In this chapter we shall study the remarkable 2-categorical structure of the 
2-category BLop/S of bounded S-toposes, where S is a fixed topos. We shall 
restrict ourselves entirely to bounded geometric morphisms throughout the 
chapter: although some of our results can be extended to unbounded morphisms, 
the structure of Sop/S is much more delicate, and the gain in generality does 
not seem worth the extra trouble that would be involved in formulating all our 
results so as to apply to unbounded morphisms. In this first section, our aim is 
to construct finite weighted limits (in the sense of 1.1.5) in BYop/S. 

The construction of ordinary ‘conical’ finite limits was almost done in 
Section B3.3: there we saw that the pullback of a bounded geometric morphism 
along an arbitrary geometric morphism exists and is bounded. So, if we are given 
a pair of bounded morphisms f: F — E and g: G — E, both projections from the 


pullback 
H ———> G 


F —— E 
are bounded; since composites of bounded morphisms are bounded by 3.1.10(i), 
it follows that H is bounded over E£, and more generally over any topos S such 
that € is bounded over S. And since BYop/S is a full subcategory of Lop/S by 
3.1.10(ii), the square above is a pullback in it. 
Since the identity morphism on S is trivially a terminal object for BTop/S, 
we have thus established 


Theorem 4.1.1 For any topos S, BVop/S has finite conical limits. g 
Thus, to complete the proof that BTop/S has all finite weighted limits, it 


suffices by 1.1.5 to prove that it has cotensors with 2. The major part of the 
work is contained in 


Proposition 4.1.2 For any topos S, the 2-functor Cat(S) — Lop/S sending 
C to the diagram category [C, S| preserves cotensors with 2. 
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Proof Let us write D for the cotensor 2 tħ C in Cat(S): of course, we have 
Do = Cı, and D, is the object Sc of commutative squares in C, as defined 
in 2.6.1(d). Since the construction of D from C involves only finite limits, it is 
preserved by any inverse image functor; thus, in order to show that we have an 


equivalence 
Lop/S (E, [D, 5}) = (2, Lop/S (E, [C, S])] 


for any S-topos €, it suffices by Diaconescu’s Theorem to show that this 
equivalence holds for € = S, i.e. that we have 


Tors(D, S) ~ (2, Tors(C, S)| . 


Now the universal 2-cell 


in Cat(S) induces a 2-cell between the corresponding diagram toposes, since 
C ++ [C,S] is a 2-functor; and composition with this 2-cell yields a functor 
Tors(D,S) — [2, Tors(C,S)|. In order to show that this functor is one half of 
an equivalence of categories, we must investigate its effect in more detail. Since 
we are now working inside a single topos S, we shall feel free to use set-theoretic 
notation and terminology for its objects; the justification for this is to be found 
in Chapter D1. 

Let To, Tı : CP = S be two C-torsors, and a: To — T; a natural transforma- 
tion between them. We define U: D°P — S by the pullback diagram 


U(f) ————> Ti (cod f) 


Ti(f) 


Qdom f 
To(dom f) ———> T, (dom f) 


where f is an object of D, i.e. a morphism of C, with the obvious action on U of 
morphisms of D. We must first show that U is a torsor, i.e. that the domain U 
of the corresponding discrete fibration U — D is filtered. 

First, ob U is nonempty: since To is a torsor, there exists z € To(c) for some 
c € ob C, and then the pair (z,a,(z)) defines an element of U (14). Next, sup- 
pose we are given elements (z1, y1) € U (fı: c1 — d1), (£2, yo) E€ U (f2: cg > da). 
Then since Tp is a torsor there exist g;: c; — ¢3 (i = 1,2) and x3 € To(c3) 
with To(g:)(z3) = z; (i = 1,2). Let y3 = ac,(x3) € Ti(c3); then Tiı(g:)(y3) = 
Ti (fi)(yi) (i = 1, 2), so using the fact that T, is a torsor we can find morphisms 
hi: d; > dz (i = 1,2) and hg: c3 — d3, together with y4 € Tı(d3), such that 
hifi = hag; (i = 1,2) and Tı (hi)(y4) = y: (i = 1,2,3). Now the pair (£3, y4) 
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defines an element of U(h3), and we have morphisms (g;,h;): fi — h3 with 
U (gi, hi)(z3, ya) = (Ti, yi) (i = 1,2). So we have verified the second condition 
for filteredness of U. 

For the third condition, suppose given a diagram 


91 
Cl SEEE C2 
92 
fi fr 
hy 
di Z3 d 
he 


and an element (z,y) of U(f2) with To(g:)(z) = To(g2)(z) and T,(hi)(y) = 
Ti (h2)(y). Then there exists g3:c2 — c3 with 9391 = g3g2 and an element 
z’ E€ To(c3) such that To(gs)(z’) = z. Now if y' = ac,(z’) € Ti(c3), we have 
Ti (93)(y’) = Ge, (x) = Ti (f2)(y), so since T, is a torsor we can find h3: dz — d3 
and f3: c3 — d3 with f393 = h3fe and h3h, = hgh, and an element y” € T; (ds) 
such that 7)(f3)(y”) = y’ and T,(h3)(y”) = y. Thus the pair (z’, y”) defines an 
element of U(f3) satisfying U (g3, h3)(z’, y”) = (x, y). So we have verified that U 
is a torsor. 

Next, we must verify that if we apply the functor Tors(D,S) — 
{2, Tors(C,S)] which we defined earlier to U, we recover the transformation 
a: To > T; up to isomorphism. In particular, we must verify the coend formulae 


f 
Tole) = f C(edom f) x UCS) 


f 
nos f C(c,cod f) x U(f). 


The first of these is easy, because the coequalizer diagram defining the coend 
splits to yield an isomorphism f tc (c,dom f) x U(f) = U(1_), and it is clear 
from the definition of U that we have U(1.) = To(c). For the second, note that 
we have a dinatural map 


[[C (c cod f) x U(f) —> Tie) 
f 


sending (g, (z,y)) to T\(g){y), and hence an induced map @¢ from the coend to 
Ti(c). This map is surjective: given z € Tı (ec), there exists x € Tp(d) for some d, 
and then since T; is a torsor we can find f: d— e, g: c — e and y € Ti (e) such 
that Tı (g)(y) = z and T;(f)(y) = aa(zx); so the element of the coend represented 
by (g, (z, y)) is mapped by ¢ to z. 
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To show that ¢ is injective, suppose given elements (2;,y;) € U (fi: di > e:) 
(i = 1,2) and morphisms g;: c — e; such that T,(91)(y1) = Ti(g2)(y2). We have 
to show that (91,(%1,41)) and (g2, (2, y2)) represent the same element of the 
coend. But we can find h;: d; — da (i = 1,2) and z3 € To(d3) with To(hi)(r3) = 
x; (í = 1,2); and then for y3 = ag, (x3) we have T; (hi)(y3) = aa; (2:1) = Ti (fi) (yi) 
(i = 1,2), as well as T1(g1)(y1) = Tı (g2)(y2). So we can find k;: e; — e3 (1 = 1, 2) 
and k3: d3 — e3 with k3h; = kif; (i = 1,2) and kigi = koge, together with an 
element y4 € Ti (e3) with T1(k:)(ys) = yi (i = 1,2) and T1(k3)(ys) = y3. Then it 
is clear that the element (k191, (x3, y4)) of C (c,cod k3) x U(k3) represents the 
same element of the coend as both (g1, (21,y1)) and (g2, (£2, y2)). So we have 
identified the domain and codomain of the natural transformation 


f f 
f cedon ) x u(t) — | C (c,cod f) x U(f) 


(which sends the element represented by (g,(z,y)) to that represented by 
(f9,(z,y))) with To(c) and Ty(c) respectively; it is now straightforward to 
identify the morphism itself with a. 

Finally, we must verify that if we start with an arbitrary D-torsor U and 
construct a: To — Tı by the coend formulae above, then we may reconstruct U 
up to isomorphism by the pullback square 


U(f) —————> Ti (cod f) 


Ti(f) 


Qdom f 
Tp(dom f) ——S T;(dom f) 


But we certainly have a commutative square as above: the top edge sends z € 
U(f) to the element of the coend represented by (leod f, Z), and the left-hand 
edge sends it to the element represented by (laom f, 2). So we have to show that 
the induced map from U(f) to the pullback is bijective. 

For surjectivity, suppose f: c — d and we are given morphisms 


ha g2 f 91 hy 
dy ee es Se Se Se, Se 


and elements z; € U(hi) (i = 1,2), so that the pairs (g1, z1) and (g2,22) rep- 
resent elements of T, (d) and To(c) respectively. Then, since U is a torsor, the 
compatibility condition T;(f)(g91, 21) = @-(g2, 22) is equivalent to saying that we 
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have a commutative diagram 


k2 kı 
eS 


ha h3 hy 


pac ag gone ae 


and an element z3 € U(h3) such that Sven = z; (í = 1,2). But now 
(k292,l191) is a morphism f — hg in D, and the element U(k2g2,1191)(z3) of 
U(f) is easily seen to map to the two given elements of T;(d) and To(c). 

For injectivity, suppose we are given two elements 2),22 of U(f:c— d) 
such that (la, 21) and (la, z2) represent the same element of T; (d), and (1e, 21) 
and (1,,22) represent the same element of To(c). By the identification of To(c) 
with U(1,), the latter condition is equivalent to saying that U (le, f)(z1) = 
U(1¢, f)(z2) in U(1,); since U is a torsor, this implies that there are morphisms 
(gi, hi): f > f’ (i = 1,2) in D such that gı = g2 (and hi f = haf), and an ele- 
ment z3 € U(f’) such that U(g;,hi)(23) = zi (i = 1, 2). But now the pair (hi, 23) 
represents the same element of T(d) as (1la,2:) (i = 1,2), so from the fact that 
these two are equal we deduce that there exists (g3, h3): f’ — f” and z4 € U(f”) 
such that h3h, = h3h2 and U(g3,h3)(z4) = z3. So z1 = U(g391, h3hi)(z4) = 
U (9392, h3h2)(24) = 22. O 


Remark 4.1.3 As well as cotensors with 2, we know that the functor 
Cat(S) — BTop/S of 4.1.2 preserves finite products, by 3.2.13. However, it 
does not preserve all finite limits. The following counterexample borrows ideas 
from the theory of classifying toposes, and so properly belongs in the next section; 
but there seems to be little harm in giving it here. 

Let Set; denote (a small skeleton of) the category of finite sets, and consider 
the equalizer of 


do 
(2, Set] 3 Set, 
dy 


in Cat(Set). By definition (remember that we are dealing with pseudo-limits!), 
the objects of this category consist of objects (f: Ap — A1) of [2, Sets] equipped 
with an additional isomorphism g: Ag — Aj; but this object is (canonically) iso- 
morphic to the object (g~! f : Ao > Ao) equipped with the identity isomorphism, 
and hence the equalizer is equivalent to the category of finite sets equipped with 
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an endomorphism. (Warning: it is not true in general that strict equalizers and 
pseudo-equalizers in Cat(Set) yield equivalent categories.) The functor category 
[Set z, Set] is an object classifier over Set, by 3.2.9: that is, for any Set-topos € 
we have 


Top /Set (E, (Set, Set]) = £ . 


Hence by 4.1.2 [[2, Set +], Set] ~ 2 ch [Set j, Set] is a morphism classifier. Taking 
the equalizer in BYop/Set of the geometric morphisms induced by do and dj, 
we therefore obtain (by an argument like that already given) a classifying topos 
for the theory whose models in € are objects of € equipped with an endomor- 
phism. But this is a finitary algebraic theory, and the classifying topos for such 
a theory is well known to be of the form [C,Set] where C is the category of 
finitely presented models of the theory in Set (cf. D3.1.2). Since not every finitely 
presented model of the theory is finite (in particular, the free model on one gener- 
ator may be identified with the set of natural numbers with the successor map), 
this topos is not equivalent to [Co, Set] where Co is the equalizer computed in 
€at(Set). (In fact the latter can be shown to be a classifying topos for ‘torsion 
models’ of the theory, that is those in which every element generates a finite 
submodel.) 

Despite the foregoing, the functor Cat(S) — BTop/S does preserve arbitrary 
comma objects (f |g). This may be proved by a straightforward generalization 
of the argument in the proof of 4.1.2 (the latter being, of course, the special case 
in which f and g are identity functors). 


Theorem 4.1.4 For any topos S, BYop/S has cotensors with 2. 


Proof Let E be a bounded S-topos. If E is of the form [C,S], the existence 
of the cotensor 2 +h E follows immediately from 4.1.2. In general, we have an 
inclusion i: € — F where F is of the above form; then for any topos G the 
functor 


Top (G, €) —> Top (G, F) 


induced by composition with į is a full embedding, as we noted before 1.1.19. 
And this clearly remains true if we replace the 2-category Top by BLop/S. So, 
if we form the pullback 


P——> Exé 


(do, d1) 
2th F ——> FKF 


in BYop/S, it is easily verified that P has the universal property of a cotensor 
2h in Bop/S. o 


B4.1 FINITE WEIGHTED LIMITS 421 


Remark 4.1.5 As we observed in Section B3.4, we may regard BYop/S as a 
Cats-indexed category; so it makes sense to talk about cotensors with internal 
categories in S, not just with finite ‘external’ categories such as 2. And in fact 
we may show that all such cotensors exist (at least provided S has a natural 
number object), using the theory of classifying toposes to be developed in the 
next section: if € is a bounded S-topos, classifying a geometric theory T over S, 
then C rh € may be defined as the classifying topos for the theory of diagrams of 
T-models of shape C, and a presentation of the latter as a geometric theory may 
easily be constructed from one for T. On the other hand, the functor Cat(S) — 
BTop/S does not in general preserve these more general cotensors (that is, 
C h [D, S] is not equivalent to [D©,S]): the reason is that, although the hard 
part of the proof of 4.1.2 still works, the construction of the exponential D© in 
Cat(S) (cf. 2.3.15(ii)) involves exponentials in S, which are not preserved by 
inverse image functors. 

We shall also see later in this chapter that BYop /S has S-indexed products in 
an apprpopriate sense, and not just finite ones; cf. 4.3.4 below. Hence it also has 
arbitrary S-indexed conical limits, since these can be constructed from products 
and equalizers. In Part C, we shall prove a number of results about cofiltered 
S-indexed limits, that is limits indexed by cofiltered internal categories in S. 


The proofs of 4.1.1 and 4.1.4 both made use of the explicit representa- 
tion of bounded S-toposes provided by Giraud’s theorem. However, there are 
some important classes of weighted limits, in particular inverters and equifiers 
(cf. 1.1.4(e) and (f)), which can be constructed directly from the definition of a 
bounded morphism, using the machinery of A4.5.13. 


Lemma 4.1.6 Let p:E — S be a bounded S-topos. Then there is an object E 
of E such that, for any 2-cell 


in Yop/S with codomain E, a is an isomorphism iff its component ag is an 
isomorphism. 


Proof Let B be a bound for € over S. Form the pullback 


E ———— EB 


cpR xl 
p’p.(PB) x B ———~> PBxB 
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where PB is the power object of B, and e€ is the counit of (p* 4 p,). We shall 
show that if ag is an isomorphism then a, is an isomorphism for all objects A 
of €. 

First consider an object A of € which is a subobject of p*I x B for some 
object I of S. Let h: I — p,(PB) be the transpose across (p* < p,) of the name 
of the relation A — p*I x B; then there is a pullback diagram 


A — E 


p*(h) x1 
p*I x B ———> p"p, (PB) x B 


and hence a pullback diagram 


A ——— E 


* 


p*I Ph), Pp’ ps( PB) 


Now ag is an isomorphism by assumption; and a+; is the canonical isomorphism 
ftp" I = qt I = g*p* I, since a is a geometric transformation over S, and similarly 
for Qp*p,(PB)- SO the components of a at three corners of the pullback square 
above are isomorphisms; and since the pullback square is preserved by both f* 
and g*, it follows that a4 must be an isomorphism. 

Now suppose A is an arbitrary object of €. By the definition of a bounded 
morphism, we can find an epimorphism C -» A, where C is a subobject of 
p*I x B for some I. Let R = C be the kernel-pair of this epimorphism, and 
choose an epimorphism D — R where D is a subobject of some p*J x B. Then 
we have a coequalizer diagram D = C — A, and we know that ac and ap 
are isomorphisms by the argument above. But, once again, this coequalizer is 
preserved by both f* and g’*; so it follows that a, must be an isomorphism. 

l O 


Corollary 4.1.7 The 2-category BYop/S has inverters and equifiers. 


Proof To form the inverter of a geometric transformation 


let E be an object of € as in 4.1.6, and let j be the smallest local operator 
on F for which L(ag) is an isomorphism (where L is the associated j-sheaf 
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functor), as constructed in A4.5.14(c). Then a geometric morphism hk: G + F 
factors through sh;(¥) — F iff the local operator k corresponding to its image 
satisfies k > j, iff h* (œg) is an isomorphism. So it follows from 4.1.6 that the 
inclusion sh;(¥) — F is the required inverter. 

To form the equifier of a parallel pair of geometric transformations 
a, b: f 3 g, we similarly construct the smallest local operator j on F for which 
the associated sheaf functor L maps the equalizer of ag and 8g to an isomor- 
phism. Then L(agz) = L(g); and by an argument just like that in the proof 
of 4.1.6 it follows that L(a4) = L(8,) for every object A of F, and hence that 
sh;(F) — F is an equifier for a and £. o 


We shall make extensive use of inverters in BYop/S in Section B4.5 below. 
Also in that section, we shall require a remarkable result about comma objects 
in BTop/S, originally due to A. M. Pitts. What is surprising about it is its 
‘something for nothing’ aspect: starting from a mere S-indexed left adjoint, we 
end by constructing one which is not only S-indexed but G-indexed, where G is a 
topos over S. The proof of this result involves working with sites, so we defer it 
to Part C (specifically, to C2.3.17); but we state the result here, so that we can 
refer to it in Section B4.5. Recall from 3.1.1 that a geometric morphism over S is 
said to be S-essential if its inverse image functor has an S-indexed left adjoint. 


Theorem 4.1.8 Let 
q 
(Fi) ——>G 


P mA g 


| aw 5 


be a comma square in BYop/S, and suppose f is S-essential. Then q is 
G-essential, and the Beck-Chevalley natural transformation qp* — g* fı (the 
mate of the (non-invertible!) natural transformation p*f* — q*g*) is an 
isomorphism. 


There is one particular case in which we can prove 4.1.8 with the tools cur- 
rently at our disposal: namely that in which f and g are induced by functors 
between internal categories in S. For we noted in 4.1.3 that the comma square 


(F1) —>D 


C ——— > B 
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in €at(S) is preserved by the functor Cat(S) — BTop/S of 2.3.22; and the top 
edge of this square is a split opfibration, i.e. it corresponds to a (strict) functor 
D — Cat(S). (In set-theoretic terms, it is the functor which sends an object D to 
the comma category (f | D), and a morphism 6: D — D’ to the functor induced 
by composition with 6.) Thus we can identify the morphism [(f | g), S] — [D, S] 
induced by q with one of the form [A,[D,S]] — [D,S] where A is an internal 
category in [D, S]; and the latter is clearly [D, S]-essential (just as [C, S] — [B, S] 
is automatically S-essential in this case), by the remark after 3.1.1. Also, we 
noted in 2.5.11 that the Beck—Chevalley condition holds for comma squares of 
this type. 


B4.2 Classifying toposes via weighted limits 


One of the central themes of topos theory is the construction of classifying 
toposes for geometric theories, and the study of their properties. Since this 
properly belongs to the interaction between toposes and (first-order) logic, it 
will be treated in detail in Part D — specifically, in Chapter D3. However, there 
is an alternative presentation of the construction, which makes it a part of the 
2-categorical study of Top, and which may appropriately be described at this 
point. We shall not presume any familiarity with the formal definition of a geo- 
metric theory (which can be found in Chapter D1); for this presentation, it will 
suffice to take a more informal notion of a ‘theory’ as a functor on toposes over 
a given base. 


Definition 4.2.1 (a) Let S be a topos. By a theory T over S, we 
mean (informally) a (BTop/S)-indexed category, ie. a (pseudo-)functor 
T: (BLop/S)°? CAT. That is, T assigns to each bounded S-topos p: E > S 
a category T(E) (whose objects are called T-models in £}, to each geometric 
morphism f: F — E over S a functor f*: T(€) — T(F), and to each geometric 
transformation a: f — g over S a natural transformation a: f* — g*, subject 
to the obvious compatibility conditions. 

(b) By a classifying topos for a theory T over S, we mean a representing 
object (in the up-to-equivalence sense) for the functor T; that is, a bounded 
S-topos S[T] equipped with a distinguished object G of T(S[T]) (the generic 
T-model over S) such that, for each bounded S-topos €E, the functor 


BTop/S (E, SİT) —> T(E) 
which sends a geometric morphism f to f*(G) (and a geometric transformation 
a to ac), is one half of an equivalence of categories. 


Clearly the classifying topos of a theory T, if it exists, is unique up to canon- 
ical equivalence in BTop/S. We note that the existence of a classifying topos 
for T imposes some restrictions on T: in particular, it must transform arbitrary 
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(weighted) colimits in BTop/S to limits in CAT. Applying this to the results 
of 3.4.1 and 1.5.6, we obtain 


Lemma 4.2.2 If T has a classifying topos over S, then for any S-topos E 
the €-indexed category Te obtained by restricting T to toposes of the form E/A 
(and geometric morphisms over € between them, cf. 3.2.8(b)) ts a stack for the 
coherent coverage on E. 


Proof For any two objects A and B of €, we have €/AUB~ €&/Ax €/B, 
since coproducts in € are stable under pullback; so 3.4.1 implies that 
T(€/A I B) ~ T(€/A) x T(€/B). Similarly, if f: A -» B is an epimorphism in 
E€ with kernel-pair a,b: R3 A, then 1.5.6(iii) implies that we may identify 
E/B with the category Desc(E,(f)) of objects g: X — A of E/A equipped 
with descent data a: a*X — b*X relative to f. So, if a geometric morphism 
h: €/A — F is ‘equipped with descent data for f’, in the sense that h* lifts to a 
functor F — Desc(E,(f)), then it factors uniquely through €/A — €/B. Tak- 
ing F to be the classifying topos for T, this says that there is an equivalence 
T(€/B) ~ Desc(Te, (f)). o 


Similarly, we have 


Lemma 4.2.3 If T has a classifying topos over S, then for any S-topos E we 
have T([2, €]) ~ [2, T(E)]. 


Proof In 3.4.2 we saw that [2,€] is the tensor 2 & € in Yop (and hence in 
Zop/S, if E£ is defined over S); so T must map it to a cotensor in CAT. o 


Lemma 4.2.3 implies that, if we know what the objects of T(&) are for all 
bounded S-toposes € (and T is ‘classifiable’), then we must know the morphisms 
as well. For example, from the fact that the theory of objects is classifiable 
(cf. 4.2.4(a) below), we can deduce that there does not exist a classifying topos 
for the theory defined by taking T(E) to be the category of all objects of € and 
monomorphisms between them (even though this theory is easily seen to satisfy 
the criterion of 4.2.2). 

Before embarking on the general construction of classifying toposes, it seems 
appropriate to give some examples of the theories we have in mind. 


Examples 4.2.4 (a) The theory O of objects is simply the forgetful functor 
which sends an S-topos to its underlying category, a geometric morphism to its 
inverse image functor and a geometric transformation to itself. We saw in 3.2.9 
above that this theory has a classifying topos over any topos S with a natural 
number object, namely the diagram category [S,, S]. 

(b) More generally, if D is a category, the theory D of diagrams of shape 
D sends € to the functor category [D,E], and a geometric morphism f to the 
functor obtained from composition with f*. In the particular case when P is the 
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finite category 
e ——— eo, 


we shall denote this theory by M and call it the theory of morphisms. And in the 
particular case when D is the finite discrete category with m objects, we shall 
denote it by Om and call it the theory of m-tuples of objects. 

(c) As a further generalization of (b), if T is any theory over S and D is a 
category, we may construct a theory [D, T] whose models are diagrams of shape 
D in the category of T-models. 

(d) Let I be an object of S. The theory O; of I-indezred families of objects is 
given by Oz(€) = €/I (by which, as usual, we mean €/p*I, where p: E — S is 
the structure morphism of £), and f*: O7;(€) — O;(F) is f* applied to objects 
over p* I. Clearly, the theory Om defined in (b) above may be identified with the 
special case of O; where T is an m-fold copower of 1. Similarly, we may form the 
theory M of /-indexed families of morphisms, and so on. 

(e) Again, let I be an object of the base topos S. Then we may con- 
struct a theory Sub; whose models in € are subobjects of p*Į (with inclusions 
between subobjects as morphisms). We saw in 3.2.10 above that this theory has 
a classifying topos over any topos S, namely the diagram category [K(J), S]. 

(f) Another generalization of (b) above: if C is an internal category in S, then 
we have a theory whose models in € are diagrams of shape C (that is, of shape 
p*(C)) in £. Similarly, we have a theory Tors(C, —) whose models are C-torsors 
in S-toposes. Of course, Diaconescu’s Theorem 3.2.7 is exactly the statement 
that the latter theory has a classifying topos, namely [C, S]. And 3.2.6 combined 
with 3.2.7 shows that the theory of diagrams of shape C has a classifying topos 
provided S has a natural number object, namely the topos [(Cart(C))°?, S] where 
Cart(C) is the cartesian category freely generated by C. 


In order to build more complicated examples of theories, we shall make use 
of the notion of a geometric construct in a theory, which we now define. 


Definition 4.2.5 Let T be a theory over S, as defined in 4.2.1(a). By a geo- 
metric construct in T, we mean a (BZop/S)-indexed functor F: T — O, where 
O is the theory of objects defined in 4.2.4(a). Informally, F may be thought 
of as something which ‘constructs’ an object F(M) of € from a T-model M 
in an S-topos €, and a morphism F(M) — F(N) from each morphism of 
T-models M — N, in a manner which is compatible with composition of T-model 
morphisms and with the functors f* between T-models in different S-toposes. 
A morphism of geometric constructs a: F — G is a (BYop/S)-indexed natu- 
ral transformation; equivalently, it may be thought of as a (BTop/S)-indexed 
functor from T to the theory M of 4.2.4(0). 


If T has a classifying topos over S, then a geometric construct F in T is deter- 
mined up to isomorphism by the object F(G) of S[T], where G is the generic 
T-model; for, given any T-model M is an S-topos £, we have F(M) = f*(F(G)), 
where f is the geometric morphism such that f*(G) = M. In fact this defines an 


B4.2 CLASSIFYING TOPOSES VIA WEIGHTED LIMITS 427 


equivalence between S[T] and the category of geometric constructs in T, by the 
‘2-dimensional Yoneda lemma’. Provided the theory O also has a classifying 
topos, we may also identify geometric constructs in T with geometric mor- 
phisms S[T] — S[O] over S. However, a more ‘concrete’ feel for what geometric 
constructs look like may be provided by the following examples: 


Examples 4.2.6 (a) Let J be an object of the base topos S. The ‘constant 
construct’ which assigns to every T-model in an S-topos p: E — S the object p*I 
is clearly geometric. In particular, if S has a natural number object, then the 
constant functor sending a T-model to the natural number object (in the topos 
where it lives) is a geometric construct. 

(b) If n is a natural number, then the functor A + A” is a geometric con- 
struct in the theory O of objects, since inverse image functors preserve nth 
powers. More generally, if F}, Fo,..., Fn are all geometric constructs in a theory 
T, then so is their (pointwise) product; so, for the theory D of 4.2.4(b), we have 
a geometric construct sending a diagram G: D — € to []j_, G(f(2)), for any 
function f: {1,2,...,n}— ob D. 

(c) Still more complicated finite limits and colimits may also be obtained 
as geometric constructs. For example, let G be a group in a topos €. For any 
natural number n, the object G, of elements of G of order dividing n may be 
constructed as the equalizer of the nth power map G — G and the constant 
map G — 1 £ G. Clearly, this equalizer is preserved by any cartesian functor 
(in particular, by any inverse image functor); and it is functorial with respect 
to group homomorphisms in €. So we can regard it as a geometric construct in 
the theory of groups, over any base topos S. If S (and hence £) has a natural 
number object, then we may form ‘the union of the Gn for all n’ as a subobject 
of G x N; and by taking the image of this subobject under the first projection 
we obtain the subobject G; — G of torsion elements of G; once again, this is 
functorial with respect to group homomorphisms in £, and preserved by inverse 
image functors, and so it is a geometric construct. 


Definition 4.2.7 (a) Let T be a theory over S. A theory T is called a simple 
functional extension of T if there exist geometric constructs F and F’ in T such 
that, for every S-topos £, T’(€) is the category whose objects are pairs (M, f) 
where M is a T-model in £ and f: FM — F’M in €, and whose morphisms are 
T-model homomorphisms h: M — M’ such that 


FM ace F'M 
Fh Fh 
J 


FM’ ——> F'M’ 
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commutes; and the effect of T’ on geometric morphisms and geometric transfor- 
mations is induced by that of T in the obvious way. 

(b) Let T be a theory over S. A theory T’ is called a simple geometric quotient 
of T if there is a morphism f: F — F” of geometric constructs in T such that, 
for every S-topos €, T’(E) is the full subcategory of T(€) on those T-models M 
for which fm: FM — F'M is an isomorphism, and the effect of T’ on geometric 
morphisms and geometric transformations is induced by that of T in the obvi- 
ous way. (We sometimes refer to the assertion that fm is invertible, for some 
particular T-model M, as a geometric axiom satisfied by M.) 

(c) We say T is a geometric theory over S if there is a finite sequence of 
theories (To, T1,..., Tn) such that To is the theory of m-tuples of objects for 
some finite m, T, is equivalent to T, and each Tj, is either a simple functional 
extension or a simple geometric quotient of Tj. 


In D1.1.6 we give a very different-looking definition of the notion of geometric 
theory; but it turns out to have exactly the same expressive power as the one 
given above (that is, given a geometric theory in the above sense over Set, 
there is a geometric theory in the sense of D1.1.6 giving rise to an equivalent 
(BTop /Set)-indexed category, and vice versa). We shall not give a formal proof 
of this equivalence; but in fact such a proof could be extracted from the list of 
examples of geometric theories which follows. 


Examples 4.2.8 (a) The theory M of morphisms is a geometric theory, since 
it is a simple functional extension of the theory Oz of pairs of objects. More 
generally, for any finite category D, the theory D of 4.2.4(b) may be obtained 
from the theory Om (where m is the number of objects of D) by making a 
finite number of simple functional extensions, followed by a finite number of 
simple geometric quotients to ensure that the commutativity of diagrams in D 
is respected (to force a diagram to commute, we force the equalizer of the two 
ways round it to be an isomorphism). So it is a geometric theory. 

(b) More generally, for any finite family of finitary operation-symbols 
{w,.-.,Wn}. (where each w; has a preassigned arity a; which is a natural num- 
ber), we may obtain the theory whose models are objects A equipped with 
morphisms w;: A% — A as an iterated functional extension of O, using 4.2.6(b). 
And we may then use iterated geometric quotients to impose equations between 
composites of these operations: that is, any finitely-presented (finitary) algebraic 
theory may be regarded as a geometric theory. (Examples of such theories include 
groups, rings, lattices, Heyting algebras (cf. A1.5.11), and so on.) And we may 
also consider many-sorted algebraic theories in this way, provided the number of 
sorts is finite. 

(c) However, we may also consider geometric quotients corresponding to 
axioms which are not algebraic. For example, if our base topos S has a nat- 
ural number object, then the theory of torsion groups (resp. torsion-free groups) 
is a geometric theory over S, obtained from that of groups by adding the axiom 
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that G; — G (resp. e: 1 — G+) is an isomorphism, where G; is the geometric 
construct defined in 4.2.6(c). 

(d) Let T be a theory over S. By a simple S-specialization of T, we mean 
a theory T’ such that, for some geometric construct F in T and some object I 
of S, T’(E) is the subcategory of T(€) whose objects are those T-models M for 
which FM = p*I, and whose morphisms are those T-model morphisms which 
induce the identity on p*I. Thus, for example, the theory Oz of 4.2.4(d) is 
a simple S-specialization of the theory M of morphisms. We claim that if T 
is a geometric theory over S, then so is any simple S-specialization of T; for 
we may construct a theory equivalent to T’ by first using a simple functional 
extension to adjoin a morphism of geometric constructs from F to the constant 
construct J, and then using a simple geometric quotient to make this morphism 
an isomorphism. Hence in particular O; is a geometric theory for any object I 
of S. 

(e) The theory Sub; of 4.2.4(e) is a simple geometric quotient of Oz, obtained 
by imposing on an O;-model (A — p*J) the requirement that the diagonal map 
A >= A Xp; A should be an isomorphism. So it is a geometric theory. 

(f) Generalizing (a), let C be an internal category in S. Then the theory 
of diagrams of shape C is a geometric quotient of a simple functional extension 
of Oca; so it is a geometric theory. (Alternatively, we could obtain this theory 
by first forming the theory whose models are pairs of internal categories with a 
discrete opfibration between them, and then ‘specializing’ the codomain of the 
opfibration to be the particular category C.) 

(g) Similarly, we may replace the finite presentability restrictions which 
appeared in (b) by ‘S-presentability’ using the notion of S-specialization: that is, 
we may consider finitary algebraic theories whose sorts, generating operations 
and defining equations are all indexed by objects of the base topos S, rather 
than by finite sets. (A good example, albeit a single-sorted one, is the theory 
of modules over an internal ring in S.) And the same applies to the more com- 
plex theories considered in (c): forcing an ‘J-indexed family’ of simple geometric 
axioms to hold is the same as forcing a single morphism F' — F’ to be an isomor- 
phism, where F and F” are ‘J-indexed families of geometric constructs’, that is, 
geometric constructs equipped with morphisms to the constant construct J. 

(h) The theory of filtered categories is a geometric quotient of the theory of 
(arbitrary) categories, and hence a geometric theory: we simply have to add to 
the theory of categories three geometric axioms which say that the images of the 
three morphisms of 2.6.2(a) are isomorphisms. Combining this with (f) above, 
we deduce that the theory of C-torsors is geometric, for any internal category C 
in S. 

(i) If € is a classifying topos for a geometric theory T, then any sheaf subtopos 
sh, (€) of £ classifies a simple geometric quotient T; of T: for A4.3.11 tells us that 
a geometric morphism f: F — E over S factors (uniquely) through sh, (E€) > E€ 
iff f* maps j-dense monomorphisms to isomorphisms, iff f* (d) is an isomorphism 
where d: 1 — J is the generic j-dense monomorphism. And, by the remarks 
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after 4.2.5, we may regard d as a morphism of geometric constructs in T. Com- 
bining this observation with the last sentence of (h) above (and with Giraud’s 
Theorem 3.3.4), we have shown that for any bounded S-topos €, the theory 
BLop/S(—,E€) classified by E is geometric. 


The main theorem of this section is that the result just stated has a converse, 
provided the base topos S has a natural number object. 


Theorem 4.2.9 Let S be a topos such that the classifying topos S[O] exists in 
BTop/S (for example, any topos with a natural number object). Then, for any 
geometric theory T over S, the classifying topos S|T] ezists. 


Proof Clearly, it suffices to prove (i) that the theory Om of m-tuples of objects 
has a classifying topos, (ii) that if T has a classifying topos then so does any 
simple functional extension of T, and (iii) that if T has a classifying topos then 
so does any simple geometric quotient of T. But each of these is easy, given the 
results of the previous section. For (i), we take the m-fold product of copies 
of the object classifier S[O] in BLop/S. For (ii), suppose we wish to adjoin a 
morphism F — F’, where F and F” are geometric constructs in T. We may 
regard F and F” as geometric morphisms 


f 
sit] = sO} 
f' 


in BTop/S; and it is then clear that the inserter of this parallel pair of mor- 
phisms, as defined in 1.1.4(d), is a classifying topos for the extended theory T’. 
Similarly for (iii), if T’ is a simple geometric quotient of T then we may construct 
its classifying topos as the inverter (1.1.4(e)) of a 2-cell between two morphisms 
S[T] 3 SJO]. o 


We have seen that the only step in the construction of classifying toposes 
which requires a natural number object in the base topos is the initial one of 
constructing an object classifier; the rest of the construction is insensitive to the 
particular properties of the base. It is therefore of interest to ask whether an 
object classifier might exist over some topos without a natural number object. 
However, the answer is no: the following argument is due to A. Blass [130]. 


Lemma 4.2.10 Let T be a finitely-presented algebraic theory, and suppose 
there exists a classifying topos p: S[T] > S for T over S. Then T(S) has an 
initial object. 


Proof Consider p,(G), where G is the generic T-model in S{T]. Since p, is 
cartesian, px(G) has a T-model structure in S. But given any T-model A in S, 
there exists a geometric morphism f: 8S — S(T] over S such that f*(G) = A, 
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and hence we have a composite morphism 


p.(G) ——> f*p*p,(G) Jey FG) — = A 


in T(S), where € is the counit of (p* 4 p,). 

Thus we have shown that p,(G) is a weakly initial object in T(S), i.e. that 
there exists at least one morphism from it to any other T-model in S. In fact, as 
will be clear from the construction of classifying toposes given in Section D3.1, 
px(G) is actually initial in T(S}; but it does not seem possible to prove this 
directly. However, it is clear that for any T-model A in S we may form the 
subobject PyA of PA which is ‘the object of sub-T-models of A’, by intersecting 
a finite number of equalizers, and that this subobject is closed under internal 
intersections in PA; hence we may obtain the unique smallest subobject A’ — A 
which is a sub-T-model of A. If we apply this to p,(G), the sub-T-model A’ we 
obtain is still weakly initial in T(S); but, for any two T-model homomorphisms 
A’ = B, their equalizer is a sub-T-model of A’, and so must be the whole of A’. 
Hence A’ is actually an initial object. a 


Theorem 4.2.11 Fora topos S, the following are equivalent: 


(i) S has a natural number object. 
(ii) BTop/S has an object classifier. 
(iii) BTop/S has classifiers for all geometric theories over S. 


(iv) BLop/S has a classifier for the algebraic theory freely generated by one 
unary and one nullary operation. 


Proof (i)=> (ii) was proved in 3.2.9, and (ii) > (iii) in 4.2.9. (iii) > (iv) follows 
from 4.2.8(b), and (iv) = (i) follows from 4.2.10, since a natural number object 
is by definition an initial model of the theory described in (iv) (cf. A2.5.1). O 


Nevertheless, over a topos S without a natural number object we may con- 
struct classifying toposes for all propositional geometric theories. The formal 
definition of a propositional geometric theory will be found in D1.1.7(m); for 
present purposes, it may be taken to be theory built up by an inductive process 
like that of 4.2.7, but starting from the theory of a finite family of subobjects 
of (given) objects of the base topos rather than from the theory of m-tuples of 
arbitrary objects. 


Proposition 4.2.12 For any topos S, every propositional geometric theory T 
over S has a classifying topos S[T], which is localic over S. 


Proof The outline of the argument is similar to that of 4.2.9, but we take 
3.2.10 rather than 3.2.9 as our starting-point. That is, if our initial theory To is 
the theory of m-tuples (A; — J, A2 >> I2,...,Am > Im) (where hh, Io,...,Im 
are given objects of S), we take S[To] to be the product in BYop/S of the 
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diagram toposes [K(J;),S] (1 < j < m). For the inductive steps, we cannot 
argue from the existence of finite weighted limits in BZop/S as we did in the 
proof of 4.2.9, since we no longer have the object classifier available; but we can 
argue directly as follows. If T’ is a simple geometric quotient of T, correspond- 
ing to the requirement that f: F — F" be invertible, then we can take S[T’] 
to be sh;(S[T]), where j is the smallest local operator whose associated sheaf 
functor maps fg: FG — F’G (where G is the generic T-model) to an isomor- 
phism (cf. A4.5.14(c)). For simple functional extensions, we note first that the 
diagram category [K(F'G), S[T]] classifies T-models M equipped with a distin- 
guished subobject of FM; so by first adjoining a generic subobject of F x F” in 
this way, and then using a simple geometric quotient to force it to be the graph 
of a morphism F — F’, we may obtain a classifying topos for a simple functional 
extension of T. Finally, we observe that the two constructions we have used (tak- 
ing sheaves for a local operator, and taking diagrams on an internal poset) both 
give rise to localic geometric morphisms, by A4.6.2(a) and (d); since a composite 
of localic morphisms is localic by A4.6.2(e), it follows that any classifying topos 
constructed in this way will be localic over S. O 


Once again, Proposition 4.2.12 has a converse: for any localic S-topos €, the 
theory T defined by T(F) = BTop/S (F, €) is (equivalent to) a propositional 
geometric theory over S. The proof of this follows from the representation of 
localic toposes in 3.3.5, together with the observation in 3.2.4(d) that if P is a 
preorder then the theory of P-torsors is equivalent to the theory of ideals of P 
(which is easily seen to be a propositional geometric theory). Cf. also D1.2.15(m). 


Suggestions for further reading: Blass [130], Tierney [1169]. 


B4.3 Some exponentiable toposes 


The 2-category BYop/S is not cartesian closed, but it contains an important 
class of exponentiable objects, i.e. S-toposes p: E — S such that the 2-functor 
(-) xs E: BLop/S — BTop/S has a right 2-adjoint. The precise determina- 
tion of this-class belongs properly to Part C of this book, since it relies on a 
topologically-inspired notion of ‘local compactness’ for S-toposes; and we shall 
describe it in Section C4.4. In the present section, we shall see what can be 
said about exponentials in BYop/S by the general 2-categorical methods of this 
chapter. We shall make heavy use of the theory of classifying toposes developed 
in the previous section, and in particular of the object classifier S[O]; therefore 
we shall assume throughout this section that our base topos S has a natural 
number object. 
We say that a particular exponential FE exists if the functor 


BTop/S((-) xs E, F): BLop/S°? —> CAT 


is representable (as usual, we mean this in the ‘up-to-equivalence’ sense). 
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Theorem 4.3.1 Let S be a topos with a natural number object. Then a bounded 
S-topos p: E — S is exponentiable iff the particular exponential S[O|* ezists. 


Proof We rely on the fact, established in 4.2.8(z), that every bounded S-topos 
occurs as the classifying topos for some geometric theory: what we shall actually 
prove is that if S[O]* exists then so does S[T]* for any geometric theory T over S. 
Now in the proof of 4.2.9 we saw how all the toposes S[T] may be built up by 
finite weighted limits from S/O]; and since (—)*, to the extent that it exists, 
is a right adjoint, it should preserve these weighted limits. Thus, for example, 
to construct the classifying topos for a simple functional extension T’ of T, we 
formed the inserter of a diagram of the form 


f 
S(T] == S(O); 
f 


so in order to construct S[T’]© from S[T]F, we simply form the inserter of the 
diagram 


fE 
SITE ——s sS[O]E 
FE 
and verify that it has the desired universal property. (Here fË of course 
corresponds to the composite 


f 


SmE xs € —> S{T] ——> So}, 
where ev corresponds to the identity morphism on S[T]® .) o 


Another way of stating 4.3.1 involves a notion of ‘duality’ for geometric the- 
ories. If T is a geometric theory over S, then for any bounded S-topos p: E —> S 
we may also regard it (by restriction) as a geometric theory over €, and a purely 
formal argument shows that there is a pullback square 


E[T] ——— S{T] 


E —— >S 


in BTop. Thus we may regard the assignment € ++ €[T] as itself defining a theory 
T* over S in the sense of 4.2.1 (the effect of T* on a geometric morphism f being 
given by the inverse image of the pullback of f along S[T] — S, and its effect 
on geometric transformations being obtained by the method of 3.2.14). We say 
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that the theory T is dualizable if T* is itself a geometric theory (equivalently, has 
a classifying topos). (The word ‘dual’ should not be taken too seriously in this 
context: the relation between T and T* is far from symmetric. The paradigm 
one should: have in mind is that of the dual of an infinite-dimensional vector 
space. However, we note that the construction is indeed contravariant: given a 
morphism of theories S — T, that is a family of functors S(€) — T(E) which are 
natural in E, we obtain a geometric morphism €{S] — €[T] for each £, classifying 
the T-model obtained by applying the given functor to the generic S-model, and 
the inverse images of these morphisms define a morphism of theories T* — S*.) 


Corollary 4.3.2 A bounded S-topos is exponentiable iff it is the classifying 
topos over S of a dualizable geometric theory. 


Proof Suppose € ~ S[T] and T* is geometric. Then S[T*] has the universal 
property of an exponential S[O]*; for we have 


BLop/S (F,S[T*]) ~ T*(F) 
= F{[T] 
~ F xs S|T] 
~ BLop/S (F xs E, S[O]) 


for any bounded S-topos F. Hence by 4.3.1 E is exponentiable. Conversely, if 
E is exponentiable and classifies a geometric theory T, then the reverse of the 
above argument shows that the exponential S[O}* is a classifying topos for T*. 

o 


We already know one class of examples of dualizable geometric theories: 
namely, the theories of C-torsors, for arbitrary internal categories C in S. 


Lemma 4.3.3 For any internal category C in S, the diagram category |C, S] 
is exponentiable in BLop/S. 


Proof -By 3.2.14 we have E x s [C, S] ~ [p*C, E] for any p: E — S, so if T is the 
theory of C-torsors then we can take T* to be the theory of diagrams of shape 
C in S-toposes — recall that we saw in 4.2.8(f) that this is a geometric theory. DJ 


Remark 4.3.4 As a special case of 4.3.3, we note that the slice category S/I 
is exponentiable in BTop/S, for any object I of S. It follows that BLop/S 
has ‘S-indexed products’, as we claimed in Section B4.1: that is, if we are 
given an IJ-indexed family of bounded S-toposes (that is, a bounded S/J-topos 
p: E —> S/I), we may form a bounded S-topos I; (E) such that, for any bounded 
q: F — S, we have 


BLop/s (F, I; (E)) ~ BLop/(S/T) (F/q°1,€) . 
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To do this, we simply form the pullback 


HE) ——> ES! 


S ——> S/I” 
as in A1.5.2(i). 
Next, we note three basic ‘stability properties’ of exponentiable toposes. 
Lemma 4.3.5 


(i) Exponentiability is stable under composition; that is, if p: E — S is expo- 
nentiable in BTop/S, then so is pq: F  S for any exponentiable object 
q: F — E of BIop/E. 

(ii) Exponentiability is stable under pullback; that is, if p: E + S is erponen- 
tiable in BLop/S and we have a pullback 


F—€ 


q p 


T ——— S 
then q: F > T is exponentiable in BTop/T. 


(iii) Exponentiability is stable under retracts; that is, if q: F — S is a retract 
in BLop/S of an exponentiable topos p: E —> S, then q is exponentiable. 


Proof (i) Generalizing the observation in 4.3.4, an object p: E — S is expo 
nentiable iff the pullback functor p*: BTop/S — BTop/E has a right adjoint. 
The latter condition is clearly stable under composition; so the result follows 
immediately. 

(ii) Suppose € classifies a geometric theory T over S. Then it is clear from 
the definition of the dual theory T* that its restriction to a theory over T is the 
dual of the corresponding restriction of T; so it is geometric if T* is. 

(iii) is immediate from the fact that S[O]‘-), to the extent that it is defined, 
is contravariantly functorial; if e is the idempotent endomorphism of € corre- 
sponding to the retract F, then S[OJ¢ is an idempotent endomorphism of S{O]*, 
and if we split it (cf. 1.1.9(b)) we obtain a topos having the universal property 
of S[O}*. 0 


Combining 4.3.3 and 4.3.5(i), we have 


Corollary 4.3.6 If € is exponentiable in BYop/S, then so is [C,E] for any 
internal category C in E. 0 
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In particular, any slice of an exponentiable topos is exponentiable. In the 
converse direction, we have 


Lemma 4.3.7 If €/A is exponentiable for some well-supported object A of €E, 
then E is exponentiable. 


Proof If €/A is exponentiable, then so is €/A” for any natural number n, 
by the remark immediately before the statement of the lemma. But by 1.5.6(iii) 
we may identify E with the category of objects of €/A equipped with descent 
data relative to the morphism A — 1; equivalently, by 3.4.12, with the (pseudo-) 
colimit of the simplicial topos €/A?®, or of its 2-truncation. Hence we may define 
FE, for any bounded S-topos F, to be the pseudo-limit of the 2-truncation of 
the cosimplicial topos F&/4". D 


In contrast to 3.4.7, the property of exponentiability does not ‘descend’ along 
more general descent morphisms in BTop/S (see C4.4.9). 

In addition to diagram categories [C, S], the other class of bounded S-toposes 
whose exponentiability we shall need to invoke in the next section is the class 
of coherent S-toposes. Unfortunately, it seems hard to give a precise definition 
of this class, let alone to prove that its members are exponentiable, without 
trespassing on material which properly belongs in Parts C and D of this book; 
so the reader who has not yet mastered those parts will have to be content for the 
moment with a rather informal account. (Such a reader can rest assured that we 
shall give rigorous definitions of the notions of coherent theory and of coherent 
topos in Sections D1.1 and D3.3 respectively, and that a rigorous proof of the 
exponentiability of coherent toposes will follow easily from the results of those 
sections and of Section C4.4 — see in particular C4.4.6(a).) 

Informally, then, we say that a geometric construct F in a theory T, in the 
sense of 4.2.5, is coherent if F(M) can be built up from the objects defining 
the T-model M using only those categorical constructions (finite limits, finite 
coproducts and coequalizers of equivalence relations) which are preserved by 
arbitrary coherent functors (and not just inverse image functors). And we say 
that T is a coherent theory if it can be obtained from the theory of diagrams of 
shape C, for some internal category C in S, by adjoining ‘S-indexed families of 
coherent axioms’, that is families of axioms which assert that morphisms between 
two coherent constructs are isomorphic (cf. 4.2.7). 

For example, any finitely-presented algebraic theory, as considered in 4.2.8(b), 
is coherent. On the other hand, the theory of torsion groups, as defined in 
4,2.8(c), is not coherent, because the torsion subgroup G; is not a coherent 
construct (being defined as an infinite union). But the theory of torsion-free 
groups is coherent, because instead of imposing the single geometric axiom which 
says that 1 — G; is an isomorphism, we can impose the N-indexed family of 
axioms which say that 1 — G, is an isomorphism (using the notation of 4.2.6(c)), 
and each G, is a coherent construct. Rather more subtly, the assertion that 
a given diagram of shape C°? is a C-torsor cannot, in general, be expressed 
by coherent axioms — even though the theory of filtered categories is coherent. 
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This is because, for example, the first condition of 2.6.2(a) for filteredness of 
the total category of the discrete fibration D — C corresponding to a diagram 
D involves consideration of the geometric construct Do, which is not coherent 
(unless Co is finite) because it is the Co-indexed coproduct of the objects in the 
diagram. However, if C is finite then the notion of C-torsor is coherent; also, if 
C is cocartesian, then we know that C-torsors are the same thing as cartesian 
functors defined on CP, by 3.2.5, and these can clearly be defined by coherent 
axioms. (For further examples of what can and cannot be expressed by coherent 
axioms, see D1.1.7 and the remarks at the end of Section D2.4.) 

Of course, we say a bounded S-topos is coherent if it is (equivalent to) the 
classifying topos of a coherent theory over S. The key result which we require 
about coherent toposes is 


Theorem 4.3.8 A bounded S-topos is coherent iff it can be presented as the 
topos Shs(C,T) of S-valued sheaves on an internal site (C,T) in S such that 
the category C is cartesian and the coverage T consists entirely of finite families 
(that is, families indexed by finite cardinals in S). 


Proof See D3.3.1 for a detailed proof. Intuitively, the reason why the result 
is true is that the classifying topos for the theory of diagrams of shape D may 
be identified with a topos of the form [C°P, S] where C is cartesian (specifically, 
C°P is the category of all diagrams of shape D which are finite colimits of repre- 
sentables - we may construct this as an internal full subcategory of [D, S]), and 
imposing a coherent axiom on a diagram F' amounts to saying that the functor 
[D, S] (—, F): CP — S carries certain finite families in C°? to epimorphic fam- 
ilies. Thus if we impose the coverage on C in which all such families cover, we 
obtain a classifying topos for diagrams satisfying the given coherent axioms. 


O 


We shall also require the following result from Part C: 


Theorem 4.3.9 Suppose given an internal site (C,T) in a topos S, and a 
geometric morphism p: E > S. Then there exists an internal coverage p*T on 
p*C in E, such that the square 


She(p*C, p#T) —> Shs(C,T) 


E ——— S 


is a pullback. Moreover, if C is cartesian and all covers in T are finite, then all 
covers in p#T are finite; and this coverage is such that, under the correspon- 
dence of 2.3.14 between diagrams of shape p*C°P in E and S-indezed functors 
[C] +E, the p#T-sheaves in E correspond to the functors [C°] + E which 
satisfy the sheaf axiom for covers in T. 
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Proof The first assertion of the theorem is more or less immediate from the 
proof of 3.3.6, given that every subtopos of [p*C°?, £) can be written in the form 
She(p*C, R) for some coverage R. However, in order to establish the second 
assertion, we need an explicit description of the ‘pullback coverage’ p#T. This 
is provided in Section C2.4, and the result claimed here follows directly from it 
(see C2.4.7). g 


Given these two results, it becomes a triviality to deduce 


Corollary 4.3.10 Any coherent theory T over S is dualizable; equivalently, 
any coherent S-topos is exponentiable. Moreover, the dual theory T* is also 
coherent. 


Proof If T is classified by a topos of the form Shs(C,T) as in 4.3.8, then the 
dual theory T* is simply the theory of T-sheaves on C, by 4.3.9. And this is clearly 
geometric, and indeed coherent, because the assertion that a functor F defined 
on C°P satisfies the sheaf axiom for a finite covering family (U; ~ U |1<i<n) 
is just the assertion that F(U) maps isomorphically to the limit of a certain 
finite diagram constructed from F (cf. the remarks after A2.1.9). So, in order to 
say that F is a T-sheaf, we have simply to impose a T-indexed family of such 
coherent axioms. o 


As a particular case of 4.3.10, we note that if C is a coherent internal category 
in S and T is its coherent coverage (defined as in A2.1.11(b)), then the topos 
Shs(C,T) is exponentiable. (In fact this ‘particular case’ includes all coherent 
S-toposes; cf. D3.3.1.) 


Suggestion for further reading: Johnstone & Joyal (542). 


B4.4 Fibrations and partial products 


One problem with slice categories of a 2-category, which we have already encoun- 
tered in 3.2.14 above, is that in passing from a 2-category R to a slice R/A we 
‘lose some of the 2-cells’; that is, if f, g: (p: B — A) 3 (q: C — A) are two mor- 
phisms of &/A, not every 2-cell a: f > g in Kis a 2-cell of R/A. In particular, if 
R has (pseudo-)pullbacks, the operation of pulling back along a fixed morphism 
p: B — Aon R is defined on 1-cells q: C — A, but not on arbitrary 2-cells with 
codomain A. 

As we saw in 3.2.14, the presence of tensors with 2 in R can be used to 
circumvent this problem in some cases. However, there is another possible way 
round, using an intrinsic 2-categorical notion of fibration; we devote the first part 
of this section to setting it up, and to investigating the fibrations and opfibrations 
in the particular 2-category BYop/S. 

To motivate the definition, let us consider what happens when we pull back 
arbitrary functors F: D — C along a fibration II: F — C in CAL. We recall from 
1.3.9 that in this case the pseudo-pullback D xe F (whose objects are triples 
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(B, E, f) where B € ob D, E € ob F and f is an isomorphism F'(B) — I(E) 
in C) is equivalent to the strict pullback (the full subcategory of objects for which 
f is an identity morphism). We shall exploit this by working in what follows with 
strict pullbacks rather than pseudo-pullbacks. 

Now suppose we are given a natural transformation a: F — G between func- 
tors D 3 C. The pullbacks II*(F’) and II*(G) are not in general equivalent in 
Cat/F; but we have a canonical functor from the second to the first, which sends 
an object (B, E) with G(B) = I(E) to (B,dom ag) where ag is a prone lifting 
of ag: F(B) — G(B) with codomain E. Moreover, the prone lifting ag itself 
defines a natural transformation fitting into the right-hand cell of the diagram 


F*F —— F 
I 


D<——— GF 


of which the left-hand cell commutes (up to isomorphism). 
Abstracting from this example, we introduce the following definition. 


Definition 4.4.1 Let & be a 2-category with (pseudo-)pullbacks. We say a 
morphism p: E — B of & is a fibration if we are given operations assigning to 
any 2-cell a: f — g between 1-cells f,g: A3 B a 1-cell l(a): g*(E) — f*(E) 
together with a 2-isomorphism f*(p)ol({a) S g*(p) and a (not necessarily invert- 
ible) 2-cell &: p*(f) 0 l(a) — p*(g), which ‘lies over a’ in the sense that the 2-cells 
obtained by pasting the diagrams 


l(a) l(a) 
g'(E) —> F(E) and  g*(E)—> F(E) 


| & 


E 
f 
a | 
: B 


A ————_> 


are equal. These data are required to satisfy the following conditions: 


(i) They are compatible with vertical composition of 2-cells, in the sense that 
I(1s) = 1f-g and ly is the identity 2-cell, and if 8: g — h is another 2-cell 
then we have an isomorphism /(a)ol(3) = I{8a), compatible in the obvious 
sense with the isomorphisms over A and the 2-cells with codomain EF. 
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(ii) They are also compatible with horizontal composition of a 2-cell with a 
1-cell, in the sense that if we are given k: C — A then l(a o k) is the 
pullback of I(a) along k, and aok is obtained by pasting & with the 
canonical isomorphism in the pullback square 


k* g*(E) —> 9*(B) 
(aok) ka) 


kP (E) —> T(E) 


(iii) Given any l-cel m:D—— E together with 1l-cels z:D— f*E, 
y: D —> g*E such that (p*g)y = m, (f*p)x = (g*p)y and we are given a 
2-cell 8: (p* f)z — m, there exists a unique 2-cell y: z — I(a)y such that 
the composite 


(f*p) o 


Fo: ———> TOOTE 4 (apiy 
is the given isomorphism and (p* f) oy is similarly isomorphic to £. 


Dually, we say p is an opfibration if it satisfies the same properties as above, 
but with the direction of the 1-cell (aœ) reversed. 


The definitions may alternatively be phrased in terms of the action of p on 
suitable ‘lax slice 2-categories’ of R; this is done in [536], where it is shown 
that the structure making a morphism into a fibration is unique up to canonical 
isomorphism if it exists. However, for our present purposes it seems more helpful 
to spell out the meaning in elementary terms, as above. 


Example 4.4.2 Suppose & has cotensors with 2 as well as pullbacks, and let 


be a comma square in &. We claim that p has a fibration structure (and, 
dually, q has an opfibration structure). To see this, suppose given a 2-cell 
a:h—k between 1-cells D = A. By the construction of 1.1.4(c), the pull- 
backs h*(f |g) and k*(f |g) may be identified with the comma objects (fh |g) 
and (fk |g) respectively. Let pn, qh and pk, qx respectively denote the projec- 
tions from the two pullbacks to D and to (f |g); then we may obtain a further 
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morphism m: k*(f |g) — (f Lg) from the morphisms hp; and qqk, together with 
the composite 2-cell 


foaocp, 
fhpr ————~> fkpr = fpa ——> 999 - 
Then the pair (pp, m) induces a morphism l(a): k*(f 1g) > h*(f |g); and we 
also have a 2-cell & from qpl(a) = m to qk, induced in the obvious way by a. 
It is straightforward to verify that these constructions of l(a) and & satisfy the 
conditions of 4.4.1 


Lemma 4.4.3 
(i) A composite of fibrations is a fibration. 


(ii) A pullback of a fibration is a fibration. 


Proof (i) Suppose given fibrations p:E—B and q:F — E, and a 
2-cell a: f — g between morphisms f,g: A=3 B. We may first construct 
l(a): g* E — f*E and the 2-cell a: p*(f) o l(a) — p*(g); then the latter induces 
a 1-cell I’(@) and a 2-cell & from the composite q*p* ( f)og*(I(a))ol’(&) to q*p* (g). 
It is then straightforward to verify that the composite g*(I(a)) o l’(&) and the 
2-cell & provide the structure needed to make the composite pq into a fibration. 
(ii) Suppose given a pullback square 

E 

p 

B 


F 
l 
C 


where p is a fibration, and a 2-cell a: f — g between morphisms f, g: A 3 C. By 
lifting the horizontal composite hoa, we obtain a morphism l(k o a): g*F — f*F 
and a 2-cell @:kog*fol{hoa) + koq*g. Using the fact that F is a Cat- 
enriched limit, it is easy to see that this 2-cell may be uniquely factored as 
k o ĝ; and the resulting 2-cell 8, together with the 1-cell l(h o a), provides the 
structure needed to make q into a fibration. 


Ss 


—— 


A further condition which we shall wish to impose on (op)fibrations is a 
‘commutativity condition’ between pullback along a fibration and cocomma 
objects, when the latter exist in our 2-category. Suppose R has cocomma objects, 
and let p: E — B be a fibration in R. Given a 2-cell a: f — g between two 
1-cells f,g: A 3 B, we now have two possible constructions. We may form the 
induced 1-cell "a7: A & 2 — B (recall that A ® 2 is just the cocomma object 
(147 1,)) and then form the pullback (A ® 2) xg E; or we may form the lifting 
l(a): g*E — f*E of a and the cocomma object (l(a) 1T1,- 2). It is easy to verify 
that the 2-cell & and the isomorphism (p* f)l(a) S p*g induce a canonical 1-cell 
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k(a): (l(a) Tle) — (A@ 2) xp E; we shall say that the fibration p satisfies 
condition (PCC) (‘pullbacks commute with cocommas’) if this comparison map 
is an equivalence for all a. 


Lemma 4.4.4 Let R be a 2-category which has pullbacks and cocomma objects. 
Ifp: E > B is a fibration in R satisfying condition (PCC), then any pullback of 
p also satisfies (PCC). 


Proof Let 


k 
— > 


F E 

f p 
h 

C ———> B 


be such a pullback, and let a: f — g be a 2-cell between morphisms f, g: A 3 C. 
By 4.4.3(ii), we know q is a fibration; and we obtain a commutative diagram 


iati Sr Se 


q p 


Cal 
Age eS AEE REE 


where the outer rectangle is a pullback since p satisfies (PCC), and the right- 
hand square is a pullback by definition. So the left-hand square is a pullback; 
i.e. q satisfies (PCC). o 


Condition (PCC) is also stable under composition of fibrations, provided the 
first factor of the composite is exponentiable; we shall not prove this result here, 
since we do not need it, but we refer the reader to [536] for the proof. 

In identifying the fibrations and opfibrations in STop, the following ‘tensor 
condition’ plays an important rôle. Recall from 3.4.2 that, for any topos S, the 
tensor 2@S in Top is simply the functor category [2, S], also called the Sierpiński 
topos over S. We shall write do: S — (2, S] and d,: S — [2, S] for the geometric 
morphisms which are respectively the source and target of the universal 2-cell 
with domain S; of course, they are the geometric morphisms induced by the two 
internal functors 1 3 2 in S. (Explicitly, d sends an object (Ip — Iı) of [2, S] 
to I;; the direct image functor do, sends I to (I — 1), and d,,(J) = 11.) 


Definition 4.4.5 Let P be a class of bounded geometric morphisms which is 
stable under pullback, We shall say that P satisfies the covariant (resp. con- 
travariant) tensor condition if, whenever we have a morphism p: € — (2, S] in 
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P whose codomain is a Sierpinski topos, and we form the pullbacks 


to Pi 
Eo ———-> E €e & ; 
Po p pı 
do dı 


Sea S os 


the composite t}t1, is the inverse (resp. direct) image of a geometric morphism 
over S. 


Remark 4.4.6 If the morphisms in P also satisfy the left (resp. right) Beck- 
Chevalley condition in the sense of C2.4.16 (that is, if whenever we have a 


pullback square 
E 
i 
S 


with p, and hence q, in P, then the Beck~Chevalley natural transformation 
p* fx > 947° (resp. f*ps — q.g*) is an isomorphism), then the words ‘over S’ at 
the end of the above definition are redundant. For it is clear that the composite 
djd,,. is the identity on S, and the left (resp. right) Beck-Chevalley condition 
implies that we have ttipi = podgdis = på (resp. pisthti. S dodisPos S pox). 
In practice, it turns out that those classes of geometric morphisms for which 
we are able to verify the covariant (resp. contravariant) tensor condition tend 
also to satisfy the appropriate-handed Beck~Chevalley condition (the classes of 
morphisms satisfying these conditions are characterized in C3.3.16 and C3.4.11 
respectively), but there does not seem to be any necessity for this. 


(re 


q 
f 
T ———_> 


Theorem 4.4.7 Let p:E-—S be a bounded geometric morphism. Then p is 
a fibration (resp. an opfibration) in Bop satisfying condition (PCC) iff the 
class P of all pullbacks of p satisfies the contravariant (resp. covariant) tensor 
condition. 


Proof The two cases are very similar; we shall deal with the fibration one. 
First suppose the tensor condition is satisfied. Given a 2-cell a: f—g 
between geometric morphisms 7 = S, we form the induced morphism 
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Tat: [2,7] — S and the pullback 
FS 


q p 


Tq! 
Bj ss 


Then the pullbacks of p along f and g are respectively the morphisms gp and 
qı which occur as the pullbacks of q along dg and dı; so we have a geometric 
morphism l(a): g*E — f*E over T whose direct image is the composite tti». To 
construct the required 2-cell &: (p* f(a) — p*g, we observe that it suffices to 
construct a natural transformation (p*f)* — (l(a)),(p*g)*. But p* f and p*g are 
respectively isomorphic to the composites stg and st,, so this is the same thing 
as a natural transformation t§s* — t§t1.t]s*; and the latter is easily obtained 
from the unit of (t] 4 t:.). The verification that the constructions we have 
given for l(a) and & satisfy the compatibility conditions of Definition 4.4.1 is 
straightforward. 

To verify condition (PCC), we observe that since dı and dọ define comple- 
mentary open and closed subtoposes of [2,7], their pullbacks t; and tọ define 
complementary open and closed subtoposes of F; so by A4.5.6 F may be recon- 
structed by glueing along the functor tĝtı+. But by 3.4.2 the cocomma object 
(l(a) T lyre) is obtained by glueing along (I(a)),; so it is equivalent to F, and 
the construction given for & is sufficient to ensure that this equivalence is the 
one appearing in the statement of (PCC). 

Conversely, suppose p is a fibration satisfying (PCC). Then so is any pullback 
of p, by 4.4.3(ii) and 4.4.4; so it suffices to verify that if the domain of p itself 
is of the form [2,7] then the composite tGti. is a direct image functor over T. 
We have a 2-cell a: do — dı with codomain [2,7]; so I{a) is a geometric mor- 
phism E1 — & over T, and condition (PCC) tells us precisely that the topos 
obtained by Artin glueing along the direct image of this morphism is equivalent 
to € (which can, once again, be recovered from & and £ by glueing along the 
composite t§t;,). Moreover, this equivalence must commute with the geometric 
morphisms from € and £, into the two glued toposes; so it induces an isomor- 
phism l(a), & t§ti.. o 


One advantage of 4.4.7 is that it enables us to prove that (op)fibrations in 
BTop are stable under adjoint or coadjoint retracts, in the sense of 1.1.9(c). 


Lemma 4.4.8 Let P be a class of bounded geometric morphisms satisfying 
either the covariant or the contravariant tensor condition. Let Q be either the 
class of morphisms defined by saying that f: E — S is in Q iff it is an adjoint 
retract in BYop/S of a morphism in P, or the class of coadjoint retracts of 
morphisms in P. Then Q satisfies the same tensor condition as P. 
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Proof There are four different cases to consider, though two of them are 
formally dual to the other two. However, all four proofs are very similar, and 
we shall give only one in detail, namely the case when P satisfies the covariant 
tensor condition and we consider adjoint retracts. (The case of coadjoint retracts 
and the covariant tensor condition is treated in [536].) 

We note first that Q is stable under pullback, since pullback along a fixed 
morphism is a 2-functor and hence preserves adjoint retracts. So suppose 
p: E — [2, S] is a morphism in P, and let q: F — [2,S] be a retract of it by 
morphisms r: E — F and i: F — E over [2, S] such that r is left adjoint to i in 
Top, ie. such that r* & i,. Form the diagram 


Uo ui 
Fg ———> F e«—— F 


d 
EE spe s 


where all the squares are pullbacks: then we know that the composite ttx is an 
inverse image functor over S, and we also have (ro ~ ig) and (rı 4 i1). Now 


Uptis = iOtoxUguis 
= iprpuotis 
= igtor” Urs 
= igtots Us 
it th alig 


which is a composite of inverse image functors over S. O 


We are now ready to consider two particular classes of morphisms in Bop 
for which we can verify the tensor condition. The first is the class of morphisms 
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of the form [C, S] — S: 


Lemma 4.4.9 For any internal category C in a topos S, the canonical 
morphism [C,S] — S is an opfibration in BTop satisfying condition (PCC). 


Proof We verify that the class of all such morphisms satisfies the covariant 
tensor condition. By 3.2.14, we know it is stable under pullback. Also, an internal 
category C in [2,S] is the same thing as an internal functor f: Cp — Cı in S, 
and an easy calculation shows that, in the diagram 


CoS] > [C 2,5] < CS), 


do dı 
S ——=> [2, S] c S 
the composite ¢§t1, is simply the functor obtained by applying pullback along f 
to discrete opfibrations over C,, and hence it is the inverse image of a geometric 
morphism over S. go 


The toposes which occur as retracts in BTop/S of diagram categories [C, S] 
are studied in Section C4.3, where we shall in particular prove that every such 
S-topos is in fact a coadjoint retract of a diagram category (cf. C4.3.9). Thus we 
may conclude: 


Corollary 4.4.10 Let p:&€—S be a bounded geometric morphism which is a 
retract in BYop/S of one of the form [C,S] — S. Then p is an opfibration in 
BTop satisfying condition (PCC). o 


Similarly, it turns out that coherent morphisms, as considered at the end of 
the previous section, are fibrations in Top/S. Once again, we shall have to rely 
on a couple of results from Part C in order to prove this. 


Lemma 4.4.11 Letp: E — S be a coherent topos over S. Then p is a fibration 
in BTop. satisfying condition (PCC). 


Proof We noted in 4.3.9 that the class of coherent morphisms is stable under 
pullback, so we have only to verify the tensor condition for a coherent morphism 
E ~ Shjz,s](C, T) — [2, S]. Consider first the case when the coverage T is trivial, 
so that £ is simply the diagram category [C°?, [2, S]]: in this case, as in 4.4.9, we 
find that ¢§t,, is simply the functor obtained by pulling back discrete fibrations 
along f: Cp — Cı. But C is now assumed to be a cartesian internal category 
in {2,S], which means that Cp and Cı are cartesian categories in S and f is 
a cartesian functor; so, by the S-indexed version of A4.1.10 (cf. also 3.2.8(c)), 
pullback along f is the direct image of a geometric morphism over S. 

For the general case, we have to presume upon the reader’s goodwill by 
appealing to results in Part C: this time in Section C2.3, where we develop a 
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notion of ‘cover-preserving functor’ and show that if f: (Co, To) —> (C1, Ti) is a 
cartesian cover-preserving functor between cartesian sites, then the geometric 
morphism [C3?, S] — [C]P, S] whose direct image is given by composition with 
f restricts to a morphism between the corresponding sheaf toposes (see C2.3.4). 
It is clear that this result applies in the present case: the assertion that T is an 
internal coverage in [2, S] contains the information that f preserves covers. O 


Once again, 4.4.11 may be extended to the class of all morphisms p: E — S 
such that £ is either an adjoint or a coadjoint retract in BTop/S of a coherent 
S-topos. We do not know whether this class includes all retracts of coherent 
S-toposes; but for the particular case of localic morphisms, we study those 
which are retracts of coherent ones in Section C4.1, and in C4.1.13 we show 
that they all occur as coadjoint retracts. (We characterize them there as the 
class of morphisms corresponding to stably locally compact locales.) Thus we 
may state 


Corollary 4.4.12 If X is a stably locally compact locale in a topos S, then the 
canonical geometric morphism Shs(X) — S is a fibration in Bop satisfying 
condition (PCC). o 


The other notion we wish to introduce in this section is a 2-dimensional 
generalization of the notion of partial product, which we studied in A1.5.7. It 
is clear that, for an arbitrary morphism f: E — B in a 2-category RK, and an 
object A of K, it does not make much sense to ask whether the partial product 
P(A, f) exists, since we cannot hope to make it universal in a 2-dimensional 
sense. That is, we wish 1-cells C — P(A, f) to correspond to pairs (C — B,C xg 
E — A) of 1-cells, but we cannot say what 2-cells between such 1-cells should 
correspond to, since a 2-cell between morphisms C = P(A, f) would induce a 
2-cell between morphisms C = B by horizontal composition, but we have no 
means of pulling the latter back along f. However, if f has either a fibration 
or an opfibration structure, in the sense defined above, we can circumvent this 
problem. 

For definiteness, let us suppose that f is an opfibration. Then, for any object 
C of R, we define a category of ‘partial product cones over (A, f) with vertex C”, 
as follows: its objects are pairs (u,v) with u: C — B and v: C xg E — A, and 
a morphism (u,v) — (u’,v’) is a pair (a, 3) where a is a 2-cell u — u’ and £ is 
a 2-cell v — v’-I(a). The assignment mapping C to this category is (contravari- 
antly) pseudo-functorial in C, in an obvious way; and by a covariant partial 
product P,(A, f), we mean a representing object for this functor. Similarly, if 
f is a fibration, we may define the notion of contravariant partial product, for 
which our notation is P*(A, f). 

The next result is the ‘2-dimensional analogue’ of the basic existence 
theorem A1.5.7 for partial products. 


Theorem 4.4.13 Let R be a 2-category with pullbacks and cocomma objects, 
and suppose that p: E — B is a fibration (resp. an opfibration) in R which sat- 
isfies condition (PCC) and is exponentiable as an object of R/B. Suppose also 
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that products with B exist in R. Then, for any object A of R, the contravariant 
(resp. covariant) partial product P®(A, p) (resp. P.(A, p)) exists in R. Moreover, 
P*(—,p) (resp. Pe(—,p)) is covariantly functorial, and may be described as the 
composite ` 
B* (S UB 
R ——> K/B ——~> R/B ——> R, 

where Ep and B* as usual denote the forgetful functor R/B — R and its right 
adjoint. In particular, if p is both a fibration and an opfibration, and satisfies 
(PCC) in both senses), then the two notions of partial product coincide. 


Proof We shall consider the opfibration case; that for fibrations is similar. 
Given an object C of R, we have to construct a natural equivalence between the 
category of partial product cones over (A, p) with vertex C, and the category 
KR (C, 2 —((B*A)?)). The correspondence on (isomorphism classes of) objects is 
easy to describe: specifying a morphism f: C —> 0 ,((B*A)?) in & is equivalent 
to specifying a morphism g: C — B in & and a morphism g — (B*A)?, or 
equivalently g x p — B*A, in R/B. And the latter further corresponds to a 
morphism =p(g x p) = C xg E — Ain R. To show that this correspondence 
extends to an equivalence of categories, we use (PCC) to show that a morphism 
between two partial product cones with vertex C may be regarded as a single 
partial product cone with vertex C @ 2; the details are straightforward. The 
final assertion of the theorem follows from the observation that the displayed 
composite takes no account of the variance of p. O 


In the previous section, we verified the exponentiability in BYop/E of all 
the morphisms F — E which we have proved to be (op)fibrations in the present 
section. So we may now conclude 


Corollary 4.4.14 Let S be a topos with a natural number object, f: F — E a 
morphism of BYop/S and G a bounded S-topos. Then 


(i) If f is a retract in BVop/E of a topos of the form [C, E] for some inter- 
nal category C in E, then the covariant partial product P.(G, f) exists in 
BT op/s. 
(ii) If f is either an adjoint or a coadjoint retract of a coherent E-topos, then 
the contravariant partial product P*(G, f) exists in BYop/S. 
(iii) In particular, if f ts a retract of an E-topos of the form [C°?, E} where C 
is cartesian, then the two partial products exist and are equivalent. o 


The &-toposes satisfying the condition of 4.4.14(iii) are characterized in 
C4.3.1 as the injective objects (with respect to inclusions) in the 2-category 
BTop/E. 

The most important example of a partial product functor is the (lower) bag- 
domain functor Bz, first introduced in [535] and [1206]. To explain the thinking 
behind this notion, suppose £ is the classifying topos (over S) for a geometric 
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theory T. If J is a particular object of S, we know how to construct a classifying 
topos for the theory of J-indexed families of T-models, and we have seen in the 
previous section that this is the exponential €5/! in BTop/S. But what if we 
wish to classify families of T-models indexed by a ‘variable’ object? 

By a bag (or indexed family) of T-models in an S-topos F, we mean a T-model 
in F/B for some B; if M and M’ are bags of T-models, living in the slice cat- 
egories F/B and F/B’, then a morphism of bags of T-models from (B, M) 
to (B’, M’) consists of a morphism f: B — B’ in F together with a morphism 
M — f*M’ in T(F/B). It is easy to see that such morphisms can be composed, 
so that we have a category T-bag( F) of bags of T-models in F; also, a geomet- 
tic morphism g: G — F over S induces a functor g*: T-bag(F¥) — T-bag(G) in 
the obvious way, and geometric transformations induce natural transformations 
between such functors. Thus T-bag is itself a theory over S. 

In passing, we note 


Lemma 4.4.15 For any geometric theory T over S, and any bounded S-topos 
F, T-bag(F) is equivalent to the category of pre-geometric morphisms F — S{T] 
(in the sense of A4.1.13) over S, and pre-geometric transformations between 
them (that is, natural transformations between their inverse image functors). 


Proof This is immediate from the canonical factorization 


a 


F —— > F/f*(1) EA S|T] 


of an arbitrary pre-geometric morphism f, constructed in A4.1.13. (Note, how- 
ever, that the statement ‘f is a pre-geometric morphism over S’ simply means 
that its direct and inverse image functors form an S-indexed adjoint pair; it does 
not imply that the diagram 


F EE ae SİT] 


commutes.) O 


Proposition 4.4.16 Let S be a topos with a natural number object. For any 
geometric theory T over S, the theory T-bag is also geometric. 


Proof Given a classifying topos € = S[T] for T, we claim that the covariant 
partial product P,(E, f) has the universal property of a classifying topos for 
T-bag, where f is the geometric morphism S[O]/G — S[O] and G is the generic 
object in S/O]. But this is immediate from the definition of a covariant partial 


450 BIOP/S AS A 2-CATEGORY B44 


product; for a geometric morphism F — P.(E€, f) corresponds by definition 
a a pair (u,v), where u is a morphism F — S[O] (equivalently, an object 

= u* (G) of F), and v is a morphism from the pullback F x stog S{0]/G ~ F/B 
: E (equivalently, a T-model in F/B). A similarly straightforward argument 
shows that a geometric transformation between two morphisms F = P,(E, f) 
corresponds to a morphism of bags of T-models, as defined above. o 


We write BLE for the topos Pe (E, f) introduced in the proof of 4.4.16, and call 
it the lower (or covariant) bagdomain of €. Clearly, the construction E> BrE 
is functorial on BYop/S. 


Example 4.4.17 If € is S itself, then B,€ is simply the object classifier S[O}. 
More generally, suppose € is [C,S] for some internal category C in S; then 
we claim that BLE may be identified with [Famy(C),S] where Fam;(C) is the 
free finite-coproduct completion of C, introduced in 1.4.17. (Recall that Sy = 
Famy(1), so this is indeed a generalization of the case E = S.) To prove this 
claim, note that a bag of C-torsors in F (that is, a C-torsor in some slice category 
of F) may by 2.6.6 be identified with a discrete fibration F — C in F whose 
domain is a weakly filtered category. We shall show that, if this holds, then the 
induced functor Fam,(F) — Fam;(C) is a Fam,(C)-torsor. 

For this, we argue in terms of elements, as if F were the topos of sets. The 
objects of Fam, (F) are finite indexed families g: [p] — Fo of objects of F; clearly, 
there exists such a family, namely the unique morphism [o] ¥ 0 — Fo, and given 
two families gı and g2, the induced map 


miserit es (90) Fy 


is an object of Fam,(F) to which each of them can be mapped. Given two 
objects gi and g2 of Fam,(F), a parallel pair of morphisms a, 3: gi 3 g2 yields 
in particular a parallel pair of morphisms [pı] = [pe] between the indexing 
cardinals; if we write [p2] —» [p3] for the coequalizer of the latter (cf. D5.2.5). 
then for each element of [p3| we have a connected finite diagram in F consisting 
of the objects gı(i) and g2(j) for which i and j map to the given element o 
[p3]. If we choose a cone under this diagram for each k : [p3], we obtain 
morphism g3: [p3] — Fo (picking out the vertices of these cones) for which we 
have a morphism 7: g2 — g3 in Famy(F) coequalizing a and 8. So Fam,(F) i 
filtered. 

Conversely, if we are given a Fam,;(C)-torsor G, then the pullback of G 
along the inclusion C — Fam,(C) (obtained by regarding objects as [so]-indexe 
families) is easily seen to be a weakly filtered category F; and we have an isomor. 
phism G © Fam,(F), since the functor corresponding to G must send coproduct: 
in Fams(C) to products. Thus we have established an equivalence between thi 
category of Fam,(C)-torsors and the category of ‘pre-torsors’ (that is, discret 
fibrations with weakly filtered domain) over C. 
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Generalizing Example 4.4.17, it is possible to proceed from a site of definition 
(C, J) for an arbitrary bounded S-topos €, to one for BLE whose underlying 
category is (Famy(C°P))°P. We omit the details, which may be found in [535]. 


Remark 4.4.18 Lemma 4.4.15 may now be reformulated as saying that BLE 
is the ‘free coproduct completion’ of € in BTop/S, in the sense that, for 
any S-topos F, the category BYop/S(F, BE) of ‘F-valued points of BLE’ 
is the free coproduct completion of BYop/S(F,E) as an F-indexed category 
(cf. 1.4.16). 


In view of this observation, it will come as no surprise to discover that 
the functor Bz has the structure of a KZ-monad on BTop/S. The unit map 
ne: E — BLE is the morphism which arises from regarding a T-model as a bag 
of T-models indexed by a singleton: explicitly, it corresponds to the pair of geo- 
metric morphisms (u, 1¢) where u: € — S/O] classifies the terminal object of £. 
(Equivalently again, in terms of 4.4.15, it corresponds to the identity geometric 
morphism € — E regarded as a pre-geometric morphism.) The multiplication 
similarly corresponds to the operation which sends a bag of bags of T-models 
((M; | i € Ij) | j € J) to the single bag (M; | i € [jey Jj); equivalently, it 
corresponds to the pre-geometric morphism B,;(B,€) — E displayed as the top 
edge and vertical composite in the diagram 


Br(Bré)/C -_—_ Bré/B, — E 


S[O]/G <— Bz(Bz£)/B, ——> BrE 


SO) <2 — Bz (Bz) 


where (u1, v1) and (u2, v2) denote the structure morphisms associated with the 
partial products BŁ and B,(Br€) respectively, B; = uf(G) and (C — B2) = 
v3(B,). The verification that this is indeed a KZ-monad structure on By is 


straightforward. 
We may characterize the algebras for this monad in several different ways: 


Theorem 4.4.19 Let S be a topos with a natural number object, and p: E — S 
a bounded S-topos. The following are equivalent: 
(i) E admits an algebra structure for the bagdomain monad By. 
(ii) ne: E > BLE has a left adjoint in BLop/S. 
(iii) ne: E + BLE admits a splitting in BTop/S. 
(iv) E is a retract in BIop/S of a topos of the form BLF. 
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(v) For each object I of S, the ‘diagonal’ morphism E — ES (that is, the 
exponential transpose of the projection E xs S/I =~ E/p*I — E) has a left 
adjoint in BLop/S. 

(vi) The morphism p and the diagonal Ap: E — E xg E both have left adjoints 
in BTop/S. 


(vii) For each bounded S-topos q: F +S, the category BYop/S(F,E) has 
finite coproducts, which are natural in F. 

(viii) For each bounded S-topos q: F — S, the inclusion of BTop/S(F,E) in 
the category reGeom/S (F,E) of pre-geometric morphisms over S from 
F to E (cf. 4.4.15) has a left adjoint, and these adjoints are natural in F. 


Proof Suppose € is the classifying topos for a geometric theory T over S. 
It is easy to see that n¢ is always an inclusion, since (in terms of classifying 
toposes) it corresponds to forcing a bag of T-models to satisfy the geometric 
axiom (in the sense of 4.2.7(6)) which says that the object indexing the bag is 
a singleton. Hence, if it has a left adjoint, the latter is necessarily a one-sided 
inverse for it; thus the equivalence of (i), (ii) and (iii) follows from the general 
theory of KZ-monads (1.1.13 and 1.1.14), plus the fact that BTop/S is locally 
Cauchy-complete (cf. A4.1.15). (iii) > (iv) is immediate, and (iv) => (ii) follows 
from 1.1.10. 

(ii) > (v): Clearly, the exponential €5/! classifies the theory of I-indexed 
families of T-models: thus we have a canonical geometric morphism E€5/! > Br €, 
which can be described in terms of classifying toposes as the effect of making the 
simple S-specialization (4.2.8(d)) of T-bag which forces the object indexing the 
generic bag of T-models to be isomorphic to the constant geometric construct I. 
It is easy to see (by considering the effect on families of T-models) that if we 
compose this morphism with the left adjoint BLE — E of ne, we obtain a left 
adjoint for the diagonal. 

(v) => (vi) is immediate, since (vi) is equivalent to the two special cases I = 0 
and J = 1 H 1 of (v). And (vi) => (vii) is also easy, since (for example) we 
may form the coproduct in BYop/S(F,€) of a pair of morphisms F = £ by 
regarding them as a single morphism F — E xs € and then composing with the 
left adjoint of the diagonal. 

For (vii) = (viii), we note that (viii) is essentially the statement that 
BTop/S(F,E) has ‘F-indexed coproducts which are natural in F’, where 
we make it into an F-indexed category by defining its B-indexed families of 
objects (for an object B of F) to be geometric morphisms F/B — E over S 
(equivalently, by identifying it with the category BYop/F (F,F xs E£), and 
indexing the latter over F in the manner described in Section B3.4). But by 
3.4.8 this category always has colimits indexed by filtered internal categories 
in F; so the result we need is just an F-indexed version of the fact that arbi- 
trary coproducts may be constructed from finite coproducts and filtered colimits 
(cf. D5.2.14). 
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Finally, (viii) implies (ii) by applying the left adjoint of (viii) to the universal 
pre-geometric morphism from By, € to E. o 


An S-topos satisfying the equivalent conditions of 4.4.19 is called hyperlocal. 
(The reason for this name will become clear in Section C3.6, when we consider 
the class of local morphisms - they are the morphisms p: E — S for which p has 
a left adjoint in Top/S, although A, may fail to have a left adjoint. We shall see 
in C3.6.5 that bounded local S-toposes can also be characterized as the algebras 
for a KZ-monad on BTop/S, whose functor part is a partial product functor.) 


Examples 4.4.20 (a) If C is an internal category in S with finite coproducts, 
then [C,S] is a hyperlocal S-topos: in view of 3.2.13, the binary coproduct 
functor C x C — C and the initial object of C induce the left adjoints required 
for 4.4.19(vi). Conversely if C is Cauchy-complete and fC, S] is hyperlocal, then 
since the left adjoints of 4.4.19(vi) are automatically S-essential it follows from 
the S-indexed version of A4.1.5 that they must be induced by internal functors 
C x C — C and 1 — C, and hence that C has finite coproducts. 

(b) To give examples of spatial hyperlocal toposes, we need to borrow some 
ideas from Chapter C1. Let X be a sober space whose points form a join- 
semilattice in the specialization ordering (defined by x < y iff x belongs to the 
closure of {y}, cf. the remarks before C1.4.5), and such that the binary join map 
X x X — X is continuous. (Here X x X needs to be interpreted as the product 
in the category Loc of locales rather than in Sp; but the two coincide if X is 
locally compact — cf. C4.1.8.) Then it follows easily from C1.4.5 and C1.4.8 that 
the geometric morphisms Sh(X x X) — Sh(X) and Set — Sh(X) induced (as 
in A4.1.11) by the join map and the least element of X provide the left adjoints 
required for 4.4.19(vi); so Sh(X) is a hyperlocal Set-topos. In particular, this 
holds if X is a continuous lattice equipped with its Scott topology (cf. C4.1.6). 


The use of the name ‘lower bagdomain’ will naturally have aroused in the 
reader an expectation that we will also describe an ‘upper’ or contravariant bag- 
domain functor, and prove corresponding results about it. Such a construction 
indeed exists, but its properties are less well explored; we shall merely sketch the 
construction here, and refer the interested reader to [537] for further information. 

The basic idea underlying the construction is that, just as an object of a topos 
S is canonically a filtered colimit of finite cardinals (an idea which we used in the 
proof of 4.4.19(vii) => (viii)), so the objects indexing ‘co-bags’ of T-models should 
be canonically expressible as cofiltered limits of finite cardinals. Consideration of 
the latter leads us naturally into the category of Stone spaces (or better, Stone 
locales) — cf. [520], VI 3.1. So a co-bag of T-models in S should be taken to 
be a sheaf of T-models on a Stone locale X in S, i.e. a T-model in Shs(X), or 
equivalently a T-model in Shs(B,C), where B is an internal Boolean algebra in 
S and C is its coherent coverage (cf. A2.1.11(b)). 
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Definition 4.4.21 (a) We call a geometric morphism f: F — E entire if there 
exists a Boolean algebra B in S such that F is equivalent (as an S-topos) to 
Shs(B,C). By 4.3.8, any entire morphism is coherent; so by 4.4.14(ii) the con- 
travariant partial product functor P*(—, f) exists in BTop/S, for any S such 
that £ is bounded over S. 

(b) Let T be a geometric theory over S, with classifying topos p: E — S. By 
a co-bag of T-models in a bounded S-topos F, we mean a span 


g f 


F «——— G Se 


in BTop/S, where g is entire; a morphism of co-bags of T-models from (f, g) to 
(f’, 9’) is a diagram of the form 


g’ f 
Gg’ 
in BTop/S, where the left-hand cell commutes up to isomorphism but the right- 
hand one contains a not-necessarily-invertible 2-cell. We write T-cobag(F) for 
the category of co-bags of T-models in F; since pullbacks of entire morphisms 
are entire by 4.3.9, it is easy to see that T-cobag is again a theory over S. 

(c) Let B denote the (algebraic) theory of Boolean algebras, and let Gg denote 
the generic Boolean algebra in the Boolean algebra classifier B = S[B] over S. 
We define the upper bagdomain ByE of a bounded S-topos E to be P*(E, F), 
where f is the canonical entire morphism Shg(Gg, C) — B. 


Theorem 4.4.22 


(i) If E is the classifying topos for a geometric theory T over S, then ByE 
classifies the theory T-cobag. In particular, the latter theory is geometric. 


(ii) The functor By carries the structure of a co-KZ-monad on BTop/S. 


Proof We shall only sketch the broad outline of the proof, leaving the reader 
to fill in the details. 

(i) It is clear from the definition of ByE as a partial product that geo- 
metric morphisms F — ByE correspond to co-bags of T-models in F. To 
verify that geometric transformations between such morphisms correspond 
to morphisms of co-bags, we need to show that the contravariant functor 
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B(F) — SZop/F which sends an internal Boolean algebra B in F to 
Sh;(B,C) is full and faithful (in the 2-categorical sense), i.e. that every 
morphism over F between entire F-toposes is canonically isomorphic to 
one induced by a homomorphism between the corresponding Boolean alge- 
bras. This may be deduced by combining C1.4.5 with the fact that B 
is recoverable from Sh(B,C’) as the lattice of complemented subterminal 
objects. 

(ii) Given (i), this may be proved by means essentially similar to those by 
which we constructed the KZ-monad structure on Bz. The unit E + ByE comes 
from regarding a geometric morphism f: F — E as a co-bag (f,g) for which g 
is an identity morphism (it is clear that identity morphisms are entire, since if 
2 = 1111 denotes the initial Boolean algebra in F then Shz(2, C) = F). To obtain 
the multiplication, we similarly need to know that a composite of two entire 
morphisms is entire; this can be proved directly using the techniques of Section 
C2.5, but it also follows from the existence of the pure—entire factorization which 
we shall establish in C3.4.13. o 


We shall not attempt to give a general characterization of the algebras for 
the monad By, similar to 4.4.19, but we note one particular example of a free 
algebra: 


Example 4.4.23 Let R be a commutative ring (in a topos S, though for sim- 
plicity we shall think of S as Set). It is well known that the idempotents of R 
(that is, the elements e satisfying e? = e) form a Boolean algebra E(R) in the 
ordering defined by e < f iff ef =e. The Pierce representation of R expresses R 
as the ring of global sections of a sheaf of rings R on the Stone space of E(R); 
explicitly, R corresponds to the sheaf for the coherent coverage on E(R) whose 
sections over an idempotent e are the elements of the quotient ring R/(1 — e). 
We recall that a commutative ring is said to be indecomposable if it has exactly 
two idempotents 0 and 1, i.e. if it satisfies the coherent axioms ((0 = 1) F L) 
and 


((e? =e) Fe ((e=0) V(e=1))). 


It is easy to verify that the Pierce sheaf R, considered as a commutative ring in 
the topos Sh({E(R),C), satisfies these axioms (see [520, V 2.3]). Moreover, the 
universal property of the Pierce sheaf, described in [520, V 2.5], is precisely the 
statement that the specification of a commutative ring (in an arbitrary topos F) 
is equivalent to the specification of a co-bag of idecomposable rings in F. Thus 
we see that, if € is the classifying topos for the theory of indecomposable rings, 
ByE is (equivalent to) the classifying topos for arbitrary commutative rings. 


There are numerous other examples of algebraic structures which possess 
a ‘Boolean representation theory’ (cf. [290]) saying that arbitrary algebras are 
equivalent to co-bags of ‘simple’ algebras. In all such cases, we can argue as 
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in 4.4.23 to show that the classifying topos for the appropriate theory has the 
structure of an upper bagdomain. 


Suggestions for further reading: Diers [290], Jibladze [499], Johnstone [535, 
536, 537], -Rosebrugh & Wood [1037, 1038, 1039, 1040], Street [1132], 
Vickers [1206]. 


B4.5 The symmetric monad 


In this section we consider a close relative of the lower bagdomain construction 
of the previous section, introduced in [193] and extensively studied in [197] and 
[198], which throws light on the relationship between bounded S-toposes and 
cocomplete S-indexed categories. From 3.1.2, we know that every S-topos is 
cocomplete and locally small as an S-indexed category, and every inverse image 
functor over S is cocontinuous as an S-indexed functor. We thus have a forget- 
ful functor (Top/S)°? — Cocomps, where Cocomps is the meta-2-category of 
cocomplete locally small S-indexed categories, cocontinuous S-indexed functors 
and indexed natural transformations between them. 

The codomain of this functor is evidently much too large for us to hope 
that it might have any sort of adjoint: if we wish for the latter, we must 
impose further ‘size restrictions’ besides local smallness. However, it turns 
out that there is a very natural full sub-2-category of Cocomps, which 
includes (the underlying indexed categories of) all bounded S-toposes, and 
over which (BTop/S)° may be regarded as the category of algebras for a 
KZ-monad. This subcategory is nothing other than the S-indexed analogue 
of the 2-category of locally presentable categories in the sense of Gabriel 
and Ulmer [386], and we shall denote it by LocPres;. Locally presentable 
categories over Set are studied, in the more general context of accessible cat- 
egories, in Section D2.3, and we shall see there that every Grothendieck topos 
is locally presentable as a category (D2.3.7). However, rather than translat- 
ing the general definition of local presentability into S-indexed terms, and 
proving the.S-indexed version of the result just cited, we shall here adopt a 
more ad hoc definition of what it means for a cocomplete S-indexed category 
to be locally presentable. We shall make some remarks about the equiva- 
lence of this definition with the more usual one in 4.5.3 below; however, for 
the moment, readers who already know what locally presentable categories 
are would be well advised to forget that knowledge, and simply accept the 
fact that we have chosen LocPtess as our (somewhat idiosyncratic) name 
for a class of ‘well-behaved’ cocomplete S-indexed categories that we wish to 
study. 

It will also be convenient to introduce a new name for the 2-category 
(BTop/S)°. Just as, in Chapter C1, we shall find it convenient to introduce 
different terminology and notation for the generalized spaces (locales) which we 


B4.5 THE SYMMETRIC MONAD 457 


study in that chapter and the algebraic entities (frames) which are extension- 
ally the same thing, so it is sometimes convenient to distinguish terminologically 
between a bounded S-topos, regarded as a ‘generalized space over S’, and the 
S-indexed category which is how we represent it extensionally, but which is 
really an algebraic gadget rather than a geometric one. Following a sugges- 
tion of S. J. Vickers [1210], we shall use the name Giraud frame (over S) 
for an S-indexed category E which satisfies the conditions of 3.3.11 (that is, 
derives from a bounded S-topos p: E — S), and write GFrms for the 2-category 
whose objects are Giraud frames over S, whose 1-cells are cocontinuous carte- 
sian S-indexed functors (that is, inverse image functors), and whose 2-cells are 
S-indexed natural transformations between such functors. (Thus OFtmsg is, up 
to equivalence, just another name for (BYop/S)°P.) 

Among the objects of LocPtes5, we certainly wish to have all the diagram 
categories [C,S], where C is an internal category in S. We recall from 2.5.8 that 
[C, S] is the free (S-indexed) cocompletion of C°P, so that cocontinuous functors 
[C,S] — [D,S] correspond to arbitrary S-indexed functors CP — [D,S] - or 
equivalently, to profunctors C % D in S, as defined in 2.7.1. That is, we may 
regard the bicategory Profs of internal categories and profunctors as (equivalent 
to) a full sub-2-category of our desired category. We shall denote this subcategory 
by Dg™ms, and call it the 2-category of diagram categories over S. 

But of course not every Giraud frame over S is of the form [C,S]: we need 
to add more objects to our 2-category. The surprising thing is that all we need 
to add are coinverters of 2-cells (cf. 1.1.4(e)): the key result that proves this is 
little more than a restatement of A4.3.11 (and also a close relative of 4.1.6). 


Lemma 4.5.1 Let E be a bounded S-topos, and j a local operator on E. 
Then there exists an internal category D in S and a 2-cell a: f — g between 
morphisms f,g:[D, S] 3 E in Cocompys, such that the associated sheaf functor 
i*: E — shj(E) has the universal property of a coinverter for a in Cocompgs. 


Proof We recall that an object B of € is a j-sheaf iff, for every j-dense 
monomorphism A’ — A in E£, each morphism A’ — B extends uniquely to a 
morphism A — B. But it clearly suffices to consider only those dense monomor- 
phisms A’ — A for which A lies in some given separating family for E; and 
the collection of such monomorphisms may be indexed by an object J of S. We 
take D to be the discrete category indexed by J; then we have a pair of func- 
tors f,g: D 3 E and a natural transformation & between them which simply 
picks out this family of morphisms of E. Since E is cocomplete as an S-indexed 
category, & extends canonically to a natural transformation a: f — g between 
cocontinuous functors (D, S] 3 E, by 2.5.8. 

Since i* inverts all 7-dense monomorphisms, it is clear that 7* oa is an 
isomorphism. Now suppose h: E — X is any morphism of Cocomps such that 
hoa is an isomorphism. Since E is well-copowered and has a separating family, 
it follows from 2.4.6 that k has a right adjoint h,; and the proof of A4.3.11 then 
shows that A,(X) is a j-sheaf for every object X of X, i.e. h, factors uniquely 
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through the right adjoint i, of i*. (In A4.3.11 we were dealing with adjunctions of 
which the left adjoint preserved finite limits, but that condition was not used in 
this part of the proof.) If we denote this factorization by k,, it then follows easily 
from the fact that i, is full and faithful that the composite hoi, is left adjoint to 
k,, and hence that it is the unique factorization (up to canonical isomorphism) 
of h through i*. C 


Definition 4.5.2 We define the 2-category CocPres, of locally presentable 
S-indered categories to have as objects all those cocomplete locally small 
S-indexed categories X which can be presented as the codomain of a coinverter 


f 
[D,S} |a_ [C,S] ——~>X 
— 


g 


in Cocomps, together with all S-cocontinuous functors between them and all 
S-indexed natural transformations between the latter. 


Since every bounded S-topos can be presented as a subtopos of one of 
the form [C, S], it follows immediately from 4.5.1 that the forgetful functor 
OFfrms — Cocomp., factors through CocPresc. 

We recall that in 4.1.7 we showed that arbitrary inverters exist in BYop/S; 
equivalently, 6§tms has coinverters. And a comparison of that proof with 4.5.1 
above will show that the forgetful functor 6Frms — LocPress preserves coin- 
verters. Thus, if the functors f and g appearing in the displayed diagram in 4.5.2 
are actually inverse image functors (i.e. if they are cartesian), then 7 is actually 
a bounded S-topos. 


Remark 4.5.3 Classically, a locally presentable category (over Set) is one 
which is representable as the category of «-continuous Set-valued functors on 
a K-complete small category C, where « is a cardinal and ‘x-complete’ (resp. 
‘x-continuous’) means ‘having (resp. preserving) limits of diagrams of cardinality 
less than x> (cf. D2.3.4). By a similar argument to that in the proof of 4.5.1, we 
may represent it as an inverter (in the 2-category Compg,, of complete locally 
small categories and continuous functors) of a 2-cell between functors [C, Set] = 
[D, Set] (where D may in fact be taken to be discrete); but since any continuous 
functor between locally presentable categories has a left adjoint, by the Special 
Adjoint Functor Theorem, we may take left adjoints of the functors concerned 
to obtain a representation of our locally presentable category as a coinverter in 
Cocomp,s,,. On the other hand, a theorem of Makkai and Paré [787] asserts 
that the 2-category of locally presentable categories is closed under weighted 
limits (in particular, under inverters) in Compg,,; so we may identify the locally 
presentable categories of 4.5.2 (in the case S = Set) with those defined in the 
traditional way. 
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We now embark on our study of the relationship between ©§ftmgs and 
LocPress. From this point on, we shall need to assume that our base topos 
S has a natural number object. 


Proposition 4.5.4 Let S be a topos with a natural number object. Then the 
forgetful functor O§tms — LocPress has a left adjoint B. 


Given a locally presentable S-indexed category C, we call EC the symmetric 
Giraud frame (or symmetric topos) generated by E. The reason for the name 
is an analogy with classical commutative algebra, obtained by thinking of finite 
limits as ‘multiplication’ and S-indexed colimits as ‘addition’ and ‘multiplication 
by scalars in S’: in this analogy, cartesian categories are ‘commutative monoids’, 
cocomplete S-indexed categories are ‘modules’, and Giraud frames are ‘algebras’. 
Thus the free functor © is the analogue of the symmetric algebra construction. 
(The analogy becomes rather more explicit when we compare (ordinary) frames 
with complete join-semilattices, as we shall see in Section C1.1.) 


Proof First let C be an internal category in S. We know from 2.5.8 that [C, S] 
is the free S-indexed cocompletion of C°P: that is, for any cocomplete S-indexed 
category E, arbitrary S-indexed functors CP — E correspond to cocontinuous 
indexed functors [C, S} —> E. But we also know from 3.2.6 that we may construct 
the free cocartesian category CoCart(C) on C as an internal category in S; so 
if E is a Giraud frame then arbitrary functors C°P -> E correspond to cartesian 
functors CoCart(C)°? — E, which in turn correspond by 3.2.7 to inverse image 
functors [CoCart(C), S] -> E. Thus we have shown that fC, S] may be taken to 
be the Giraud frame [CoCart(C), S]. Moreover, it is clear that this definition of © 
extends to a 2-functor on Dgm: given a cocontinuous functor f : [C, S] — [D, S], 
we may regard it as a functor CP — [D,S], compose it with the unit map ÍD, S] > 
EID, S] (which is simply left Kan extension along the inclusion D —+ CoCart(D)), 
and then extend it to a cartesian functor CoCart(C)°? — EID, S], which in turn 
defines a morphism of Giraud frames EC, S} -> EID, S]. The argument for 2-cells 
is similar. 

Defining © in the general case is now easy, since, as a left adjoint, it must 
preserve Coinverters. Given a coinverter presentation 


f 
D,S] Ja’ [c,s} Ż—> x 
g 


of an object X of LocRress, we form the corresponding 2-cell 


poe eee 
[CoCart(D),S}]  a* — [CoCart(C),§| 


x 


g 
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in Ġğrms, and take its coinverter in this 2-category (equivalently, the inverter 


in BLop/S, cf. 4.1.7) to define UX. o 
Now consider the diagram 
í h 
AA R 
[D, S] la C, S ————> X 
—— > 
g 
ND,sS] NICS] nx 
=n h* 
[CoCart(D),S] | af (CoCart(C),S] ——> =X 
> 
J+ 


associated with a coinverter presentation of an object X of LocRresSs, where 
the vertical arrows are the unit maps. As we remarked in the proof of Propo- 
sition 4.5.4, mc,s) is the left Kan extension functor along the full embedding 
i: C — CoCart(C); so its right adjoint 8c (which exists as an indexed functor, 
simply because 7 is cocontinuous) is itself a morphism of LocPres,; also, the 
unit of the adjunction (7 4 8) is an isomorphism. More generally, we have 


Theorem 4.5.5 Let S be a topos with a natural number object, and X an object 
of LocPress. Then X is a Giraud frame iff ng: X — EX has a right adjoint in 
LocPres,, such that the unit of the adjunction is an isomorphism. 


Proof First suppose X is a Giraud frame. Then, as we saw in 4.5.1, we may 
choose its coinverter presentation so that h: [C,S] — X is actually a morphism 
of 6§rms. But Oc: [CoCart(C), S] > [C, S] is also a morphism of GFrms, since 
it has adjoints on both sides; and hc coinverts the 2-cell a*, since 


hc o a* o nps) = hocnics}°® = hoa 


which is an isomorphism, so we get a factorization @x: EX — X of h@c 
through h*. Similar arguments show that the composite 6x7x is isomorphic to the 
identity, and that the counit 2-cell from 7c,5)9c to the identity factors through 
h* to yield a 2-cell which is the counit of an adjunction (nx 7 6x), the unit being 
the isomorphism already mentioned. 

Conversely, suppose 7x has a right adjoint 6 in LocPress such that the unit 
of the adjunction is an isomorphism. Since Nx is a Giraud frame, we can use the 
Special Adjoint Functor Theorem (2.4.6) to obtain a further right adjoint 6, for 
0 (though the latter will not, of course, be a morphism of LocPress in general): 
and from the fact that the unit of (nx 4 0) is an isomorphism, it follows that 
the counit of (@ 4 @,) is an isomorphism. Also, @ is cartesian since it has a left 
adjoint; and so we have identified Y (the underlying ordinary category of X 
with a reflective subcategory of EX for which the reflector is cartesian. Hence 
by A4.3.9 it is a topos; so X is a Giraud frame. C 
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the smallest local operator on ME whose associated sheaf functor maps 6:(1¢) > 
lme to an isomorphism (cf. A4.5.14(c)). Since the unit map le — 6*6:(1¢) is an 
isomorphism, it is clear that ô itself factors through PE — ME. More generally, a 
geometric morphism f: F — ME over S factors through PE iff the cocontinuous 
S-indexed functor E — F to which it corresponds (which is, of course, just the 
canonical S-indexing of the composite f*6,) preserves 1, i.e. it is an F-valued 
probability measure on £. 

If € = {C,S] for an internal category C, then PE may be identified with 
[Conn ;(C),S], where Conns(C) denotes the free completion of C with respect 
to connected finite colimits. To see this, we do not need to give an explicit con- 
struction of Conns(C); but we note that, as usual, it may be identified with the 
full subcategory of [C°P, S] whose objects are connected finite colimits of repre- 
sentables, from which it is easy to see that the full embedding C — Conn,;(C) 
is an initial functor. Now a cocontinuous S-indexed functor [C,S] — F preserves 
1 (that is, it is a probability measure on [C, S]) iff the diagram of shape C°? in 
F which is its restriction to representables (cf. 2.5.8) has colimit 1; so by 2.5.12 
we deduce that the same is true of its unique connected-finite-limit-preserving 
extension to a functor Conns(C)°? — F. But the preservation of finite con- 
nected limits by the latter easily implies, as in 3.2.5, that the total category 
of the corresponding discrete opfibration is weakly filtered; hence by 2.6.5 it is 
filtered. So it is a Conn s(C)-torsor, and hence it induces a geometric morphism 
F — {Conn;(C),S]. The argument in the opposite direction is similar. It follows 
that, for any E£, we can construct a presentation of PE as an inverter in BYop/S 
from such a presentation of E, in the same way as we did for ME but replacing 
the functor CoCart(—) by Conn;(—). 

We next claim that, for any E, we have By (PE) ~ ME. Again, this could 
be proved by first considering toposes of the form |C, S], and noting that we 
have Fams(Conns(C)) ~ CoCart(C), since any finite colimit can be uniquely 
decomposed as a finite coproduct of connected colimits. But we can derive the 
same result more simply, by noting that any cocontinuous S-indexed functor 
f:E-—F can be factored as 


E ——> F/f(1) ——> F 


(just as in A1.2.9, but without the hypothesis that f preserves pullbacks) where 
the first factor is cocontinuous and preserves the terminal object. Conversely, 
given an object B of F and a cocontinuous 1-preserving functor E — F/B, we 
may compose with the forgetful functor to obtain a cocontinuous functor E > F. 
So F-valued measures on E are essentially the same thing as bags of F-valued 
probability measures on E; hence the toposes which represent the two notions 
must be equivalent. 

On the other hand, we do not in general have P(Bi€) ~ ME, since we 
do not have Conns(Fam,(C)) ~ CoCart(C); although any finite colimit can be 
written as a connected colimit of finite coproducts (for example, as a coequalizer 
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Corollary 4.5.6 The monad induced by the adjunction between the forgetful 
functor O§rms — LocPres, and its left adjoint = is a co-KZ-monad, and the 
adjunction is monadic. 


Proof The construction of the right adjoint to nx, for a Giraud frame X, 
in fact shows that it is the component of the counit of the adjunction at the 
object X; in particular, if X is itself of the form XY, then it is the component 
of the multiplication of the monad at Y. Hence the multiplication is (naturally) 
right adjoint to the unit of the monad. Also, we have shown in 4.5.5 that the 
objects of LocPres. which admit algebra structures for the monad are exactly 
the Giraud frames (cf. 1.1.13). So, to complete the proof, we have only to show 
that if E and F are Giraud frames then a cocontinuous functor f:E — F is a 
morphism of =-algebras if it is cartesian. But if f is a morphism of U-algebras, 
then the composite fg = Op f* is cartesian, whence f is cartesian since Og is a 
coreflector. Conversely, if f is cartesian, then it is a morphism of 6 Frtmg, and so 
induces a morphism of -algebras (namely, itself!) via the comparison functor. O 


Next, we note that the construction of SX may be factored into two stages. 
Let us, for want of a better name, write & for the full sub-2-category of LocPress 
whose objects are the Giraud frames; thus we have inclusions 


GFrms >—> K >——> LocPres, 


of which the first is bijective on objects and the second is full and faithful. 


Proposition 4.5.7 The inclusion R — LocPress has a left adjoint T. 
Moreover, for every object X of LocPress5, we have UX ~ UTX. 


Proof Given X, we may present it as usual as a coinverter 


D,S] la’ (C,s) “> x 
—— > 


g 


and consider the pushout 
h* 
[CoCart(C), S| ——> Ex 


8e t 


+ 
cs] —-s TX 


in GFrms, i.e. the pullback in BTop/S. From the construction of pullbacks of 
inclusions in 3.3.6, it is easily verified (by an argument like that in the proof of 
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4.5.1) that the above square is also a pushout in LocPress, and that At is the 
coinverter in G§rms (or in LocPres5) of the 2-cell 


+ 
Ig 
[CoCart(D),S} |a* [C,S] 
gt 


induced by a in the obvious way. Hence in particular there is a unique (up to 
isomorphism) cocontinuous functor u: TX — UX satisfying uh* = A*nc.g (u is 
in fact the left adjoint of the morphism ¢ in the pushout diagram, though we 
don’t need this). 

Since at o mp 5) = a, it is clear that we get a canonical morphism X + TX 
in LocPres,. Also, if k: X — E is any cocontinuous functor from X to a Giraud 
frame, then it extends uniquely to a Giraud frame morphism k*: UX — E, and 
now we have 


k* uth” S kuh” be = Kh’ nie sfc S k*nxhOc = khOc 


so that the pair (k*ut, kh) induces a unique cocontinuous functor k+: TX — E 
(which must in fact be the composite k*u). Thus we have shown that TX is 
the required left adjoint. Also, since for any Giraud frame E the categories of 
cocontinuous functors X — E and TX — E are equivalent, we see that DX and 
ETX have the same universal property in 6Frms, and so must be equivalent. 
o 


Thus, if we wish to characterize the ‘symmetric Giraud frames’ (that is, 
those which occur as EX for some X), it suffices to consider those of the form 
SE where E is already a Giraud frame. In order to do this, we shall investigate 
the comonad on ĠFrms induced by E and its right adjoint; but at this point 
it will be convenient to reverse our perspective to the more familiar ‘geometric 
one, and regard the structure in question as a monad on BYop/S. Rotating 
the symbol © through 90° rather than 180° as might have been expected, we 
shall denote the bounded S-topos corresponding to the Giraud frame SE by 
ME. (The reason for this notation is that, as suggested by F. W. Lawvere, it ic 
helpful to regard the points of ME — equivalently, the cocontinuous S-indexec 
functors E — S — as ‘measures’ or ‘distributions’ on E, those which derive fron 
points of £ - equivalently, which correspond to cartesian cocontinuous functors - 
being the ‘Dirac measures’ concentrated at a single point. We shall make thi 
idea more precise in 4.5.9 below.) 

The unit of the monad on BTop/S is the geometric morphism ôg: E + Mc 
whose inverse image is the X-algebra structure 0: DE — E; note that this mor 
phism is an S-essential inclusion in the sense of 3.1.1, since @ has an S-indexe 
left adjoint 72 which is full and faithful. The multiplication MME — ME i 
similarly the morphism whose inverse image is Ung: SE — USE; by 1.1.12, thi 
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Recall that we saw in 4.4.19 that By-algebras have a similar property with 
arbitrary S-indexed colimits replaced by coproducts; and indeed Bz€ is the free 
coproduct-completion of € in BTop/S. It is not true in general that ME is the 
free cocompletion of € in a similar ‘external’ sense. For a counterexample, take 
E to be a bounded Set-topos such that the geometric morphism y: € — Set 
is essential, but € has no points (we shall give an example of such a topos in 
D3.4.14). Then Yop (Set, E) is empty, but Top (Set, ME) contains (the measures 
corresponding to) all set-indexed copowers of 7, so it is not the free cocomple- 
tion of Top (Set, €). However, we shall see in 4.5.13 below that ME is the free 
A-cocompletion of € in BTop/S in the ‘intrinsic’ sense introduced in 1.1.16, for 
a suitable class A of geometric morphisms. 

We certainly have a comparison map B€ — ME for any €: for we have a 
geometric morphism BŁE/I — E over S, where I is the object of BE indexing 
the generic bag of points of €, and if we compose the inverse image of this with 
the forgetful functor B,€/I — BrE we obtain a cocontinuous S-indexed functor 
E — BrE, corresponding to a Giraud frame homomorphism DE — BE. 

In the case when £ is of the form |C, S], the comparison may be described 
more simply. In this case, we know that B,é is the topos [Fam; (C), S], where 
Fam ;(C) as usual denotes the free finite-coproduct-completion of C (cf. 4.4.17), 
and ME is [CoCart(C), S]. Since the full embedding C — CoCart(C) obviously 
factors through Fam ;(C), we obtain a corresponding factorization 


E —— > BrE ——> ME 


of the unit ĝe: € — ME, where both factors are essential inclusions. From the 
universal property of the lower bagdomain, it is easy to verify that the functor Br 
preserves inverters in BTop/S; hence, by taking a presentation of an arbitrary 
bounded S-topos E as an inverter of a 2-cell between diagram toposes (cf. 4.5.1), 
we may deduce the existence of the corresponding factorization of ĝe, for an 
arbitrary E. In particular, we deduce that the natural map BrE — ME is an 
inclusion for any £. In fact it is an essential inclusion: the left adjoint of its inverse 
image may be obtained by composing the left adjoint of the inverse image of the 
inclusion B€ — MBy,€ with the inverse image of M(E —> BLE). 

But there is another way in which we may factor ĝe. We noted in 4.4.15 that 
Bré€ is a representing object for pre-geometric morphisms with codomain £; 
that is, those morphisms which we get by forgetting the requirement that an 
inverse image functor must preserve the terminal object, but remembering that it 
must preserve pullbacks. Similarly, ME represents those S-indexed adjunctions 
with codomain € which we get by forgetting altogether the requirement that 
an inverse image functor should preserve finite limits. If we forget that it should 
preserve pullbacks, but remember that it should preserve the terminal object, we 
arrive at the notion of a probability measure on E, that is one for which ‘the total 
measure of £ is unity’. This too corresponds to a KZ-monad on SBTop/S; we may 
obtain its functor part as follows. Let PE be the subtopos of ME corresponding to 
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is right adjoint to 6px in OFrms, but because of the reversal of 1-cells it becomes 
left adjoint to the unit in BTop/S. Thus we have verified 


Lemma 4.5.8 The monad on BTop/S whose functor part is M is a 
KZ-monad. o 


Remark 4.5.9 As we noted in the proof of 3.2.9, the free cocartesian category 
CoCart(1) on the terminal object of Cat(S) is (equivalent to) Sp; so we see that 
MS ~ {S;,S] is the object classifier S[O] over S. In particular, S[O] carries 
an M-algebra structure. In terms of the identification of points of ME with 
‘measures on £’, the algebra structure map f: M(S{O]) — S[O] may be thought 
of as ‘integration’, in the following sense. We may think of an object A of € asa 
‘continuous S-valued function’ on the ‘generalized space’ € (this is an appealing 
way of viewing sheaves on traditional spaces), whose value at a point s: S — E 
is the object s*(A). Given a measure on E, that is a point m: S > ME, the 
‘integral’ of A with respect to the measure is the object of S classified by the 
composite 


M(A) Í 


s — > ME ——> M(S{0]) — sO, 


where A is the classifying map of A. It is straightforward to verify that this is 
indeed the value at A of the cocontinuous functor E — S corresponding to m; in 
particular, if m = d¢s is the Dirac measure corresponding to a point s of £, it is 
just s*(A). 


Example 4.5.10 Among examples of M-algebras, we have all toposes of the 
form [C, S] where C is cocartesian; for if C is cocartesian then the embedding 
C — CoCart(C) has a left adjoint, given by forming colimits in C, and hence 
[C, S] is an adjoint retract of [{CoCart(C), S] = M[C,S]. Hence also any retract 
in BTop/S of a topos of the form {C, S] with C cocartesian admits an M-algebra 
structure. (We shall study such toposes in greater detail in Section C4.3.) 


Proposition 4.5.11 Ifa bounded S-topos E has the structure of an M-algebra, 
then for any bounded S-topos F the S-indexed category BTop/S (F,E) (that is, 
the category whose I-indezed families are morphisms F/I — E/I over S/I, 
for each object I of S) is S-cocomplete. Moreover, for any geometric morphism 
f: F' — F the functor BTop/S (F, E) — BLop/S(F’,E) induced by composi- 
tion with f is cocontinuous, and similarly for the functor induced by composition 
with any M-algebra homomorphism g: € — E'. 


Proof If€ ~ MG isa free M-algebra, then BZop/S (F, E) is equivalent to the 
category LocPres, (G,F), which is clearly cocomplete since it it closed under 
(pointwise) colimits in the category of all indexed functors G — F. In general, 
any M-algebra € is a retract of a free M-algebra by 1.1.13, so the cocom- 
pleteness result follows from 1.1.10. The cocontinuity assertions are similarly 
straightforward. o 


464 BIOP/S AS A 2-CATEGORY B4.5 


Recall that we saw in 4.4.19 that By-algebras have a similar property with 
arbitrary S-indexed colimits replaced by coproducts; and indeed B;€ is the free 
coproduct-completion of E in BTop/S. It is not true in general that ME is the 
free cocompletion of € in a similar ‘external’ sense. For a counterexample, take 
E to be a bounded Set-topos such that the geometric morphism y: € + Set 
is essential, but € has no points (we shall give an example of such a topos in 
D3.4.14). Then Top (Set, £) is empty, but Top (Set, ME) contains (the measures 
corresponding to) all set-indexed copowers of 7;, so it is not the free cocomple- 
tion of Top (Set, E€). However, we shall see in 4.5.13 below that ME is the free 
A-cocompletion of € in BTop/S in the ‘intrinsic’ sense introduced in 1.1.16, for 
a suitable class A of geometric morphisms. 

We certainly have a comparison map BrE — ME for any £: for we have a 
geometric morphism B,€/I — E over S, where T is the object of Br € indexing 
the generic bag of points of €, and if we compose the inverse image of this with 
the forgetful functor BLE/I — BŁE we obtain a cocontinuous S-indexed functor 
E — BrE, corresponding to a Giraud frame homomorphism SE —> B;E. 

In the case when £ is of the form [C, S], the comparison may be described 
more simply. In this case, we know that BzE is the topos [Fam,(C),S], where 
Fam s(C) as usual denotes the free finite-coproduct-completion of C (cf. 4.4.17), 
and ME is [CoCart(C), S]. Since the full embedding C — CoCart(C) obviously 
factors through Fam,(C), we obtain a corresponding factorization 


E ———> BE ——> ME 


of the unit ĝe: E€ — ME, where both factors are essential inclusions. From the 
universal property of the lower bagdomain, it is easy to verify that the functor Br 
preserves inverters in BTop/S; hence, by taking a presentation of an arbitrary 
bounded S-topos € as an inverter of a 2-cell between diagram toposes (cf. 4.5.1), 
we may deduce the existence of the corresponding factorization of d¢, for an 
arbitrary €. In particular, we deduce that the natural map B€ — ME is an 
inclusion for any E. In fact it is an essential inclusion: the left adjoint of its inverse 
image may be obtained by composing the left adjoint of the inverse image of the 
inclusion ByE — MBE with the inverse image of M(€ > By, €). 

But there is another way in which we may factor ĝe. We noted in 4.4.15 that 
Br€ is a representing object for pre-geometric morphisms with codomain £; 
that is, those morphisms which we get by forgetting the requirement that an 
inverse image functor must preserve the terminal object, but remembering that it 
must preserve pullbacks. Similarly, ME represents those S-indexed adjunctions 
with codomain € which we get by forgetting altogether the requirement that 
an inverse image functor should preserve finite limits. If we forget that it should 
preserve pullbacks, but remember that it should preserve the terminal object, we 
arrive at the notion of a probability measure on £, that is one for which ‘the total 
measure of £ is unity’. This too corresponds to a KZ-monad on BZop/S; we may 
obtain its functor part as follows. Let PE be the subtopos of ME corresponding to 
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the smallest local operator on ME whose associated sheaf functor maps ô (le) > 
lye to an isomorphism (cf. A4.5.14(c)). Since the unit map lg — 6*61(1¢) is an 
isomorphism, it is clear that 6 itself factors through PE + ME. More generally, a 
geometric morphism f: F — ME over S factors through PE iff the cocontinuous 
S-indexed functor E — F to which it corresponds (which is, of course, just the 
canonical S-indexing of the composite f*d,) preserves 1, i.e. it is an F-valued 
probability measure on €. 

If € = [C,S] for an internal category C, then PE may be identified with 
[Conn s(C), S], where Conn;(C) denotes the free completion of C with respect 
to connected finite colimits. To see this, we do not need to give an explicit con- 
struction of Conny (C); but we note that, as usual, it may be identified with the 
full subcategory of [C°?,S] whose objects are connected finite colimits of repre- 
sentables, from which it is easy to see that the full embedding C — Conn;(C) 
is an initial functor. Now a cocontinuous S-indexed functor [C,S] -> F preserves 
1 (that is, it is a probability measure on [C,S]) iff the diagram of shape C® in 
F which is its restriction to representables (cf. 2.5.8) has colimit 1; so by 2.5.12 
we deduce that the same is true of its unique connected-finite-limit-preserving 
extension to a functor Conn;s(C)°? — F. But the preservation of finite con- 
nected limits by the latter easily implies, as in 3.2.5, that the total category 
of the corresponding discrete opfibration is weakly filtered; hence by 2.6.5 it is 
filtered. So it is a Conn;(C)-torsor, and hence it induces a geometric morphism 
F — [Conn; (C), S]. The argument in the opposite direction is similar. It follows 
that, for any E, we can construct a presentation of PE as an inverter in BTop/S 
from such a presentation of €, in the same way as we did for ME but replacing 
the functor CoCart(—) by Conny(—). 

We next claim that, for any E, we have By (PE) ~ ME. Again, this could 
be proved by first considering toposes of the form [C, S], and noting that we 
have Fam,(Conns(C)) ~ CoCart(C), since any finite colimit can be uniquely 
decomposed as a finite coproduct of connected colimits. But we can derive the 
same result more simply, by noting that any cocontinuous S-indexed functor 
f:E — F can be factored as 


E —> F/ f1) —>F 


(just as in A1.2.9, but without the hypothesis that f preserves pullbacks) where 
the first factor is cocontinuous and preserves the terminal object. Conversely, 
given an object B of F and a cocontinuous 1-preserving functor E — F/B, we 
may compose with the forgetful functor to obtain a cocontinuous functor E — F. 
So F-valued measures on E are essentially the same thing as bags of F-valued 
probability measures on E; hence the toposes which represent the two notions 
must be equivalent. 

On the other hand, we do not in general have P(BŁE) ~ ME, since we 
do not have Conns(Fam,(C)) ~ CoCart(C); although any finite colimit can be 
written as a connected colimit of finite coproducts (for example, as a coequalizer 
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(ili) = (i): Define 6: ME — E to be (dc)+(le). Then O5¢ & 1g, since ô is an 
S-essential inclusion; but idempotent 2-cells split in BTop/S, so by 1.1.14 and 
4.5.8 € has an M-algebra structure. (In fact, as is clear from the proof of (i) > 
(iii), @ itself is the algebra structure map, but we do not need to prove this.) 

Now if £ and €’ are M-algebras and hk: € — €’ is an algebra homomorphism, 
then it follows from the definition of ay in (i) = (ii) above that we have 


at (hf) =6' -af h ~ 6» Mh-af* ~hO-aft =h-a4(f) 
so h is cocontinuous. Conversely, if h is cocontinuous, then 
h8 Sh- ôq (1e) = ô4 (h) = 6 bh* = 0' - Mh, 


so it is an algebra homomorphism. 
The final assertion is simply a restatement of the fact that ME is the free 
M-algebra generated by £. o 


It is possible to give a direct proof of (i) = (iii) in 4.5.13, without making 
use of comma objects or Pitts’s theorem, simply by observing that the definition 
of a, in the proof of (i) = (ii) has the property that a*a, is isomorphic to the 
identity whenever a*a is isomorphic to the identity (as it must be if a is an 
inclusion). 

It follows from 1.1.18 that the M-algebras may also be characterized as the 
objects of BTop/S which are linearly B-complete, where B is the class of mor- 
phisms which occur as the top edges of comma squares whose bottom edges are 
S-essential. We do not have a simple characterization of this class; but we shall 
see in C3.3.18 that it may be replaced by the (larger) class of locally connected 
morphisms. 

We remark that the P-algebras in BYop/€, where P is the ‘probability 
measures’ monad, may be similarly characterized as those which are pointwise 
cocomplete with respect to the class of those S-essential morphisms a: F — G for 
which the left adjoint a; preserves 1. (Ifa is not merely S-essential but G-essential, 
the latter condition is equivalent to saying that a is connected, i.e. that a” is 
full and faithful, as we shall see in C3.3.3. However, in general it is a weaker 
condition than connectedness.) There is also a generally similar characterization 
of B,-algebras as those which are cocomplete with respect to the class of local 
homeomorphisms (that is, morphisms of the form G/A — G for some object A of 
G); however, in this case we have to repace the pointwise condition by the linear 
one, since the class of local homeomorphisms is not stable under ‘transfer across 
comma, squares’. 


Suggestions for further reading: Bunge [192], Bunge & Carboni [193], Bunge 
& Funk (197, 198]. 
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of morphisms between coproducts), there are in general many different such 
representations of a given colimit. 
Summarizing the above discussion, we have 


Theorem 4.5.12 For any bounded S-topos E€, we have a commutative square 
of essential inclusions 
E ———> BLE 


PE ———> ME 
whose diagonal is the unit map ĝe: E + ME, and which is natural in €. More- 
over, the functors By, and P both carry KZ-monad structures for which the 
natural inclusions into M are morphisms of monads; and the composite BLP is 
naturally equivalent to M. 0 


Finally in this section, we characterize the M-algebras in BYop/S in terms 
of the intrinsic cocompleteness notions introduced in 1.1.16. 


Theorem 4.5.13 Let S be a topos with a natural number object, and let A 
denote the class of S-essential morphisms in BYop/S. For a bounded S-topos €, 
the following are equivalent: 


(i) E admits an M -algebra structure. 
(ii) E is pointwise A-cocomplete. 
(iii) E is an A-cocomplete M-injective, where M is the class of inclusions in A. 


Moreover, a geometric morphism between two such toposes is A-cocontinuous 
iff it is an M-algebra homomorphism; and, for any E€, ME is the free point- 
wise A-cocompletion of E, in the sense that any geometric morphism f: E — F 
where F is pointwise A-cocomplete admits a unique (up to isomorphism) 
A-cocontinuous extension to a morphism ME => F. 


Proof (i) = (ii): Suppose (€, 6) is an M-algebra. Given a: F — G in A and an 
arbitrary geometric morphism f: F — €, the composite a; f* is a cocontinuous 
S-indexed functor E — G, and so corresponds to a geometric morphism G > ME 
over S, which we denote by a: ui fr. *, We define a4 (f) to be the composite 8-a nf. *. To 
verify the adjunction, suppose given g: G — E; then geometric transformations 
ai(f) — g correspond to geometric transformations af — deg, since @ is 
left adjoint to ĝe by 1.1.13; but these in turn correspond to S-indexed natural 
transformations aif* — g*, and hence to transformations f* — a*g*, i.e. to 
geometric transformations f > a*g. The Beck—Chevalley condition for comma 
squares is immediate from 4.1.8. 

(ii) => (iii) is immediate from 1.1.19, since we have already observed that 
inclusions are fully monic in Top (and in BVop/S). 


PART C 


TOPOSES AS SPACES 


Cl 


SHEAVES ON A LOCALE 


C1.1 Frames and nuclei 


In this chapter, our main aim is to describe the ‘geometric’ origins of topos 
theory, by setting up the topos of sheaves on a topological space, and indicating 
how topological properties of spaces and continuous maps may be translated 
into properties of toposes of sheaves and geometric morphisms between them. 
However, there is a significant ‘generalization’ of the notion of topological space, 
namely the notion of locale, which is of importance in the topos-theoretic context, 
and almost everything that can be said about toposes of sheaves on spaces carries 
over without extra effort to toposes of sheaves on locales. Since the notion of 
locale may not be familiar to the reader, we therefore begin with a couple of 
sections setting up the category of locales, and describing its relation to the 
category of spaces. 

If X is a topological space, we shall write O(X) for the lattice of open subsets 
of X; this lattice is complete (since arbitrary unions of open sets are open), and 
satisfies the infinite distributive law 


UAV VY = \V(U AV) 


ier wel 


for all U and all families (V; | 7 € J) of elements of O(X). (This is because 
the operations of arbitrary join and finite meet in O(X) coincide with the set- 
theoretic operations of union and intersection, which are well known to satisfy all 
possible distributive laws. However, the lattice-theoretic dual of this law may fail 
in O(X), since the meet of an infinite family of open sets is not their intersection 
but the interior of their intersection.) Moreover, if f: X — Y is a continuous map 
of topological spaces, the mapping f—~!: O(Y) — O(X) preserves finite meets 
and arbitrary joins. These two observations motivate the following definition: 


Definition 1.1.1 A frame is a complete lattice A satisfying the infinite 


distributive law 
an Vbi = V(a^ bi) 
ie! ie! 
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for alla € A and all families (b; | i € I) of elements of A. A frame homomorphism 
h: A — B is a mapping preserving finite meets and arbitrary joins. We write Frm 
for the category of frames and frame homomorphisms. 


Remark 1.1.2 A complete lattice is a frame iff it is a Heyting algebra 
(cf. A1.5.11), since the Adjoint Functor Theorem tells us that a A (—): A— A 
preserves arbitrary joins iff it has a right adjoint a = (—). (This is, of course, 
a special case of the assertion that geometric categories are Heyting categories, 
which we noted in A1.4.18, since a frame is just a geometric category which 
is a poset.) However, frame homomorphisms do not in general preserve the 
implication operation. 


In particular, we note that any complete Boolean algebra is a frame. Since 
frame homomorphisms are in particular lattice homomorphisms, they preserve 
complements of elements when they exist; hence a frame homomorphism between 
complete Boolean algebras is the same thing as a complete Boolean homomor- 
phism (i.e. a map preserving arbitrary meets and joins). We shall have more 
to say about the relationship between frames and complete Boolean algebras in 
1.1.21 below. 

In this section we study the ‘algebraic’ aspects of frame theory; its ‘topolog- 
ical’ aspects will be the subject of the next section. From the algebraic point of 
view, one of the most important results about frames is that free frames exist, i.e. 
that the forgetful functor Frm — Set has a left adjoint. There are several differ- 
ent constructions of this left adjoint, which correspond to different factorizations 
of the forgetful functor through various ‘intermediate’ categories of posets; we 
shall want to make use of all of them at different times in what follows, and so 
we devote the next few paragraphs to setting them up. 

We shall wish to distinguish notationally between the categories jSLat 
and mSLat of join-semilattices and meet-semilattices; abstractly, these are of 
course the same category, but their forgetful functors to Poset are different. 
(Incidentally, for us a semilattice will always have a unit element: that is, a join- 
semilattice has a bottom element and a meet-semilattice has a top element.) 
DLat will denote the category of (finitary) distributive lattices, and CjSLat 
the category of complete join-semilattices: that is, an object of CjSLat is a 
poset with arbitrary joins (and hence also arbitrary meets), and a morphism 
of CjSLat is a function preserving arbitrary joins (equivalently, having a right 
adjoint), but not necessarily preserving meets. We shall use the term domain for 
a poset having joins of (upwards) directed subsets, and write Dom for the cat- 
egory of domains (and functions preserving directed joins). Finally, a preframe 
is defined to be a poset with finite meets and directed joins, which satisfies the 
infinite distributive law of 1.1.1 whenever the family (b, | i € J) is directed 
(though the law may fail for other families, even if the joins happen to exist), 
and a preframe homomorphism is a function preserving finite meets and directed 
joins: we write PFrm for the category of preframes. 
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Clearly, we have forgetful functors 


Frm ———__———_> CjSLat 


PFrm ——————_> Dom 


DLat —— | > jSLat 


mSLat ———————————_> Set 


Lemma 1.1.3 All the functors in the above diagram have left adjoints. 


Proof We shall not give explicit constructions of all the left adjoints (those 
that we do not mention can easily be shown to exist using the Adjoint Func- 
tor Theorem, or by the adjoint lifting theorem A1.1.3). However, we shall give 
descriptions of the ones of particular interest to us. 

First, the free join-semilattice on a set A is the set KA of finite subsets of sA, 
ordered by inclusion (cf. D5.4.9): given a function f: A — B where B is a join- 
semilattice, its unique extension to a join-preserving map f: KA —> B is given 
by f(S) = V{f(s) | s € S}. Hence also the free meet-semilattice on A is simply 
K A°P. A similar argument shows that the free complete join-semilattice on a set 
A is the set PA of all subsets of A, ordered by inclusion. And the free domain 
on a set A is just A itself, with the discrete ordering; note that this is a domain, 
since its only directed subsets are singletons. 

The free functor mSLat — Frm sends a meet-semilattice A to the set LA 
of all lower (i.e. downwards-closed) subsets of A, ordered by inclusion; the unit 
map 7: A — LA sends a to the principal ideal | (a) = {b € A | b < a}. It is clear 
that LA is a frame (and indeed a coframe), since it is closed under arbitrary 
meets and joins in PA; and ņ preserves all finite meets (indeed, all meets which 
happen to exist in A). Now if B is a frame and f: A — B is a map preserving 
finite meets, the only possible candidate for a frame homomorphism f: LA — B 
extending f is given by 


FS) = VIs) | s €S}, 
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since each lower set is the union of the principal ideals which it contains; this 
map clearly does preserve joins (and extend f), and it also preserves finite meets 
since f(A) = f(14) = lg and 


F(S) AFT) =\/{F(s) 0 f(t) | 8 ES, tET)} 
=\V/{f(sAt)|s€S,teET} 
= \V{f(u) lue SOT} = f(SNT) 


where we used the infinite distributive law in B at the first step. 

A similar argument shows that the free distributive lattice on a meet- 
semilattice A is the set of finitely-generated lower sets in A (that is, sets which 
are finite unions of principal ideals); and the free preframe on A is the set TA of 
ideals (= directed lower sets) in A. (Note that the intersection of two ideals S 
and T in a meet-semilattice is an ideal; for if a and b are two elements of SAT, 
we can find upper bounds c and d for {a,b} in S and T respectively, and then 
c ^d is an upper bound for {a,b} in SAT.) 

For a join-semilattice A, the construction of JA yields the free complete join- 
semilattice on A: it is not necessarily closed under finite unions (or intersections) 
in PA, but it does have binary joins given by 


SVT={svt|seS,teT} 


(and the mapping a + |(a) preserves them), as well as a least element {0}. 
Since it also has directed joins (which are simply unions), it is a complete join- 
semilattice: the proof that it is freely generated by A is similar to those already 
given. The same construction, applied to a distributive lattice A, yields a frame 
which is (unsurprisingly) the free frame generated by A. 

We shall not give explicit constructions of the free functors CjSLat — Frm 
and PFrm — Frm here, since we shall not need them; but we shall discuss the 
relationship between Frm and these two categories further in 1.1.10 below. D 


In passing, we note 


Scholium 1.1.4 The free frame on any set A is of the form O(X) for a suitable 
topological space X. 


Proof From 1.1.3, we see that the free frame on A may be constructed as 
L(K A®°?), which is manifestly a topology on the set KA. Alternatively, we may 
describe it as the open-set lattice of the A-fold cartesian power of the Sierpiński 
space S = {0,1} with topology O(S) = {0, {1}, S}. To establish the isomorphism 
between O(S“) and L(K A°?), note that a nonempty ‘open rectangle’ [Jae 4 Ua 
in S^ is determined by the set {a € A | Ua = {1}}, which is necessarily finite 
(but the inclusion ordering on open rectangles is the reverse of the inclusion 
ordering on finite subsets of A), and that an arbitrary open subset of $4 is 
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determined by the set of open rectangles which it contains, by the definition of 
the product topology. E 


Since the functor O: Sp™® — Set, which sends a space to its set of open 
subsets, is representable by the Sierpiński space, the result just established says 
that elements of the free frame on A correspond (to continuous maps S4 — S, 
and hence) to natural transformations O^ — QO. In other words, the theory of 
frames is exactly the (infinitary) algebraic theory of the functor O, in the sense 
of [724]. 

All the categories in the diagram before 1.1.3, except for Dom and PFrm, are 
equationally presented (that is, they are the categories of models of (infinitary) 
algebraic theories, cf. D1.1.7(a)): hence they are all monadic over Set, and 
in particular they are complete and cocomplete. (The two exceptions are not 
monadic over Set: the monad induced by the forgetful functor Dom — Set 
and its left adjoint is the trivial one whose functor part is the identity, and that 
induced by the forgetful functor PFrm — Set is the same as that induced by 
mSLat — Set, since it is easily seen that every ideal in a meet-semilattice of 
the form KA is principal. On the other hand, the forgetful functor PFrm — 
mSLat is monadic; and PFrm is still complete and cocomplete. These results 
are proved in [548], but we shall not give the details here.) 


Proposition 1.1.5 The categories CjSLat and PFrm both have (symmetric) 
closed monoidal structures. 


Proof We consider first the category CjSLat. The algebraic theory of com- 
plete join-semilattices is commutative (also called entropic, or autonomous in 
the sense of [725]): that is, each of its operations commutes with each of the 
others (and with itself). Hence, given two complete semilattices A and B, the 
set of CjSLat-morphisms A — B is closed under (pointwise) joins in the set 
of all functions A — B, and so forms a complete semilattice [A, B]. If C is a 
third object of CjSLat, then CjSLat morphisms C — [A, B] are easily seen 
to correspond to functions f:C x A— B which are ‘bihomomorphisms’ in the 
sense that c++ f(c,a) is a homomorphism for each fixed a € A and a+ f(c,a) 
is a homomorphism for each fixed c € C. So we define A Q B to be the codomain 
of the universal bihomomorphism with domain A x B: explicitly, it is freely 
generated by symbols a @ b (a € A, b € B), subject to the relations 


(ai @b) =(Yai)@b and \/(a@b,) =a@(\/ b) 


iel wel jEJ JEJ 


for all a,b,a; and b,. Just as for the usual tensor product of abelian groups (or 
of modules over a commutative ring), it is straightforward to verify that this 
construction is commutative and associative up to coherent isomorphisms (for 
example, the ‘twist’ isomorphism AQB — B&A is the unique morphism sending 
a&b to b@a for each (a, b)), that the free complete semilattice on one generator 
is a unit for it (again, up to coherent isomorphism), and that (—) ® A is left 
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adjoint to [A, —]. (The free complete semilattice on one generator is simply the 
power-set P1, by the results of 1.1.3; we shall denote it by Q, because if we 
replace Set by a more general topos € in this discussion, the unit becomes the 
subobject classifier of £. Note also that the generator of Q is its top element T.) 

The argument for PFrm is similar: if A is a preframe, the infinite distributive 
law ensures that the binary meet operation A x A — A preserves directed joins, 
since if (a; | i € T) and (b; | i € J) are directed families (indexed by the same 
directed set J) then (a; Ab; | i € T) is final in the family (a; A b; | i,j € I). Thus 
the theory of preframes is ‘commutative’ in an appropriate sense: in particular, if 
A and B are preframes then the set [A, B] of preframe homomorphisms A — B is 
closed under (pointwise) finite meets and directed joins in the set of all maps A — 
B, and hence forms a preframe. As before, the tensor product of two preframes 
A and B (which we shall denote by A © B, since we wish to distinguish it from 
the CjSLat tensor product) is the codomain of the universal ‘bihomomorphism’ 
from A x B: equivalently, it is generated by elements a©b (a € A, b € B) subject 
to relations which ensure that a +> a © b is a preframe homomorphism for each 
b, and b++ a ©b is a homomorphism for each a. The unit of the tensor product 
is the free preframe on one generator: by the results of 1.1.3, this is the preframe 
I(K 1°), but since K1°? is isomorphic to K1 (even constructively, cf. D5.4.5(ii)) 
we may identify this with [(K1) = P1 = Q. (However, the generator of Q as a 
preframe is its bottom element L.) Oo 


Remark 1.1.6 Although Frm itself does not have a closed structure, its hom- 
sets do have an obvious partial ordering, given by f < g iff f(a) < g(a) for all 
a in the domain of f; and this partial ordering has (pointwise) directed joins, 
since a directed join of frame homomorphisms is both a preframe homomorphism 
and a complete-join-semilattice homomorphism. Thus Frm is enriched over the 
category Dom ~ a fact which we shall subsequently use when working in the 
opposite category Loc. 


Remark 1.1.7 In fact CjSLat has even more categorical structure than that 
described in 1.1.5. It has a self-duality (i.e. a functor (—)*: CjSLat — CjSLat°? 
whose square is the identity): for an object A, A* is simply the opposite poset 
A®°P (we have already noted that this has arbitrary joins if A does), and for 
a morphism f: A— B, f* is the right adjoint of f regarded as a morphism 
BoP — A°P. Moreover, the duality is enriched in the sense that we have [A, B] = 
[B*, A*], naturally in A and B: for the pointwise ordering on the right adjoints 
of morphisms A — B is the opposite of that on the original morphisms, and 
it becomes reversed again when we dualize the objects A and B. It follows 
in particular that we have natural isomorphisms A* © [Q, A*] & [A,*], i.e. 
the duality is representable in the enriched sense. (Of course, Q* is classically 
isomorphic to 2, but they are different in a non-Boolean topos, and so we choose 
to distinguish between them here.) Hence also we have natural isomorphisms 


[A, B] = [B*, A*] = [B*, [A, 9*]] = [(B* @ A), 9*] = (B* @ A)* = (A 8 B*)* 
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or equivalently (A ® B) & [A, B*]*: the interpretation of this last isomorphism 
is that we may identify A ® B with the poset of Galois connections (that is, 
contravariant adjunctions on the right) between A and B. Summarizing the 
foregoing, we say that CjSLat is a *-autonomous category in the sense of |72]. 


We recall that, given a symmetric monoidal category (C,®, I), we may form 
the category CMon(C) of commutative monoids in C, i.e. objects A equipped 
with morphisms m: A@ A — A and e: I — A satisfying the appropriate dia- 
grammatic form of the equations for commutative monoids. We also recall the 
following result (cf. [368]): 


Lemma 1.1.8 Let (C,@,J) be a symmetric monoidal category. Then 


(i) The category CMon(C) has finite coproducts; and the forgetful functor 
CMon(C) — C is universal among functors from categories D with finite 
coproducts to C which map the cocarteszan monoidal structure on D to the 
given monoidal structure on C. 


(ii) If the monoidal structure on C is closed, then the forgetful functor 
CMon(C) —> C creates any filtered colimits which exist in C. 


Proof (i) It is straightforward to verify that if (A,m,e) and (A’,m’,e’) 
are commutative monoids in C, then A @ A’, with its obvious monoid 
structure, is their coproduct in CMon(C): the coprojections are given 
by 1@e€: AZAQI-+ABA' and e€@1:A’SIQA'+ABA', and if 
f: A— A" and f': A’ — A" are two monoid homomorphisms the induced homo- 
morphism A @ A’ — A” is m”(f & f’). Similarly, the unit J with its obvious 
monoid structure is an initial object of CMon(C). On the other hand, if D is 
any category with finite coproducts, then each object B of D has a unique com- 
mutative monoid structure given by the codiagonal map B+ B — B and the 
unique morphism 0 — B; so any functor D — C which maps the finite coproduct 
structure onto @ has a unique factorization through CMon(C) — C. 

(ii) Suppose A = lim;¢,A; is a filtered colimit in C, where each A; has a 
commutative monoid structure and the transition maps A; — A, are monoid 
homomorphisms. Since A Q (—) preserves colimits, it is easy to see that we have 
AQA ©& lim; j,erx1(A; ® A,); but the fact that J is filtered implies that the 
diagonal I — I x J is final (cf. B2.5.12), and so A@ A S lim;¢;(A; ® A;). Hence 
the multiplications m;: A; ® A; — A; induce a unique morphism m: A @ A — A; 
and we similarly have a morphism e: J — A induced by any of the e;: I > A; 
(since J is nonempty and connected). The verification that these morphisms are 
a commutative monoid structure on A, and that they make A into the colimit 
of the A; in CMon(C), is straightforward. D 


We note that the argument of 1.1.8(ii) also works for reflexive coequalizers 
(cf. A1.2.12); thus we could have replaced the word ‘filtered’ by ‘sifted’ in the 
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statement. Also, since arbitrary colimits may be constructed from finite coprod- 
ucts, reflexive coequalizers and filtered colimits, 1.1.8 in principle tells us how to 
construct arbitrary colimits in CMon(C) in terms of C. 

More explicitly, the identification of finite coproducts in CMon(C) with ten- 
sor products may be extended to pushouts, by the following device. Given a 
commutative monoid (A, m,e) in C, we may define an A-module to be an object 
B of C equipped with an action map 8: A & B — B which is associative and 
unitary in the obvious sense. Provided C has reflexive coequalizers, and the 
functor A Q (—) preserves them, the category A-Mod(C) of A-modules in C has 
a monoidal structure, with the tensor product B &4 C of two modules defined 
to be the coequalizer of 


B@1 
BRAC Z3 BQC 
187 


(note that since A is commutative we do not have to distinguish between left and 
right A-modules), and unit given by A itself. Now if f: A — B is a morphism of 
CMon(C), then B acquires an A-module structure given by 


f@l 


AQB— > BOB BP 


moreover, the multiplication of B factors through the quotient map B & B —> 
B®, B, making B into a commutative monoid in A-Mod(C). Conversely, any 
commutative monoid B in A-Mod(C) is in particular a commutative monoid in 
C, and its unit map in the former category becomes a monoid homomorphism 
A — B in the latter. These two constructions establish an equivalence between 
the co-slice category A\CMon(C) and CMon(A-Mod(C)); so, in particular, we 
obtain 


Corollary 1.1.9 Suppose C is a symmetric monoidal category with reflexive 
coequalizers which are preserved by the functors A & (—), and let f: A> B 
and g: AC be two commutative monoid homomorphisms in C with common 
domain. Then (the codomain of) their pushout may be identified with the A- 
module tensor product B &4 C. oO 


The significance of 1.1.8 and 1.1.9 for us is contained in 


Proposition 1.1.10 The category Frm may be identified with coreflective 
(full) subcategories of both CMon(CjSLat) and CMon(PFrm). 


Proof If A is a frame, then the infinite distributive law of 1.1.1 says precisely 
that the binary meet map A: A x A— A is a bihomomorphism of complete 
join-semilattices; so it induces a map m: A & A — A. And the top element 1 
of A corresponds to a unique CjSLat morphism e: Q — A. The fact that m 
and e satisfy the equations for commutative monoids is immediate from the 
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fact that ^ and 1 do so (with respect to the cartesian monoidal structure); and 
it is clear that if B is another frame then a CjSLat morphism A — B is a 
morphism of commutative monoids iff it also preserves finite meets, i.e. iff it is a 
frame homomorphism. Thus we have identified Frm with a full subcategory of 
CMon(CjSLat). 

Of course, not every commutative monoid in CjSLat is of this form; but we 
may characterize those which are, as follows: (A, m,e) is a frame iff a < e(T) and 
a < m(a®a) for alla € A. For these conditions clearly hold in a frame; conversely, 
if the conditions hold, then for any a,b € A we have m(a@b) < m(a&e(T)) =a 
and m(a &b) < b similarly, and if c < a and c < b then c < m(c®c) < m(a@b), 
so m(a Qb) is the meet of a and b. Now, if (A,m, e) is an arbitrary commutative 
monoid in CjSLat, consider the set 


A; ={a€Aj|a<e(T) anda < m(a&a)}; 


it is clear that A; is a sub-complete-semilattice of A, since if a is the join of 
a family (a; | i € J) then m(a @ a) > m(a; Q ai) > a; for each i. Moreover, 
if a and b are in Ay so is m(a & b), using the fact that m is commutative and 
associative; and hence m maps Ay ® Ay into Af. Similarly, since e(T) € Ay, 
e factors through As — A, and thus Ay is a sub-commutative-monoid of A. 
By the remarks above, it is a frame; and it is also clear that any commutative 
monoid homomorphism from a frame to A must take values in Ay. So we have 
constructed the required coreflection. 

The arguments in the preframe case are very similar. This time we consider 
the binary join map V: A x A — A ofa frame A as a preframe bihomomorphism: 
it is a bihomomorphism for finite meets by the (finitary) distributive law, and for 
directed joins (in fact for all nonempty joins) since a V (Vier bi) = Vies(a V bi) 
provided a appears at least once on the right-hand side. The induced pre- 
frame morphism m: A © A — A, together with the unique preframe morphism 
e: Q — A sending the preframe generator L of Q to 04, defines a commutative 
monoid structure on A; and a preframe homomorphism between frames is a 
homomorphism of commutative monoids iff it preserves finite joins, i.e. iff it is a 
frame homomorphism. A commutative monoid (A, m,e) in PFrm arises in this 
way from a frame iff e(_L) is the least element of A and m(a@®a) < a for alla € A; 
using this, we may prove as before that Frm is coreflective in CMon(PFrm). 

o 


Remark 1.1.11 It follows from 1.1.8(i) and 1.1.10 that if A and B are frames 
then their coproduct in Frm may be identified with both their complete- 
semilattice tensor product A & B and their preframe tensor product A © B; 
so these two are isomorphic as posets. However, the isomorphism does not iden- 
tify the generators of A & B with those of A © B; if we write 1 and v2 for the 
coprojections in Frm, then a @ b is 4\(a) A v2(b), and a ©b is v(a) V v2(b). It 
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follows easily that they are related to each other by the equations 
(a©®b)=(a@l1lg)V(14@b) and (a@@b) =(aO0g)A(04 08). 
Similar remarks apply to pushouts in Frm, using 1.1.9. 


The representations of frames as commutative monoids in CjSLat and in 
PFrm will be extensively exploited in Sections C3.1 and C3.2 respectively. 
However, before leaving the topic for now, we note one useful application 
of 1.1.10: 


Corollary 1.1.12 Let (A; |i € J) be the vertices of a filtered diagram in Frm, 
whose transition maps A; — A; are all injective. Then the legs A; — lime, A; 
of the colimiting cone in Frm are also injective. 


Proof By 1.1.8(ii) and 1.1.10, it suffices to prove the corresponding result in 
the category CjSLat. But in the latter category we can appeal to the duality 
of 1.1.7: in other words, it suffices to show that, given a cofiltered diagram of 
complete semilattices (B; | 1 € I) and surjective homomorphisms, the induced 
maps à; : lim;¢;B; — B, are surjective. Let b € B; for some i; since the forgetful 
functor CjSLat — Set creates limits, we have to construct a family of elements 
(b, € B, | j € I) such that b; = b and such that (bk) = b; for each edge 
3: By — B; of the diagram. For each j, we may choose morphisms a: By — B; 
and y: By — B; in the diagram with common domain; we define b; = y(a*(b)), 
where a* is the dual of a. It is clear that, if we can show b; to be independent 
of the choice of the pair (a,y), then (b, | 7 € J) will be a compatible family. 
But if we had made a different choice (a’,y’), we could find ô: Bı — Bk and 
6’: Bi — By in the diagram with aô = a’d’ and yô = +06’, so that 


y(a*(b)) = yd(d"a*(b)) 


= y's ("a (b)) 

= y'(a"* (b)) 
where we have used the fact that ô and 6’ are surjective and so the counits of 
the adjunctions (6 4 6*) and (6’ 4 6’*) are identities. 0 


Since Frm is monadic over Set, it is an effective regular category as defined in 
Section A1.3: in particular, for any frame A, there is a bijection between regular 
quotients of A (that is, isomorphism classes of regular epimorphisms A — B) 
and equivalence relations on A. Since monomorphisms in Frm are injective frame 
maps, the latter may be identified with frame congruences on A, that is subsets 
R C Ax A which are both equivalence relations on (the underlying set of) A 
and subframes of A x A. But there are a number of other ways of representing 
regular quotients of A, which will be of use to us. First we note that, since a 
frame congruence R is closed under arbitrary joins in A x A, each R-equivalence 
class has a greatest member (the join of all its members); thus we can represent 
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R by the mapping (j, say) A — A which sends each element of A to the greatest 
member of its R-equivalence class, since we then have (a,b) € R iff 7(a) = j (b). 
And the properties of 7 needed to ensure that the relation defined in this way is 
a frame congruence are simply 


(i) a < j(a); 
(ii) j(a) = j(j(a)); and 
(iii) j(a Ab) = j(a) A j(b) 


for all a,b € A. We call such a mapping a nucleus on A. (The above conditions 
should be compared with the definition of a local operator in a topos (A4.4.1): a 
local operator is simply an internal nucleus on the internal frame Q, and much 
of the discussion of nuclei which follows in this section exactly parallels the 
discussion of local operators in Section A4.5.) Further, a nucleus j is determined 
by its image (equivalently, by (ii), its firset A; = {a € A | j(a) = a}), since 
conditions (i) and (ii) (plus the fact that j is order-preserving) say that J is left 
adjoint to the inclusion A; — A. Moreover, in this context condition (iii) on 7 is 
equivalent (by A4.3.1) to the assertion that A; is an exponential ideal as defined 
in Section A1.5, i.e. that (a=b) € A, whenever b € A;, where = is the Heyting 
implication in A (cf. 1.1.2). We have thus established 


Proposition 1.1.13 For any frame A, there are bijections between any two of 
the following sets: 


(i) the set of regular quotients of A, i.e. isomorphism classes of surjective 
frame homomorphisms with domain A; 


(ii) the set of frame congruences on A; 
(iii) the set of nuclei on A; 


(iv) the set of fixsets in A, 1.e. subsets which are exponential ideals and closed 
under arbitrary meets. o 


We remark that it is possible to proceed directly from a regular epimorphism 
q: A — B to the nucleus j on A which corresponds to it: in fact j is the composite 
qxq, where g,: B — Ais the right adjoint of q. (This is because q,(b) is the largest 
a € A satisfying g(a) = b.) And, given j, we may recover q directly: A;, ordered 
as a subset of A, is isomorphic to the quotient A/R (since it contains one element 
of each R-equivalence class) and hence a frame; and j itself, considered as a map 
A — A, is a (surjective) frame homomorphism (it preserves joins because it is 
left adjoint to the inclusion). 


Remark 1.1.14 The three conditions in the definition of a nucleus may be 
combined into a single equation: for any frame A, a function j: A— A is a 
nucleus iff it satisfies the condition that 


(a= j(b)) = (3(@) = 3(b)) 
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for all a,b € A. To prove this, suppose first that j is a nucleus; then from 
a < j(a) we obtain (a = j(b)) > (j(a) = j(b)), since = is order-reversing in its 
first variable. But we also have 


j(a) A (a= j(b)) < jla) A j(a= j(b)) 
= j(a ^ (a= j(b))) 
< j(i(b)) = jC), 


so (a => j(b)) < (j(a) = j(b)). Conversely, suppose j satisfies the condition. 
Putting a = b, we obtain (a > j(a)) = 14, so a < j(a) for all a. Similarly, 
putting a = j(b) yields j(j(b)) < j(b), whence j(j(b)) = j(b). Next, ifa < b 
then a < j(b)}, whence we obtain (j(a) = j(b)) = 14, so j is order-preserving. 
Finally, from a Ab < j(a ^b) we obtain a < (b= j(a A b)) = (j(b)= j (a A b)), so 
a \j(b) < jla ^b), and a similar argument yields j(a) A j(b) < j(a Ab); but the 
reverse inequality follows from the fact that j is order-preserving. However, in 
practice it is usually easier to verify the three separate conditions, for a given j, 
than to verify the single equation given here. 


Of course, each of the sets described in 1.1.13 has a canonical ordering; but 
some of the bijections between them are order-reversing. If we order congruences 
by inclusion, this corresponds to the pointwise ordering on nuclei (i.e. j < k iff 
j(a) < k(a) for all a € A), but it is the reverse of the inclusion ordering on 
fixsets, since larger nuclei have fewer fixed points. For definiteness, we shall deal 
with nuclei in what follows, and write N(A) for the set of all nuclei on A with 
its pointwise ordering. 


Proposition 1.1.15 For any frame A, N(A) is a frame. 


Proof Itis straightforward to verify that the (pointwise) meet of an arbitrary 
family of nuclei is a nucleus; this defines arbitrary meets in N(A), and shows 
that it is a complete lattice. However, joins in N(A) are not in general computed 
pointwise; so it is not possible to deduce the infinite distributive law in N(A) 
directly from that in A. One possible proof, which may be found in [520], involves 
constructing the Heyting implication operation on N(A) and appealing to 1.1.2; 
we shall give a simpler proof, due to J. T. Wilson [1229], which in fact shows 
that the fixsets in A form a coframe in the inclusion ordering. For this, we note 
first that an arbitrary intersection of fixsets is a fixset. The join of two fixsets 
B and C may be described as the set {b Ac |b € B,c € C}; this follows from 
the fact that meets of nuclei are computed pointwise, since an element a of A is 
(j ^ k)-fixed iff it is the meet of j(a) and k(a). Now suppose B and Cj, i € I, 
are fixsets, and let a € (),<;(B V Ci). Then we have a = b Ac; for each i, where 
b is the least element of B above a and c; is the least element of C; above a. So 


(b=>a) = (b= (bAc;)) = (b>c;) € C; for each i 
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since C; is an exponential ideal, whence a = (b^a) = (b^ (b= a)) € BY Nicer Ci 
Thus we have 

(\Bvc)CBV()Cs 

icl i€l 


but the reverse inclusion holds in any complete lattice. g 


We next consider some important examples of nuclei and fixsets: the first 
three of the following examples should be compared with their topos-theoretic 
analogues introduced in A4.5.1, A4.5.3 and A4.5.9 respectively. 


Examples 1.1.16 (a) Let A be a frame, a an element of A. The principal 
ideal | (a) is easily seen to be a frame, and the infinite distributive law implies 
that (—) Aa: A —> | (a) is a (surjective) frame homomorphism. So by 1.1.13 it 
corresponds to a nucleus on A. Since the right adjoint of (—) Aa is a>(-—), it is 
easy to see that this nucleus is simply a>(—): A — A. We denote this nucleus 
by o(a), and call it the open nucleus determined by A, for reasons which will 
become clear in 1.2.6 below. 

(b) Again, let a be an element of a frame A. The principal filter | (a) generated 
by a is also a frame, and (—) Va: A — 7 (a) is a surjective frame homomorphism 
by distributivity. The right adjoint of this homomorphism is simply the inclusion 
T (a) — A, and so the corresponding nucleus is simply (—) V a (and the corre- 
sponding fixset is } (a) itself). We denote this nucleus by c(a), and call it the 
closed nucleus determined by a. We note that o(a) and c(a) are complementary 
elements of N(A), for any a: for their pointwise meet o(a) A c(a) satisfies 


(o(a) A c(a))(b) = (a=b) A (a V b) 
= (aA (a= b)) V (b ^A (a =b)) 
= (a ^b) Vb=b 


using standard identities for Heyting algebras (cf. A1.5.11), and so it is the 
identity mapping, i.e. the least element of N (A). On the other hand, their join 
corresponds to the intersection of the corresponding fixsets, i.e. to 


{b€ A| (a< b) and (a=b) = b}, 


but this is the singleton {1}, i.e. the smallest fixset in A. 

(c) We call a fixset A; (or the corresponding nucleus j) dense if it has non- 
trivial intersection with every nontrivial open fixset in A. It is not hard to see 
that this condition is equivalent to saying that j(0) = 0; for if j(0) = a then 
A;N Aja) = {1}, by the argument given above, and conversely if 0 € A; then 
“a = (a => 0) € A; N Aoa) for any a. We note that every frame contains a 
smallest dense firset, namely the subset A.. = {7a | a € A}: this is a fixset, 
since it is closed under arbitrary meets (—— being left adjoint to the inclusion 
A_. — A) and an exponential ideal because (a = 7b) = ~(a A b) for any a 
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and b. And A~ is dense since -7=0 = 71 = 0; but if A; is any dense fixset then 
~ma = (~a >00) € A, for all a € A. 

(d) We shall also occasionally need to invoke the stronger notion of flat 
nucleus: a nucleus j is called flat if it preserves finite joins (including the empty 
join 0). Equivalently, a fixset A, is flat if it is a sublattice of A. In general, flat 
nuclei are much less common than dense ones; we note in particular that the 
largest dense nucleus >> is flat iff A is a Stone algebra, a condition which we 
shall discuss in greater detail in Section D4.6. 

(e) Let (P,T) be a site (as defined in A2.1.9) whose underlying category P 
is a poset. (Such a site is occasionally referred to as a posite.) The subterminal 
objects in the functor category [P°P, Set] are those functors F such that each 
F(p) is either empty or a singleton; so, up to isomorphism, we may identify 
them with the subsets {p € P | F(p) 4 0}, i.e. with the lower subsets of P. The 
lower sets which correspond to subterminal objects of the subcategory Sh(P, T) 
are those sets I with the closure property that, if p is any element of P and 
R € T(p) is a cover all of whose members are (more strictly, have their domains) 
in J, then p itself must be in I. We call such subsets T-ideals; the name derives 
from the fact that, if P is a distributive lattice considered as a coherent category, 
and T is its coherent coverage, then these subsets are exactly the ideals of P in 
the classical sense. (Thus the result we are about to establish includes the fact, 
already mentioned in the proof of 1.1.3, that the ideals of a distributive lattice 
form a frame.) We write T-Idl(P) for the set of all T-ideals of P, ordered by 
inclusion; we claim that it is a fixset in the frame LP of all lower sets in P (so 
that in particular it is a frame — though we could have deduced that from the fact 
that we have identified it with the lattice of subterminal objects of a cocomplete 
topos). It is clearly closed under arbitrary intersections in LP; so we need only 
verify that it is an exponential ideal. But the Heyting implication in LP is given 
by 


(=J) = {p € P | (Yq < pX((4 € I)> (4 € J))}, 


since the latter is the largest lower set whose intersection with J is contained in 
J. If J is a T-ideal, and p € P has a covering family R all of whose members 
are in (I= J), then-for any q < p such that q € I, we have a covering family R’ 
on q, by the stability condition (C’) of A2.1.9, each of whose members lies below 
some member of R, and is therefore in J since it is in the lower set I. So every 
such q is in J; hence p € (I> J), as required. 

We note that T-Idl(P) is a dense fixset in LP (that is, it contains the least 
element 0 of LP) iff every T-covering family is nonempty. (We shall return to this 
condition in 2.2.4(e) below, and again in 3.1.18.) If in addition every T-covering 
family is (upwards) directed as a subset of P, then T-Idl(P) is flat in LP; for 
if J and J are T-ideals and R € T(p) is a directed cover all of whose members 
lie in JU J, then either R C J or R C J (if not, we can find gq, € R\ J and 
q2 E€ R\ J, and then an upper bound for {q1, q2} in R cannot be in either J or 
J), so p € IU J. More generally, if S and T are two coverages on P such that 
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S C T, and every cover in T \ S is directed, then the same argument shows that 
T-Id\(P) is flat in S-Idl(P). In particular, taking P to be a frame, S to be the 
coherent coverage on P (i.e. the coverage given by all finite joins in P), and T to 
be the coverage given by all joins which are either finite or directed, we see that 
the S-ideals are the ideals in the usual sense and the T-ideals are the principal 
ideals; thus we have shown that every frame occurs, up to isomorphism, as a flat 
fizset in one of the form IA for a distributive lattice A. 

We remark that the use of non-constructive logic in the preceding paragraph 
cannot be avoided: the complementations can be eliminated, but if this is done 
the argument is then seen to depend on an instance of the ‘dual Frobenius rule’ 
that (¢ v (Yr)y(x)) may be deduced from (Vx)(¢ V y(x)) - cf. D1.3.9. However, 
the final italicized result is constructive: it is easy to see directly that any frame 
A is both a sublattice of, and a fixset in, its ideal lattice TA. 


A further important tool in elucidating the structure of N(A) is the following: 


Lemma 1.1.17 Any nucleus on a frame is expressible as a join of nuclei of 
the form o(a) A c(b). 


Proof Let A be a frame, and j an arbitrary nucleus on A. For each a € A, 
consider the nucleus ka = o(a) A c(j(a)). We have 


kq(a) = (aa) A (j(a) Va) = 1A j(a) = j(a) . 
But for any b, we have 


(a=b) A j(a) < j(a=>b) ^ j(a) = j((a =b) Na) < j(b), 


so that 
ka(b) = (a=>b) A (j(a) V b) 
= ((a= b) ^ j(a)) V ((a=>b) A b) 
< j(b) V b = j (b) 
Thus j is the (pointwise) join of the nuclei ka, a € A. o 


Next, we investigate the functoriality of the construction A +» N(A). For 
this we need 


Lemma 1.1.18 Leth: A— B be a frame homomorphism, a € A. Then 


(i) h factors through A > Aga) iff h(a) = 1B. 
(ii) h factors through A > Aca) iff h(a) = Op. 
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(iii) There are pushout squares 


Alep and yee ENE: 


Aola) ——> Bonca)) Ac(ay ——> Be(n(a)) 
in Frm. 


Proof (i) One direction is immediate, since a maps to the top element of 
Aj(a)- Conversely, suppose h(a) = 1g. Then for any b € A, we have 


h(a=>b) < h(a) => h(b) = 1=>h(b) = h(b) 


(where the first inequality derives from the fact that h preserves finite meets); 
so h identifies any two elements of A which are identified by o(a). 
(ii) is similar, using the fact that a maps to the bottom element of A_,q). 
(iii) follows immediately from (i) and (ii). 


Proposition 1.1.19 The assignment A ++ N(A) defines a functor Frm —> 
Frm; and the frame homomorphisms c: A — N(A) define a natural transforma- 
tion from the identity to this functor, which is both monic and epic. 


Proof Given a frame homomorphism h: A — B, the operation of pushing 
out maps along h preserves regular epimorphisms, and so defines a mapping 
N(A) — N(B), which we take to be N(h). It requires verification that N(h) is 
a frame homomorphism; but it clearly preserves joins, since they correspond to 
cointersections of regular quotients, and so we need only consider finite meets. 

First we consider a meet of the form o(a) A c(b). From 1.1.18(iii), we have 
N(h)(o(a)) = o(h(a)) and N(h)(c(b)) = c(h(b)); and since o(a) A c(b) is com- 
plementary to c(a) V o(b), by 1.1.16(b), a straightforward calculation shows that 
N(h)(o(a) A c(b)) = o(h(a)) A c(h(b)). Also, since 0o(a;) A o(a2) = o(a; V a2) 
and c(b,) A c(b2) = c(b) A b2), the meet of two nuclei of this form is again of 
the same form, and N(h) preserves all such meets. The general case now follows 
from 1.1.17: by this result, plus the infinite distributive law in N(A), we can 
reduce an arbitrary meet j A k to a join of pairwise meets of nuclei of the form 
o(a) A c(b), and hence it is preserved by N(h). 

Next, we observe that c is a frame homomorphism: it preserves finite meets 
by distributivity of A, and arbitrary joins because joins commute with joins — 
note that joins of closed nuclei are actually computed pointwise. The fact that 
c is a natural transformation is immediate from 1.1.18(iii); it is monic because 
we can recover a from c(a) by applying it to the least element of A, and epic 
because we saw in 1.1.17 that N(A) is generated as a frame by the elements c(a) 
and their complements (and frame homomorphisms preserve complements). O 
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Corollary 1.1.20 
(i) For any frame A, c: A— N(A) is universal among frame homomorphisms 
h: A — B mapping all elements of A to complemented elements of B. 


(ii) c: A— N(A) is an isomorphism iff A is a complete Boolean algebra. 


Proof (i) By 1.1.16(b), we know that c has this property. Suppose given 
another such homomorphism h. Then, for any a € A, N(h)(c(a)) = c(h(a)) is a 
closed nucleus on B; but so is N(h)(o(a)), since o(b1) = 7c(b1) = c(b2) if bı and 
b2 are complementary elements of B. It now follows from 1.1.17 that N(h) maps 
arbitrary nuclei on A to closed nuclei on B, i.e. it factors through c: B — N(B). 
This factorization is a frame homomorphism N(A) — B extending h; and it is 
unique, since c: A — N(A) is epic. 

(ii) It is clear that the identity morphism on A has the universal property 
mentioned in (i) iff all elements of A are already complemented, i.e. iff A is a 
Boolean algebra. o 


Remark 1.1.21 Using 1.1.20, we might try to construct a left adjoint for 
the inclusion functor CBool — Frm, where CBool is the category of complete 
Boolean algebras, as follows. Given a frame A, we may define a sequence of frames 
Na(A) for all ordinals a, and frame homomorphisms ca g: Na( A) > Ng(A) for 
all œ < 8, by recursion: 


(i) No(A) = A, coo = 1a (in fact coq is the identity for all a); 
(ii) Na+ı(4) = N(Na(A)), Cs,a+1 = C` Ch a (where the unsubscripted ‘c’ is 
the homomorphism of 1.1.19); 
(iii) if A is a nonzero limit ordinal, then N, (A) is the colimit in Frm of the 
diagram formed by the Ny(A) and the ca, g for a, 3 < A, and the cy, are 
the legs of the colimiting cone. 


An easy extension of 1.1.20 shows that any frame homomorphism h: A —> B, 
where B is a complete Boolean algebra, factors uniquely through 
Coa: A + Na (A) for all a; hence if there exists a such that Na(A) is Boolean 
(equivalently, such that ca,a+1 is an isomorphism), then N.(A) is the reflection 
of A in CBool. 

However, in general this reflection does not exist: for it is well known that 
the forgetful functor CBool — Set has no left adjoint (specifically, the free 
complete Boolean algebra on a countable infinity of generators does not exist), 
and so if we take A to be the free frame on a countable infinity of generators, 
i.e. the open-set lattice of the countable power of Sierpinski space (cf. 1.1.4), 
then the frames N,(A) are all non-Boolean. But it is also easy to see that the 
morphisms Co, a are epic (as well as monic) for all a: given h,k: Nal A) 3 B such 
that h -Coa = k- Coa, we may prove by induction on 8 that h-cgq = K- Ca 
for all 8 < a. So we conclude that the frame A = O(S) has a proper class 
of epimorphic images in Frm, no two of which are isomorphic in the co-slice 
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category A\Frm. In particular, the category Frm is not well-copowered, and it 
does not have a cogenerating set of objects (cf. A1.4.17). 


Finally in this section, we briefly introduce the ‘fibrewise’ versions of dense- 
ness and closedness for nuclei, which we shall need for our development of the 
theory of localic groups in Chapter C5. Given a frame homomorphism h: A — B, 
we say that a nucleus j on B is fibrewise dense relative to A if j(h(a)) = h(a) 
for alla € A. We also say j is fibrewise closed relative to A if is the least nucleus 
in its A-equivalence class, where two nuclei are called A-equivalent if they agree 
on all elements of the form h(a), a € A. The reason for the names is that in the 
‘classical’ case when h is the frame homomorphism O(X) — O(Y) induced by 
a continuous map of spaces f: Y — X, and the nucleus j corresponds to a sub- 
space Y’ of Y, then j is fibrewise dense (resp. fibrewise closed) relative to O(X) 
iff Y’ N f—!(zx) is dense (resp. closed) in the fibre f~'(z), for each point z € X. 
(We shall prove this in 1.2.16 below.) However, we do not have a corresponding 
notion of ‘fibrewise open nucleus’, as might have been expected; fibrewise closed 
nuclei are not in general complemented in N(B). (We may see this by taking 
h to be the identity map B — B, in which case every nucleus on B becomes 
fibrewise closed.) 

Since f preserves the bottom element, it is clear that fibrewise dense implies 
dense in the sense of 1.1.16(c), and similarly closed in the sense of 1.1.16(b) 
implies fibrewise closed, since c(b) is the least nucleus sending 0 to b. If A is the 
two-element frame {0,1}, then the converse implications hold, since any nucleus 
fixes 1; however, constructively it is not true that the converse implications 
hold even for fibrewise closedness and denseness relative to the initial frame Q - 
we shall sometimes use the terms ‘weakly closed’ and ‘strongly dense’ for the 
fibrewise notions relative to Q, to distinguish them from the notions defined in 
1.1.16. We note the following elementary properties of fibrewise closedness and 
denseness: 


Lemma 1.1.22 Leth: A — B be a frame homomorphism. Then 


(i) There exists a largest A-fibrewise dense nucleus on B. 


(ii) For any nucleus j on B, there exists an A-fibrewise closed nucleus j^ (its 
A-fibrewise closure) which agrees with j on elements of the form h(a), 
ace A. 


(iii) The A-fibrewise closed nuclei on B form a subframe N4(B) of N(B), and 
there 1s a pushout square 


in Frm. 
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Proof Since the notions depend only on the image of h: A — B, we shall 
simplify our notation by assuming that h is injective and identifying A with a 
subframe of B. (Indeed, for much of this proof, we could generalize to the case 
when A is an arbitrary subset of B; but we shall not need this extra generality.) 

(i) We have to show that an arbitrary join of fibrewise dense nuclei is fibrewise 
dense. But joins of nuclei correspond to intersections of fixsets, as we observed 
in the proof of 1.1.15; and it is clear that j is fibrewise dense iff the fixset Bj 
contains A. So the result is immediate. 

(ii) Clearly, we may define j4 to be the meet of all nuclei agreeing with j on 
the subframe A, since meets of nuclei are constructed pointwise. Alternatively, 
we may define 74 to be the join 


V/{(0(a) A e(j(a))) | a € A}, 


and verify as in the proof of 1.1.17 that we have j4(b) < j(b) for all b, with 
equality if b € A. But this definition of 74 depends only on the A-equivalence 
class of j, as defined above; so j4 is the least element of this equivalence class. 

(iii) We claim that N4(B) is exactly the subframe of N(B) generated by all 
elements of the form o(a) (a € A) or c(b) (b € B). It is clear from the second 
construction of the fibrewise closure in (ii) that every fibrewise closed nucleus 
belongs to this subframe; for the reverse inclusion, it suffices to show that any 
nucleus of the form o(a) A c(b) is fibrewise closed, since the set of nuclei of this 
form is closed under finite meets, and it is clear that joins of fibrewise closed 
nuclei are fibrewise closed (since, by (ii), they form a coreflective sub-poset of 
N(B)). But if j is any nucleus agreeing with o(a) A c(b) on elements of A, then 
in particular j(a) = (a V b), whence for any b’ we have 


jb) > j(a) Nj(asb’) > bA (a=b); 


and 7(b’) > b’ in any case, so j(b’) > (b V b’) A (a=b) = (c(b) A o(a))(b'). 

In particular, we have shown that N,4(B) is a subframe of N(B). To show 
that the diagram is a pushout, it suffices to note that this subframe contains 
the image of c: B — N(B), and that a frame homomorphism k: B — C factors 
(uniquely) through B — N4(B) iff k(a) (that is, k(h(a))) is complemented in C 
for all a € A, by the argument in the proof of 1.1.20(i). g 


We note that the fibrewise closure j4 of j may be characterized as the unique 
fibrewise closed nucleus k on B such that B, is fibrewise dense as a fixset in By. 
We shall derive some further elementary properties of these notions in 1.2.14 
below. 


Suggestions for further reading: Johnstone [520], Johnstone & Vickers [548], 
Simmons [1110, 1111], Vajner [1182]. 
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C1.2 Locales and spaces 


In the previous section, we studied frames from an ‘algebraic’ point of view. 
However, it is important to recognize that they have topological aspects as well: 
we are primarily interested in thinking of frames as if they were lattices of the 
form O(X) for a topological space X, even though in fact they are not all of this 
form. For this reason, it is convenient to introduce a different terminology and 
notation, which we shall use when thinking of frames from a topological point 
of view. 


Definition 1.2.1 A locale is the same thing as a frame; however, we use letters 
like X,Y,... to denote locales, and write O(X) for the frame corresponding 
to a locale X. Elements of O(X) will be denoted by letters like U,V,.... A 
continuous map of locales f: X — Y is defined to be a frame homomorphism 
f*: O(Y) > O(X). We write Loc for the category of locales — that is, for the 
category Frm’°?. 


Thus the functor O: Sp ~ Frm™ becomes nameless as a functor Sp —> Loc: 
if X is a space, we write X also for the locale which ‘is’ the frame O(X) of 
open subsets of X. (If we allow consideration of non-sober spaces ~- which we 
shall shortly legislate out of existence — this notation is potentially ambiguous; 
for the moment, we shall remove the ambiguity by writing X, for the locale 
corresponding to a space X.) 

We define a point of a locale X to be a continuous map p: 1 — X, where 1 
denotes (the locale corresponding to) the one-point space; equivalently, it is a 
frame homomorphism p*: O(X) > O(1). Such a map p* is determined by the 
subset F = {U € O(X) | p*(U) = 1} of O(X); the requirement that p* be a 
frame homomorphism corresponds to the requirement that F be a completely 
prime filter in O(X), that is a filter such that V S € F implies SN F 4 Q, for 
any S C O(X). So we frequently identify points of X with such filters; that is, 
instead of regarding open sets as sets of points (as is usual in topology), locale 
theory regards points as (appropriate) sets of ‘open sets’. (There is clearly scope 
for a more symmetric development of topology, in which we take both ‘point’ 
and ‘open set’ as primitive notions, together with a binary relation between 
them. For an indication of this line of development, see [1205].) We note that 
if X is a space and p a point of X (in the classical sense), then the completely 
prime filter in O(X) which corresponds to p is simply {U € O(X) | p € U}, the 
neighbourhood filter of p. 

Classically, a filter is completely prime iff its complement is a principal prime 
ideal; so it is tempting to identify the points of a locale with the prime elements 
of the corresponding frame (i.e. those which generate prime principal ideals). 
However, we shall do our best to resist this temptation, since we wish as many 
as possible of our arguments concerning locales to be constructively valid, and so 
applicable to internal locales in an arbitrary topos. (There are some results — for 
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example 1.2.4 below — where we shall have to employ non-constructive reasoning; 
we shall try always to indicate these explicitly.) 

Writing X, for the set of all points of a locale X, we have a map 
dx: O(X) > P(X,) (where P(A) denotes the power-set of A), given by 
@x(U) = {p € Xp | p*(U) = 1}. It is easy to verify that ¢x is a frame homo- 
morphism, and so its image is a topology on Xp; from now on, we regard Xp 
as a space equipped with this topology. Moreover, if f: X — Y is a continuous 
map of locales, then composition with f* defines a mapping Xp — Yp, which is 
easily seen to be continuous with respect to the topologies just defined; that is, 
X ++ X, becomes a functor Loc — Sp. 


Lemma 1.2.2 The functor X +> Xp just defined is right adjoint to the functor 
X + X,: Sp — Loc. Moreover, the adjunction is idempotent. 


Proof For a space X, the mapping nx which sends a point p of X to the 
completely prime filter {U € O(X) | p € U} is a continuous map X — Xjp, since 
we have nx'(¢x,(U)) = U for any U € O(X). And for a locale Y, we have a 
continuous map ey : Yp — Y given by e}, = gy: O(Y) > O(Y,). These maps are 
easily verified to be natural in X and Y, and to satisfy the triangular identities 
for an adjunction. We note that ¢y is always surjective, by the definition of the 
topology on Yp; if Y is a locale of the form X;, then it is bijective, since if U 
and V are distinct open subsets of X then there must be a point of X lying 
in just one of them, and composition with 7x yields a point of Y lying in just 
one of ġy (U) and dy (V). Thus ex, is an isomorphism for all spaces X, i.e. the 
adjunction is idempotent. (It follows from the general theory of adjunctions that 
ny, must be an isomorphism for all locales Y; in fact this could have been proved 
directly, by a similar argument.) o 


We call a space X sober if nx is an isomorphism; in fact, given the way 
in which the topology is defined on Xip, this is equivalent to demanding that 
nx be bijective, i.e. that each completely prime filter in O(X) is of the form 
{U € O(X) |p € U} for a unique p E€ X. And we call a locale Y spatial if ey is 
an isomorphism. We thus obtain three immediate corollaries of 1.2.2: 


Corollary 1.2.3 
(i) The full subcategory Sob of Sp whose objects are sober spaces is reflective 
in Sp, the reflector being gwen by X ++ Xip. We call Xj, the sobrification 
of X. 
(ii) The full subcategory SLoc of Loc consisting of spatial locales is coreflec- 
tive, the coreflector being given by Y > Yp- 


(iii) The adjunction of 1.2.2 restricts to an equivalence of categories between 
Sob and SLoc. O 


In passing, we note that the To axiom for spaces is equivalent to the assertion 
that 1x is injective, since nx (p) = nx (q) iff there is no open set in Xcontaining 
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just one of p and q. Thus sober spaces are necessarily Tp. In the opposite 
direction, we have 


Lemma 1.2.4 Hausdorff spaces are sober. 


Proof Let X be a Hausdorff space, and F a completely prime filter in O(X). 
Then 


v= LU | U € (O(X)\ F)} 


is an open set which is not the whole of X (since it is not in F) but has the 
property that any open set strictly containing it must be in F. Now if x and y are 
two distinct points of X \ V, then we can find disjoint open neighbourhoods M, 
N of x and y; and then VU M and V UN are open sets in F whose intersection 
is not in F, contradicting the fact that F is a filter. So X \ V is a singleton {z}, 
from which it follows easily that F = {U | x € U}. o 


We remark that the use of complementation in the above proof is unavoidable; 
unlike the other results proved so far in this chapter, 1.2.4 is not constructively 
valid. We shall give a counterexample in 1.6.8 below. 

There is no implication in either direction between sobriety and the 7) axiom. 
If X is an infinite set equipped with the cofinite topology, then the family of all 
nonempty open subsets of X is a completely prime filter which is not in the image 
of nx; so X is not sober. And the Sierpiński space S is easily seen to be sober but 
not Tı. (It is also of interest to note that the space K A (with topology L(K A°?)) 
considered in the proof of 1.1.4 is not sober if A is infinite: the completely prime 
filters of L(K A°?) correspond bijectively to arbitrary subsets of A, not just to 
finite ones - and in fact the sobrification of KA turns out to be homeomorphic 
to S4.) 

Despite the foregoing examples, it seems to be the case that one does not 
lose any interesting examples of topological spaces by restricting one’s attention 
to sober spaces. (Essentially, this is because most of the interesting topological 
properties of a space — such as compactness and connectedness — reside in its 
open-set lattice; and we can always replace a non-sober space by a sober one 
having the same open-set lattice, namely its sobrification.) So from now on, 
unless we specifically say otherwise, all spaces will be assumed to be sober; and 
we shall drop the use of the notation X, — a (sober) space will be identified 
with the spatial locale to which it corresponds. (Thus the spatial coreflection of 
a locale Y will simply be denoted by Yp, not Y,.; and if we ever do encounter 
a non-sober space we shall denote its sobrification by X, rather than Xj». The 
point is that it really doesn’t matter, for a lot of the things we wish to do, 
whether we consider a particular object to be a sober space or a spatial locale.) 

In passing from spaces to locales, we have thus sacrificed the ‘pathological’ 
non-sober spaces, and gained the non-spatial locales. The reader may be forgiven 
for wondering whether this is a genuine gain: are not the non-spatial locales (of 
which, be it noted, we haven’t given any examples yet) just as pathological as the 
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non-sober spaces? Before we can give a straight answer to this question (cf. 1.2.8 
and 1.2.9 below), we need to consider sublocales, i.e. regular monomorphisms 
in Loc. 

We define a sublocale of a locale X to be a regular subobject of X in Loc; i.e. 
it is the locale corresponding to a regular quotient of O(X). We write Sub(X) 
for the lattice of sublocales of X; note that the natural ordering on it (defined as 
in Section A1.3) corresponds to that on fixsets in O(X), so as a lattice Sub(X) 
is isomorphic to N(O(X))°?. (This notation conflicts with our usual practice, 
introduced in Section A1.3, whereby Sub(X) denotes the preorder of all subob- 
jects of X; but we shall not have occasion to consider this latter concept in the 
category of locales — one reason is that Loc is not well-powered, as we saw in 
1.1.21. We shall also use the phrase ‘image factorization’ in Loc in the opposite 
of our usual sense, introduced in Section A1.3: that is, the image factorization of 
a continuous map of locales is its factorization into an epimorphism followed by 
a regular monomorphism, corresponding to the surjection—injection factorization 
of the corresponding frame homomorphism. Again, this nonstandard usage may 
be justified by the fact that Loc is not regular; in fact its regular epimorphisms 
are not even stable under composition — see [980].) 

We note also that if Y is a subspace of a space X, then the definition of 
subspace topology implies that i~!: O(X) — O(Y) is a surjective frame homo- 
morphism (where 2: Y — X is the inclusion); so the sublocales of a space include 
all its subspaces. Incidentally, we should mention that not all subspaces of a sober 
space need be sober; but we have 


Lemma 1.2.5 Let X be a (not necessarily sober) space. Then there is a closure 
operation Y ++ Y on subsets of X, such that for any Y, Y is the largest subspace 
for which the inclusion Y — Y induces a bijection of open-set lattices. If X is 
sober, then Y is (homeomorphic to) the sobrification of Y, for any Y C X. 


Proof We recall that a subset of X is said to be locally closed if it is express- 
ible as the intersection of an open and a closed subset. We define Y (which is 
sometimes called the subclosure of Y ) to be the set of all points x such that every 
locally closed set containing x meets Y. 

Now if U and V are open sets in X having the same intersection with Y, then 
they have the same intersection with Y; for the locally closed sets U \ V and 
V \ U do not meet Y, and hence cannot meet Y. Thus O(Y) = O(Y). On the 
other hand, if x is a point of X \ Ý, then it belongs to the difference of two open 
sets having the same intersection with Y, and so cannot belong to any subspace 
having the same open-set lattice as Y. So Y is the largest subspace of X with 
this property. 

Now suppose X is sober, and let F be a completely prime filter of open 
sets in Y. Then G = {U € O(X) | UNY e F} is a completely prime filter 
in O(X), and so defines a unique point x of X such that U € G iff x e U. 
Now if C = UN (X \ V) is a locally closed set containing x, then U € G but 
V gG, sS0UNY ZVNY, ie. CNY £0. Sox eY. Conversely, if x € Y, 
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then {U € O(Y) | x € U} is a completely prime filter in O(Y), and so the 
intersections of these sets with Y form a completely prime filter in O(Y). Thus 
we have set up a bijection between the points of Y and those of Y,; it is easy to 
see that it must be a homeomorphism. o 


A space X is said to be a Tp-space if every singleton subset of X is locally 
closed. For such spaces, it is clear that every subspace Y is subclosed (i.e. satisfies 
Y = Y), and so every subspace of a sober Tp-space is sober. Note also that open 
subspaces and closed subspaces of sober spaces are always sober. 

Reverting to our main theme, we now know that if X is a (sober) space 
then Sub(X) contains a copy of the poset of subclosed subsets of X. (However, 
they do not form a sublattice of Sub(X): unions of spatial sublocales are spatial, 
but we shall see that intersections need not be.) Next, we revisit the first three 
examples of nuclei studied in 1.1.16, to indicate our localic notation for them. 


Examples 1.2.6 (a) Let X be a locale, U € O(X). Then we have a 
sublocale of X corresponding to the open nucleus o(U). We denote this 
sublocale by U itself; in other words, we identify O(X) with the poset of 
open sublocales of X. This identification is ‘harmless’ because the mapping 
0: O(X) — N(O(X)) S Sub(X)°? sends finite meets to joins and arbitrary joins 
to meets, using 1.1.19 and the fact that open nuclei are complementary to closed 
ones. In keeping with it, we shall henceforth tend to denote the top and bottom 
elements of O(X) by X and @ respectively, rather than 1 and 0; and we shall tend 
to use |] and N, rather than \/ and A, to denote joins and finite meets in O(X). 
Note that if X is a space, and U is an open subset of X, we may identify the 
sublocale U with the space obtained by equipping U with the subspace topology, 
since the open-set lattice of the latter may be identified with the principal ideal 
| (U) in O(X). In particular, an open sublocale of a spatial locale is spatial. 

(b) For the closed sublocale of X corresponding to the nucleus c(U) on O(X), 
we shall use the notation CU. Once again, if X is a space and U an open subset 
of X, we may identify this sublocale with the space obtained by equipping the 
closed set X \ U with the subspace topology; so a closed sublocale of a spatial 
locale is spatial. (However, the use of complementation is an indication that this 
result, unlike that cited at the end of (a), is not constructively valid. Indeed, the 
assertion ‘every closed sublocale of a discrete locale is spatial’ implies the law of 
excluded middle: for in a discrete locale (that is, one corresponding to a frame of 
the form PX) every point is an open sublocale, and hence every spatial sublocale 
(being the union of its points) is open. The quoted assertion thus implies that 
the lattice of open sublocales of any discrete locale (that is, PX for any X) is 
Boolean.) 

(c) Of course, we call a sublocale of X dense if it corresponds to a dense 
nucleus on O(X). It is again easy to see that any dense subspace of a space 
X gives rise to a dense sublocale of X; but in general a space will have many 
more dense sublocales than dense subspaces. In particular, we write X, for the 
smallest dense sublocale of X, corresponding to the nucleus => (‘b stands for 
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‘Boolean’: recall that O(X,) = (O(X))— is a Boolean algebra). It is rarely 
spatial: for example, if X is a sober Tp-space without isolated points (i.e. such 
that {x} is not open for any x € X), then the subspaces X \ {x} are all dense, 
so X+ is contained in their intersection, and thus cannot have any points. 


The fact that intersections of dense sublocales are dense is thus the key 
difference between the categories Sob and Loc. In fact, Loc is in a sense the 
minimal modification of Sob which makes this result possible: 


Lemma 1.2.7 Assuming classical logic, any locale is expressible as an inter- 
section of dense spatial sublocales of a spatial locale. 


Proof We saw in 1.1.4 that free frames are spatial locales; and every frame is 
a quotient of the free frame on its underlying set, i.e. any locale is a sublocale 
of a spatial locale. To obtain the stronger statement above, note first that any 
sublocale has a closure: that is, a regular monomorphism Y — X in Loc has a 
factorization Y => Y >=> X, where Y > Y is dense and Y >> X is closed. (If 
j is the nucleus on O(X) corresponding to Y, then Y corresponds to c(j()).) 
Since closed sublocales of spatial locales are spatial, we now know that any locale 
Y is expressible as a dense sublocale of a spatial locale X. Lemma 1.1.17 then 
expresses it as an intersection of sublocales of the form U ULV, which are spatial 
because they are unions of spatial sublocales of X, and dense in X because they 
contain Y. o 


We now give a couple of interesting examples of non-spatial locales, which we 
hope may convince even the classically-minded reader that something has been 
gained by the switch from spaces to locales. 


Example 1.2.8 Let A be a nonempty set, and let X be the set of all functions 
N — A, equipped with the Tychonoff topology where A is regarded as a discrete 
space, i.e. the topology in which the basic open neighbourhoods of a function f 
have the form 


{g E€ X | gln = fln} 


for some n > 0 (where g|n means the restriction of g to {0,1,...,n — 1} CN). 
Note that X is (a Hausdorff space, and therefore) sober. For each a € A, let Xa 
be the subspace {f € X | a € im f}; clearly Xa is dense in X. Let Y = e4 Xa, 
where the intersection is computed in the lattice of sublocales of X; then Y is also 
dense in X. However, Y, is the intersection of the X4 in the lattice of subspaces 
of X; so if A is uncountable then Yp, = 0, and hence Y is not spatial. 
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We may identify O(X) with the Lindenbaum algebra (cf. D1.4.14) of the 
geometric propositional theory of functions N — A, that is the theory over the 
signature with primitive propositions p(n, a) indexed by N x A whose axioms are 


(T F Vaca p(n, @)) for all n, and 
((p(n, a) A p(n,b))F L) for all n, a,b with a Æ b. 


If we add the further axioms 


(TEH Ven P(n, a)) for all a, 


we obtain the geometric theory of surjections N > A, whose Lindenbaum algebra 
is O(Y). The fact that this theory is always consistent, even though it may have 
no models in Set (i.e. Y may have no points) will be of importance in the 
representation theorems of Chapter C5. 


Example 1.2.9 For another example in a similar vein, consider the localic 
translation of 1.1.12: if (Xi | i € I) are the vertices of a cofiltered diagram 
in Loc whose transition maps X, — X; are all epimorphisms, then the legs 
(limj¢,X;) —> X; of the limiting cone are also epic. This is well known to be 
false for spaces (even for discrete spaces): again, let A be an uncountable set, let 
I be the poset of finite subsets of A ordered by reverse inclusion, for each i € I 
let X; be the set of injections 2 — N (with the discrete topology), and for 7 D1 
let X, — X; be the restriction map. Then a point of the inverse limit lim j¢;X; 
in Sp would be an injection A — N, so no such points exist; but the inverse 
limit in Loc is nontrivial, since it maps epimorphically to each X;. 

Once again, we may think of the frame corresponding to this locale as a 
Lindenbaum algebra: this time for the theory of injections A — N, which we 
may present using the same signature as the theory of 1.2.8, with the second and 
third groups of axioms displayed there, plus 


((p(m, d) A p(n,a)) F L) forall (m,n, a) with m £n. 


The reader may be prompted to ask at this point whether the theory of bijec- 
tions A — N (i.e. the propositional theory with all four groups of axioms from 
1.2.8 and 1.2.9) also has a nontrivial Lindenbaum algebra when A is uncount- 
able. The answer is yes, but it seems harder to give a direct locale-theoretic 
proof of this with the tools currently at our disposal. (One possible proof runs 
as follows: since the theory of injections A — N is consistent by 1.2.9, it has a 
model in a non-degenerate Boolean topos B by D3.1.16; but in a Boolean topos 
the Cantor—Bernstein theorem is valid by D4.1.11, and so we can construct a 
bijection A — N in B.) We shall also encounter the intersection of the theories 
of 1.2.8 and 1.2.9 in 3.3.11(c) below (and see also D4.1.9). 
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Restating Lemma 1.1.18(iii) in localic terms, we have 


Lemma 1.2.10 Let f: X —Y be a continuous map of locales, U € O(Y). 
Then there are pullback squares 


fs and C(f*(U)) ——> Cu 


nee 


in Loc. o 


The ‘open’ half of 1.2.10 tells us that there is no ambiguity, given our identi- 
fication of elements of O(X) with open sublocales of X, in using f* both for the 
frame homomorphism corresponding to a continuous map f: X — Y of locales 
and (as usual) for the effect of pulling back sublocales of Y along f. We shall 
write fı: Sub(X) — Sub(Y) for the left adjoint of f*, i.e. the map which sends 
a sublocale S of X to the image of the composite S — X — Y. (In general, fı 
does not restrict to a map O(X) — O(Y); we shall investigate the circumstances 
in which it does so in Section C3.1.) 

Although sublocales are the ‘natural’ localic generalization of subspaces, the 
fact that they fail to form a Boolean algebra is sometimes inconvenient. For some 
purposes, it is useful to restrict our attention to the Boolean algebra Sub(X)° 
of complemented sublocales of X: note that this contains all open or closed 
sublocales of X. (However, it is rarely complete as a lattice.) For future reference, 
we note a couple of results concerning complemented sublocales. 


Lemma 1.2.11 Let f: X — Y be an epimorphism in Loc. Then every com- 
plemented sublocale S of Y satisfies S = fif*(S). In particular, f* is injective 
on complemented sublocales. 


Proof Let T be the complement of S in Sub(Y). The maps f* and fı both 
preserve finite joins: f, because it is a left adjoint, and f* because it is a pene 
homomorphism (by 1.1.19). So fif*(S)U fif*(T) = fif*(Y) = f(X) = Y (the 
last step using the fact that f is epic). But from the adjunction we have fif*(S) < 
S and f,f*(T) <T, so 
S= SN (FF (SUA (T)) 

=(SN fif US) USN AFT) 

< fif*(S)U(SNT) 

= fif"(S)uUd. 


The second assertion is immediate from the first. E) 
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Example 1.2.12 To show that the result of 1.2.11 fails for arbitrary sublo- 
cales, let Y be a sober Tp-space without isolated points, let X be the same 
set retopologized with the discrete topology, and let f: X — Y be the identity 
map. Then clearly f~!: O(Y) — O(X) is injective, i.e. f is an epimorphism in 
Loc; but the pullback of Y, along f is the empty sublocale of X, since it has no 
points and every sublocale of X is (open, by 1.1.20(ii), and hence) spatial. In 
particular, we note from this example that epimorphisms in Loc are not stable 
under pullback. 


Lemma 1.2.13 Let Y be a spatial locale, S a complemented sublocale of Y. 
Then (assuming classical logic) 


(i) S is spatial. 
(ii) For any morphism f: X — Y in Sob, the pullback of S along f in Loc 
coincides with the pullback in Sob. 


Proof (i) Again, let T denote the complement of S in Sub(Y). Since the 
terminal locale 1 has only two sublocales 1 and (this is where we use classical 
logic!), it is clear that any point of Y defines an atom in Sub(Y), and hence must 
factor through either S — Y or T — Y. So, writing S, for the spatial part of S 
as before, we have Sp UT, > Yp = Y; but Sp < S and Tp < T, whence Sp = S 
follows as in the proof of 1.2.11. 

(ii) Clearly, a pullback of a complemented sublocale is complemented. So this 
follows immediately from (i) and the fact that the ‘spatial part’ functor X > Xp, 
being a right adjoint, preserves pullbacks. o 


Not every spatial sublocale of a spatial locale is complemented. If X is a 
(sober) space and S a (subclosed) subspace such that both S and X \ S are 
dense in X (for example, if X = R and S = Q), then any sublocale T of X 
satisfying SUT = X must contain all the points of X \ S; so both S and T are 
dense and thus SN T > X+. Hence S does not have a complement in Sub(X). 
In fact it can be shown that a subclosed S defines a complemented sublocale of 
X iff there does not exist a nonempty subset Y C X such that both SN Y and 
Y \ S are dense in Y; see [484]. 

If X is a locale, we write X4 for the locale defined by O(X4) = N(O(X)), and 
call it the dissolution of X. Translating 1.1.19 and 1.1.20 into localic language, 
we see that X + Xa is a functor Loc — Loc, and that there is a natural 
transformation dy: Xa — X which is both monic and epic, and has the property 
that pullback along dx induces a bijection between arbitrary sublocales of X and 
closed sublocales of X4. (Thus one dissolves a locale by declaring everything to 
be closed.) If we assume classical logic (so that O(1) is Boolean), then dx induces 
a bijection between the points of X4 and those of X; indeed, for a space X, the 
spatial part Xap of Xa is simply X retopologized by declaring all subclosed sets 
to be closed (in particular if X is a Tp-space, this is just the discrete topology 
on the underlying set of X). However, X4 is rarely spatial, and the construction 
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X + Xq is rarely idempotent. We note, on the other hand, that the morphism 
dx: Xa — X is stably epic in Loc (that is, all its pullbacks are epic), in contrast 
to 1.2.12; this follows easily from 1.1.22(iii), since the bottom edge of the frame 
pushout displayed there is a monomorphism in Frm. 

Next, we translate the elementary properties of fibrewise dense and fibrewise 
closed nuclei into localic terms. Of course, given a locale map f: Y — X and 
a sublocale Y’ of Y, we say Y’ — Y is fibrewise dense (resp. fibrewise closed) 
over X if the corresponding nucleus on O(Y) is fibrewise dense (resp. fibrewise 
closed) relative to O(X). 


Lemma 1.2.14 

(i) Suppose given locale maps Y' — Y — X — X' of which the first is an 
inclusion. If Y' — Y is fibrewise dense over X (resp. fibrewise closed over 
X'), then it is fibrewise dense over X’ (resp. fibrewise closed over X). 
The converse implications hold if X — X’ is an inclusion. 

(ii) Ifi: Y'= Y is fibrewise dense over X and f: Y — X is an epimorphism, 
then the composite fi: Y' — X is an epimorphism. 

(iii) Arbitrary intersections of fibrewise dense sublocales (over a given base 
locale X ) are fibrewise dense. 


(iv) Arbitrary intersections, and finite unions, of fibrewise closed sublocales 
are fibrewise closed. 


(v) Fibrewise closedness is stable under pullback; that is, given a pullback 
square 
Z ———> Y' 


Z —————— Y 


of locales over X where the vertical morphisms are inclusions, then Z' — 
Z is fibrewise closed over X if Y'= Y is. 


(vi) Fibrewise closure is functorial; that is, given a commutative square as 


in the statement of (iv), the composite Dl A ey factors through 


we x 
Y“ = Y, where (—) denotes fibrewise closure over X. 


(vii) Fibrewise closedness is stable under change of base; that is, given a locale 
morphism X' > X and a fibrewise closed inclusion Y’ — Y in Loc/X, 
the induced morphism Y’ xx X' > Y xx X' is fibrewise closed over X’. 


Proof The ‘forward’ implications in (i) are immediate from the definitions of 
fibrewise dense and fibrewise closed nuclei; the reverse ones from the fact that 
they depend only on the image of the frame homomorphism O(X) > O(Y). For 
(ii), we use the fact that 7,(f2)* = 7.7*f* = f* if i is fibrewise dense; so (fi)* 
must be injective if f* is. 
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(iii) is a translation of 1.1.22(i), and (iv) is a translation of the fact, included 
in 1.1.22(iii), that the fibrewise closed nuclei form a subframe of the frame of all 
nuclei. (v) also follows from 1.1.22(iii), since pulling back sublocales of Y along 
g: Z = Y corresponds to applying the frame homomorphism N(g*) of 1.1.19 to 
nuclei on O(Y), and it is clear from the pushout diagram in the statement of 
1.1.22(iii) that this operation preserves fibrewise closed nuclei. 

(vi) is now immediate from (v), since the pullback along Z — Y of the 
fibrewise closure of Y’ — Y is fibrewise closed in Z and contains Z’. Finally, for 
(vii), we note that it follows from (v) that Y’ xx X’ > Y x x X’ is fibrewise 
closed over X; hence by (i) it is fibrewise closed over X’. O 


Unfortunately, the analogue of 1.2.14(vii) for fibrewise denseness is false in 
general; we shall see a counterexample shortly. However, it is true in the case 
when the morphism Y — X is open, as we shall see in 3.1.14 below. We note an 
alternative characterization of fibrewise denseness: 


Lemma 1.2.15 Let f: Y — X be a locale morphism, and Y' — Y a sublocale 
of Y. The following are equivalent: 


(i) Y'= Y is fibrewise dense over X. 


(ii) For each closed sublocale X’ — X, the pullback of Y' — Y along X’ — X 
is a dense inclusion. 


(iii) For each locally closed sublocale X’ — X, the pullback of Y' — Y along 
X' > X is a dense inclusion. 


Proof Since denseness (in the usual sense) is by definition stable under pull- 
back along open inclusions, the equivalence of (ii) and (iii) is immediate. For (i) 
& (ii), we note that if we pull back Y and Y’ along a closed inclusion CU — X, 
we obtain the sublocales Cf*(U) and Cj(f*(U)) respectively (where j is the 
nucleus on O(Y) corresponding to the sublocale Y’), and so the assertion that j 
fixes f*(U) is exactly the assertion that the second of these is a dense sublocale 
of the first. O 


Corollary 1.2.16 ‘Assuming classical logic, if X is a (sober) Tp-space, then 
an inclusion of spaces over X is fibrewise dense iff its pullback over every point 
of X is a dense inclusion. 


Proof One direction is immediate from 1.2.15 (and the fact that the points 
of a Tp-space, being locally closed, are complemented as sublocales, so that by 
1.2.13 we do not have to worry about the difference between localic and spatial 
pullbacks). For the converse, we use the fact that every closed sublocale of X is 
spatial (1.2.6(b)), and hence the union of its points. O 


The analogue of 1.2.16 for fibrewise closedness is also true, by 1.2.14(vi). 
Hence, given f: Y — X in Sob, where X is a Tp-space, and a (subclosed) 
subspace Y’ of Y, we form its fibrewise closure over X by forming the union, 
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over all points x € X, of the closure of Y’ N f~'(z) in the fibre f~!(x). This 
explains the use of the word ‘fibrewise’ for these notions. 

However, the Tp assumption cannot be omitted from 1.2.16. Let Y be the set 
of real numbers with the Euclidean topology, let X be the same set topologized so 
that the only nonempty open sets are the cofinite sets containing 0, let f: Y > X 
be the identity mapping, and let Y’ = Y \ {0}. Then it is easily seen using the 
criterion of 1.2.15(ii) that Y’ — Y is fibrewise dense over X; but its pullback 
over the non-locally-closed point 0 of X is not dense. 

We conclude this section with some remarks on regularity and the Hausdorff 
property for locales. Classically, a space X is regular iff every open set can be 
expressed as a union of open sets whose closures it contains; correspondingly, we 
define a locale to be regular if every U € O(X) satisfies U = U{V € O(X) | 
V aU}, where V dU is an abbreviation for ‘there exists W € O(X) such that 
VOW =f and UUW = X (equivalently, such that V < CW < U in Sub(X))’. 
We say X is Hausdorff if the diagonal X — X x X is a closed sublocale: 
because locale products do not coincide with space products in general, this is 
actually a stronger condition for spaces than the usual Hausdorff axiom (that is, 
there exist Hausdorff spaces which are not Hausdorff as locales). For this reason, 
Hausdorff locales (in our sense) have been called ‘strongly Hausdorff locales’ 
by some authors (e.g. (476, 520]); however, we now prefer the unadorned name 
‘Hausdorff’ for this property. There is no totally satisfactory analogue of the Tı 
property for locales (see (886, 1057]), but the following condition is sometimes 
useful: we say X is a Ty-locale (short for ‘totally unordered’) if, for any two 
locale maps f,g: Y 3 X, the relation f < g (in the partial ordering of 1.1.6) 
implies f = g. For a (sober) space X, this clearly implies that the points of X 
are discretely ordered in the ‘specialization’ ordering (x < y iff x is in the closure 
of {y}), and hence that X is a T\-space, but once again the locale-theoretic 
condition is stronger (see [886]). 

We stress that, in the definition of Hausdorff locale, the word ‘closed’ is 
used in its original strong sense, not in the weak ‘fibrewise’ sense of 1.1.22. 
Constructively, there are weak ‘fibrewise’ versions of all the basic separation 
axioms (see [533]), but we shall not need to consider them here. 


Lemma 1.2.17 
(i) Regular locales are Hausdorff. 


(ii) In a regular locale, every sublocale is an intersection of open sublocales. 


(iii) Hausdorff locales are Ty. 


Proof (i) Suppose X is regular; let (a,b): X — X x X be the closure of the 
diagonal sublocale. We must show that the two projections a and 6 are equal. 
Now if U,V are any two opens in X such that V a U, then since the open 
sublocale =V x V of X x X (where ~V is the complement of the closure of V, 
i.e. the Heyting negation of V in O(X)) is disjoint from the diagonal it is also 
disjoint from X; that is, a*(~V)Nb*(V) = 0 in O(X). But -V UU is the whole of 
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X , so a* (=V) Ua*(U) is the whole of X. It follows that we have b*(V) < a*(U), 
for all such pairs (U,V). But U = U{V € O(X) | V <U}, and b* preserves this 
union, so 6*(U) < a*(U). By symmetry, we also have a*(U) < b*(U); soa = b, 
as required. : 

(ii) It suffices by 1.1.17 to show that every closed sublocale is an intersection 
of open sublocales. But this is immediate from the definition, since for any 
U € O(X) we have 


CU < MV |V sv} 
< MEV | V su} 
=u. 


(iii) We claim that if f < g then (f,g): Y —> X x X factors through the 
closure of the diagonal. For if U x V is any open rectangle disjoint from the 
diagonal we have (f,g)*(U x V) = fF*U A g*V < gUNg*V = (UNV) = 89. 
Hence if the diagonal is closed we have f = g. o 


At this point we shall break off our study of locale theory, in order to begin 
the development of sheaf theory. However, further parts of the theory of locales 
will be developed as they are required from time to time in the remaining sections 
of this chapter, and also in Chapters C3-C5. For a more connected account of 
the subject, see [520]. 


Suggestions for further reading: Fourman & Grayson [363], Isbell [476, 482, 
484], Johnstone [520, 528, 533], Joyal & Tierney [560], Kock & Plewe [639], 
Plewe (980, 981], Sambin [1078, 1079], Valentini (1183, 1184], Vickers [1205]. 


C1.3 Sheaves, local homeomorphisms and frame-valued sets 


There are three, substantially different, ways of defining the category of sheaves 
on a space or locale. All three of them were introduced (for spaces, at least) at 
different points in Part A, and they were shown to be toposes. In this section, 
we shall recall the three definitions (generalizing them to locales in the process) 
and give a direct proof that they yield equivalent categories. 

The first and most familiar, originally introduced in A2.1.8, regards a sheaf 
as a special sort of presheaf. 


Definition 1.3.1 (a) A presheaf (of sets) on a locale X is a functor O(.X)°? — 
Set. Thus a presheaf F assigns to each open U > X a set F(U) (whose elements 
are commonly called sections of F' over U), and to each inclusion V < U of open 
sublocales a restriction map F(U) > F(V), subject to the obvious compatibility 
properties. 

(6) A presheaf F on X is said to be a sheaf if, given any family of sections 
(si E€ F(U;) | i € I) which are compatible in the sense that, for each pair (i,j), the 
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restrictions of s; and sj are equal in F(U,;U;), there is a unique s € F(U,<; Ui) 
whose restriction to F'(U;) equals s;, for alli € J. Informally, ‘compatible families 
of sections can be uniquely patched together’. 

(c) A morphism of sheaves f: F — G is simply a natural transformation of 
functors. We write Sh(X) for the category of sheaves on X. 


The second definition requires consideration of local homeomorphisms 
between locales. Of course, we say a continuous map p: E — X of locales is 
a local homeomorphism if E can be covered by open sublocales U for which the 
composite U — E — X is (isomorphic to) the inclusion of an open sublocale of 
X. We shall need the following elementary facts about local homeomorphisms: 


Lemma 1.3.2 
(i) A composite of local homeomorphisms is a local homeomorphism. 


(ii) A pullback of a local homeomorphism is a local homeomorphism. 


(iii) Given a commutative triangle of continuous maps 


X 


if p and q are local homeomorphisms then so is f. 


(iv) The inclusion functor from the category of locales and local homeomor- 
phisms to Loc creates pullbacks. 


(v) If X is a spatial locale, then so is the domain of any local homeomorphism 
p:E—>X. 


Proof (i) Let 


q 
fe ey 


be a composable pair of local homeomorphisms. Suppose that U € O(E) and 
V €e O(X) are such that the composites U — E — X and V — X — Y are 
isomorphic to the inclusions of opens U € O(X), V € O(Y) respectively. Then 
it is easy to see that the composite U N p*(V) — E — X is isomorphic to 
UNV = X, and that this in turn composes with q to yield an open sublocale of 
Y. But since p* preserves unions of open sublocales and the injinite distributive 
law holds in O(£), E can be covered by open sublocales of the form U N p*(V). 
Thus gp is a local homeomorphism. 
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(ii) Let 
g 


E ——>F 
f 
X —— Y 
be a pullback square in Loc, where q is a local homeomorphism. For any U € 


O(F), the square 
g*(U) ——> U 


g 
E ——> F 


is also a pullback by 1.2.10, and hence so is its composite with the square above; 
thus if U is such that U — F — Y is an open inclusion, then so is g*(U) — 
E — X. Since g* preserves unions of open sublocales, the result follows. 

(iii) It is clear that if E can be covered by open sublocales which map by 
local homeomorphisms to F, then E — F is itself a local homeomorphism; so 
we may reduce to the case where p is the inclusion of an open sublocale U — X. 
Now if V € O(F) is such that V — F — X is the inclusion of an open sublocale 
V, then f *(V) may be identified with the open sublocale U N V of U, and the 
composite UM V >= U — F with the inclusion q*(U) NV — F. So, once again, 
the result follows from the fact that f* preserves unions of open sublocales. 

(iv) From (ii), we know that if p: E — X and q: F > X are local homeomor- 
phisms, then so are the projections from the pullback E x x F' in Loc. This does 
not suffice to show that it is a pullback in the subcategory; we need to show that 
if f: Z — E and g: Z — F are local homeomorphisms satisfying pf = qg, then 
the induced map (f,g): Z > E xx F is also a local homeomorphism. But this 
follows from (iii), since (by (i)) both Z and E x x F admit local homeomorphisms 
into X. 

(v) If X is spatial, then so is any open sublocale of X (1.2.6(a)); hence the 
assertion that p: E — X is alocal homeomorphism implies that E can be covered 
by spatial (open) sublocales, and is thus spatial. ad 


In this chapter, we shall write LH for the category of locales and local home- 
omorphisms between them. In Part A, we used this name for the category of 
spaces and local homeomorphisms, but the difference is not a substantial one: 
although we have ‘enlarged’ the category LH, Lemma 1.3.2(v) tells us that we 
have not changed the category LH/X when X is a space. 1.3.2(iii) says that 
LH/X is a full subcategory of Loc/X, for any X; and 1.3.2(iv), plus the (obvi- 
ous) fact that identity morphisms are local homeomorphisms, says that it is 
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closed under finite limits. Our second definition of the category of sheaves on X 
is simply as the slice category LH/X (cf. Al.2.7). 

We remark in passing that local homeomorphisms are often known as ‘étale 
maps’; this is based on a mistranslation of the French term ‘espace étalé’ (lit- 
erally, ‘displayed space’) for (the domain of) the local homeomorphism over X 
corresponding to a given sheaf on X under the equivalence of 1.3.11 below, and 
does not derive from the ‘étale morphisms’ of algebraic geometry. For this reason, 
we shall avoid using the term as far as possible. 

The third definition of the category of sheaves is as the category of O(X)- 
valued sets, which was introduced at the end of Section A3.3 as the category 
of maps in a certain allegory Mat(O(X))[S]. We give a more explicit definition 
here: 


Definition 1.3.3 (a) An O(X)-valued set is a pair (A, €) where A is a set and 
€: Ax A O(X) is a function satisfying 


€(@1, a2) = e(a2,a1) 


and 
€(@1, 22) N €(G2, a3) < €(a), a3) 


for all a), a2,a3 € A. Thus € is a ‘symmetric and transitive O(X )-valued relation 
on A’. We do not require € to be reflexive; we sometimes abbreviate e(a, a) to 
e(a), and call it the extent of the element a. 

(b) A map of O(X)-valued sets $:(A,6€) > (B,5) is a function 
$: Bx A> O(X) satisfying 


(b, a) < 6(b) N e(a), 
(bi, a1) N ô(bı, ba) N e(a, a2) < (ba, a2), 
$(b1, a) b(b2, a) < 4(b1, b2), 


and 
e(a) <|_J{o(6, a) | b € B} 


for all a,a,,a2 € A and all 6,b,,b2 € B. Informally, ¢ is an ‘O(X)-valued 
functional relation from (A, €) to (B,4)’. 

(c) Given maps ¢: (A,e) > (B,6) and 4: (B,d) > (C,y), the composite 
wo: (A, €) > (C, y) is defined by 


w(c,a) =(J{w(c,b) N $(6,a) |b € B}. 


It is straightforward to verify that yọ is a map of O(X)-valued sets (i.e. that 
it satisfies the conditions in (b)), and that composition is associative. Further, € 
itself acts as an identity map (A, <€) — (A, €) for any O(X)-valued set (A, €); so 
we have a category of O(X)-valued sets, which we denote by Set(O(X)). 


506 SHEAVES ON A LOCALE C1.3 


We now embark on proving the equivalence of the categories Sh(X), LH/X 
and Set(O(X)), for any locale X. First we define a functor T: Loc/X — Sh(X): 
given a locale p: E — X over X, ['(p) is the presheaf whose sections over U € 
O(X) are the continuous sections of p over U, i.e. the commutative triangles 


E 
va ha 
U > X 


in Loc. To verify that T (p) is a sheaf, we need to show that open coverings are 
colimits in Loc, i.e. 


Lemma 1.3.4 Let X be a locale, and (U; | i € I) a family of open sublocales 
of X with union U. Then the inclusion maps U; — U form a colimit cone for 
the diagram of locales whose vertices are all the U; and the U; Ñ Uj, and whose 
edges are the inclusions U; Ñ Uj — U; and U; NU; = Uj. 


Proof We recall that colimits in Loc are just limits in Frm, and that the 
latter are created by the forgetful functor Frm — Set. So this is easy to verify, 
bearing in mind that O(U) may be identified with the principal ideal | (U) in 
O(X), and the frame map O(U) — O(U;) corresponding to the inclusion is 
identified with (—) A Uj. o 


It follows that a compatible family of continuous maps s;: U; > E may be 
uniquely patched together to a continuous map s: U — E; and if the s; are all 
sections of p, then so is s, by the uniqueness of maps out of colimits. Thus ['(p) 
is a sheaf, as claimed. Also, it is clear that if 


E ——— F 


X 


is a map of locales over X, then composing sections of p with f yields a morphism 
of sheaves I (f): P(p) — I'(q). Thus T becomes a functor Loc/X — Sh(X). (Its 
restriction to the subcategory LH/X will be part of the equivalence which we 
seek.) 

Next, we define a functor ©: [O(X)°?, Set] + Set(O(X)). Given two sections 
s E€ F(U), t € F(V) of a presheaf F on X, we let €-(s,t) denote the union of all 
open W < U N V such that the restrictions of s and t to W are equal. (If F is 
a sheaf, then we may alternatively define e-(s,t) as the largest open sublocale 
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on which the restrictions of s and ¢ are equal, because we can patch together 
their restrictions to all the open sublocales where they agree.) In particular, 
€r(s) = erp(s,s) is just the open U such that s € F(U). It is now easy to 
see that the disjoint union O(F) = [[yeocx) F(U), structured with the map 
ef: O(F) x O(F) > O(X), is an O(X)-valued set, as defined in 1.3.3(a). Given 
a morphism of presheaves f: F — G, we define O( f): O(G) x O(F) > O(X) b 


O(f)(t, s) = ea(t, f(s) - 


Again, it is straightforward to verify that O(f) is a map of O(X)-valued 
sets; the key observation is that we have €r(s1,S82) < eg(f(s1), f(s2)) for any 
$1, 8g E O(F). And if g: G — H is another morphism of presheaves, then 


O(g)O(F)(u, s) =|_J{en (u, g(t) A ealt, f(s)) | t € O(G)} 


> ey (u, gf(8s)) Nec(f(s), f(s) 
= ey (u, 9f(s)) N eu (9f(s), 9f(s)) 
= €H(u, gf (s)) 


and conversely 


eH (u, g(t)) Nealt, f(s) < en (u, 9(t)) O en (g(t), 9f(s)) < en (u, 9f(8)) 
for any t; so O(g)O(f) = O(g f), and we have proved 
Lemma 1.3.5 © is a functor [O(X)®P, Set] — Set(O(X)). o 


The second of our circle of three functors will be the restriction of O to 
the subcategory Sh(X). Before we complete the circle by defining a functor 
Set(O(X)) — LH/X, we pause to note that the O(X)-valued sets which are 
the images of sheaves under the functor © are of a rather special type. Given an 
O(X)-set (A,€), we define a singleton of (A,€) to be a mapping a: A > O(X) 
satisfying 

(i) o(a) Ne(a,b) < o(b) and 

(ii) o(a)N0(b) < e(a,b) 
for all a,b € A. For any a € A, the mapping b + €(a, b) is easily seen to be a 
singleton, which we denote by a. 


Definition 1.3.6 We say an O(X)-set (A, €) is complete if every singleton of 
(A, €) is of the form a for a unique a € A. 


Lemma 1.3.7 For any sheaf F on X, the O(X)-valued set O(F) is complete. 


Proof Let o be a singleton of O(F), and consider the family of all sections 
si € F(U;) such that o(s;) = U;. From the definition of a singleton, it is clear 
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that each F(U;) contains at most one such element, and that the set of all such 
elements is closed under restriction; so they form a compatible family. Now it 
is straightforward to verify that ø = § iff s is the unique section obtained by 
patching the s; together. O 


In the converse direction, if a € A and U < e(a), then the function b + 
e(a, b)}QU is readily seen to be a singleton; so the assertion that (A, €) is complete 
implies that for every such pair (a, U) there exists a unique aly € A with e(a|y) = 
e(a,a\y) = U. Thus the assignment U +> {a € A | e(a) = U} can be made into 
a presheaf F on X. And this presheaf is a sheaf, because if (a; € F(U,) |i € I) 
is a compatible family of sections of it, then the function b +> U;<, e(a, b) is 
again a singleton, and it is of the form a iff a is a patching of the a;. Thus the 
O(X )-valued sets which occur in the image of © are precisely the complete ones. 

Complete O(X)-valued sets are easier to work with than arbitrary ones, 
because maps between them have a particularly simple description: 


Lemma 1.3.8 Let ¢: (A,€) > (B,46) be a map of O(X)-valued sets, and sup- 
pose (B,6) is complete. Then there is a unique function f: A— B such that 
o(b,a) = 6(b, f(a)) for alla € A, b € B. Moreover, we have ô( f(ai), f(a2)) > 
€(a),@2) for all a1,a2 E A, with equality if a, = a2. 


Proof For each a € A, the mapping b+ ¢(b,a) is a singleton of (B, ô), so we 
can write it as f(a) for a unique f(a). This defines the function f. Now we have 


= LU 60, f(a)) = U (6,4) = e(a) 
bEB bE B 


by one of the identities in the definition of a map of O(X)-valued sets, which 
proves the last assertion of the lemma; and 


6(f(a1), F(a2)) = (f (a1), a2) 
( 


(f(a), a1) N e(a, a2) 


= II 


f(ai)) A e(a, a2) 


ô( 
€(a1) N e(a, a2) = e(a, a2) 


which proves the penultimate one. o 


It is also easy to verify that composition of maps between complete O(X)- 
valued sets, as defined in 1.3.3(c), corresponds to ordinary composition of 
the representing functions as defined in 1.3.8; so the full subcategory of com- 
plete O(X)-valued sets becomes a concrete category. Moreover, we do not lose 
anything by restricting our attention to complete O( X)-valued sets: 


Lemma 1.3.9 For any O(X)-valued set (A,€), there is a complete O(X)- 
valued set (A,€) which is isomorphic to it in Set(O(X)). 
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Proof We define A to be the set of all singletons of (A, €), with 


é€(o,7) = LJ (o(a) N 7(a)). 


acA 


It is straightforward to verify that (A, è) is an O(X)-valued set. Moreover, if 
a,b € A, then 


(à, b) > àla) Na(d) 
= e(a) N e(a, b) = e(a, b), 


the reverse inequality being trivial. Now if r is a singleton of (A,é), let 
o: A— O(X) be defined by o(a) = 17(@); then it is easily verified that o is 
a singleton of (A, €), and it is the unique singleton such that r = 6. Thus (A, è) 
is complete. Finally, we define 1: Ax A> O(X) by t(o,a) = o(a); then e sat- 
isfies the conditions of 1.3.3(b) both as a map (A,e) — (A,é) and as a map 
(A, €) — (A,€), and the composites of these two maps either way round are the 
appropriate identity maps. o 


Of course, the function A —> A which corresponds, via 1.3.8, to the isomor- 
phism u: (A,€) > (A, ë) is simply a + a. With a little more effort, one may 
show that the construction (A,«€) +> (A,é) of 1.3.9 is functorial, and that ¿ is 
a natural isomorphism; thus the category Set(O(X)) is equivalent to its full 
subcategory CSet(O(X)) of complete O(X)-valued sets. (Combining this with 
the remarks after 1.3.7, it is now easy to construct an inverse equivalence for 
0: Sh(X) > Set(O(X)).) For the next step of constructing a functor to LH/X, 
we shall take our domain to be CSet(O(X)) rather than Set(O(X)); this is not 
strictly essential, but it does considerably simplify the description of how the 
functor acts on morphisms. 

Given a (complete) O(X )-valued set (A, €), we construct a diagram D(A, €) in 
Loc/X as follows. For the vertices, we take one copy of the inclusion e(a) — X 
for each a € A, and one copy of e(a,b) — X for each pair (a,b). And for each 
(a,b), we take the inclusions e(a, b) — e(a) and e(a,b) — e(b) as edges of the 
diagram. We define A(A, €) to be the colimit of this diagram in Loc/X; for the 
moment, let us denote it by p: E > X. 

We need to prove that p is a local homeomorphism: but for each a € A, 
the ath leg àa: e(a) — E of the colimiting cone is a sublocale inclusion (i.e. a 
regular monomorphism), since its composite with p is an (open) inclusion. And 
these sublocales clearly cover E; so it suffices to prove that they are open. But 
since O(F) is a limit in Frm, we can give an explicit description of it, much 
as in 1.3.4: its elements correspond to families (U, | a € A) of opens in X 
such that Ua < e(a) for each a and U, N e(a,b) = U, N €(a,b) for each (a,b). 
Under this correspondence, (the image of) Aa clearly corresponds to the family 
(e(a, b) |b € A). 
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So far, we have not made any use of completeness of (A, €); but in defining 
the effect of A on morphisms of O(X)-valued sets, it will be convenient to 
assume that they are induced by functions on the underlying sets as in 1.3.8. So 
let @: (A,€) — (B,ô) be a map of O(X)-valued sets, and let f: A— B be the 
function corresponding to it. Then each vertex of the form e(a) in the diagram 
D(A, €) maps by the identity to the vertex 6(f(a)) of D(B,0), and each vertex 
e(a,6) maps by an inclusion to 6(f(a), f(b)); composing these maps with the 
appropriate legs of the colimit cone for D(B, 4), we obtain a cone under D(A, €), 
and hence a continuous map (necessarily a local homeomorphism, by 1.3.2(iii)) 
from the colimit A(A, €) to A(B, ô). It is easy to verify that this construction is 
functorial, and so we have 


Lemma 1.3.10 A is a functor CSet(O(X)) — LH/X. o 


Theorem 1.3.11 For any locale X, the categories Sh( X), LH/X, Set(O(X)) 
and CSet(O(X)) are all equivalent. 


Proof The equivalence of Set(O(X)) and CSet(O(X)) was dealt with in 1.3.9 
and the remarks immediately following it. And we have constructed functors 


Sh(X) -2> Cset(0(x)) -> LH/X > Sh(X); 


so we need to show that the composite of these three functors, in any of the 
three possible orders, is isomorphic to the identity. 

First, let F be a sheaf on X. Each s € F(U) can be regarded as an element 
s € O(F) with e(s) = U, and so it defines a vertex of the diagram D(O(F)) 
which is a copy of the inclusion U — X. Thus the sth leg A, of the colimiting 
cone is a continuous section of AO(F) over U. Moreover, if t is the restriction of 
s to an open V < U, then e(s,t) = V, and so the commutative diagram 


e(s, t) >——> €(s) 
Xs 
At 
e(t) ——> AO(F) 


implies that A; is the restriction of A, to V; thus s+ A, is a natural transfor- 
mation, i.e. a morphism of sheaves A: F — TAO(F). And from the definition of 
the three functors on morphisms, it is readily seen that A is itself natural in F. 
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So it remains to check that À is bijective. Given an open U >> X anda 
continuous section s of AO(F’) over U, we may form the pullback 


V, >_> V 
At 


U — > AO(F) 


for each t € F(V) (as V varies over O(X)). Note that à; and s are both (local 
homeomorphisms, and hence) open inclusions; so V, is an element of O(X ) con- 
tained in UM V. And since the A+, as t varies over all sections of F, cover 
AO(F), the V, form an open covering of U. Moreover, the restrictions t|y, form 
a compatible family, since V; Vy < e(t, t’), and so they patch to a unique sec- 
tion 3 € F(U), which is easily seen to be the unique element of F(U) satisfying 
As = s. Thus À is a natural isomorphism from the identity functor on Sh(X) 
to TAO. 

The proof that OTA is isomorphic to the identity on CSet(O(X)) is almost 
identical to the above; recall that we have already, after 1.3.7, observed that 
sheaves are essentially the same thing as complete O(X )-valued sets. 

Finally, we consider the composite AOT. Given a local homeomorphism 
p: E— X, the elements of OI (p) are the continuous sections of p over arbi- 
trary open sublocales of X, and they define a cone (with vertex p) under the 
diagram D(OI(p)) in Loc/X, and thus a morphism ¢,: AOT (p) > p in Loc/X 
(and hence in LH/X). Once again, it is easy to see that @, is natural in p, so 
we have only to verify that it is an isomorphism for all p. 

Let s be a continuous section of p over U € O(X). Then the diagram 


U = U 
Às S 


bp 
A@I(E) ——> E 


(where we have written AOT (E) for the domain of AOT(p)) commutes, by the 
definition of p; but since I (p) is a sheaf, it is not hard to verify that this square 
is a pullback. Noting that the vertical morphisms in the diagram are in fact 
open inclusions, we may rephrase this as saying that, for each U € O(E) such 
that U — E — X is an open inclusion, the pullback of ¢, along U — E is an 
isomorphism. But by assumption E is covered by such opens U, and hence @y is 
an isomorphism. 


We recall that two of the three functors used in the proof of 1.3.11 were origi- 
nally defined on larger categories than those on which we used them. (Of course, 
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the extension of © to [(O(X)°?, Set] does not take values in CSet(O(X)); but 
we can remedy this by composing it with the completion functor of 1.3.9.) If 
we analyse how much of the proof survives without the assumption that F is 
a sheaf (resp. that p: E — X is a local homeomorphism), we see that we still 
have a natural map A: F — TA@(F) (resp. ¢: AOT (p) > p), which becomes an 
isomorphism whenever the additional hypothesis is satisfied. Thus we have 


Scholium 1.3.12 
(i) The inclusion Sh(X) — [O(X)°?, Set] has a left adjoint, given by the 
composite TAO. 


(ii) The inclusion LH/X — Loc/X has a right adjoint, given by the compos- 
ite AOT. o 


Of course, the left adjoint to the inclusion Sh(X) — [O(X)°P, Set] may be 
constructed by other means; see for example A4.4.4, or 2.2.6 below. We note 
that either half of 1.3.12 implies that Sh(X) is complete and cocomplete, since 
both [O(X)°?, Set] and Loc/X are complete and cocomplete. 

Much of the power of sheaf theory lies in the fruitful interplay of the 
three different, but equivalent, descriptions of the category of sheaves on a 
locale: whenever we need to prove something about categories of sheaves, we 
can appeal to whichever description yields the simplest proof. For example, 


Corollary 1.3.13 Any slice category of a category of the form Sh(X) is again 
of this form. 


Proof This is easy if we regard sheaves as local homeomorphisms, because 
(as we saw in Section Al.1) (LH/X)/(p: E — X) is isomorphic to LH/E. O 


In other cases, it may be equally easy to use any of the three descriptions: 
Proposition 1.3.14 The equivalent categories of 1.3.11 are all toposes. 


Proof Of course, it suffices to prove that any one of them is a topos. But we 
have already given three different proofs of this in Part A: for Sh(X) in A2.1.8 
(see also A4.3.9), for Set(O(X)) in A3.4.13, and for LH/X in A4.2.4(e). (The 
last of these three relied on the fact that X was a space rather than a general 
locale, but the other two did not.) We shall not repeat the details here. o 


To conclude this section, we note that the locale X may be recovered, up to 
isomorphism, from its category of sheaves: 


Proposition 1.3.15 For any locale X, the lattice of subterminal objects (i.e. 
subobjects of 1) in Sh(X) is isomorphic to O(X). 


Proof Ít is easy to see (it follows from the fact that Sh(X) is reflective in 
[O(X)°?, Set]) that the monomorphisms in Sh(X) are exactly the injective nat- 
ural transformations between sheaves. Since the terminal object of Sh(X) is the 
constant functor with value 1, a sub-presheaf S of it is thus determined by the 
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set {U € O(X) | S(U) = 1}, which must be a downward-closed subset of O(X). 
But if S is a sheaf, this subset must be closed under coverings, and is thus a 
principal ideal |(U), for some U € O(X). This establishes a bijection between 
Sub(1) and O(X), which is readily seen to be order-preserving. 

Alternatively, we could argue in terms of local homeomorphisms. It is clear 
that the inclusion of an open sublocale is a local homeomorphism, and it is again 
not difficult to verify that these are exactly the monomorphisms in LH. Thus 
the subterminal objects of LH/X (equivalently, the subobjects of X in LH) are 
exactly the (inclusions of) open sublocales of X. o 


Suggestions for further reading: Fourman & Scott [366], Gray [415], 
Tennison [1163]. 


C1.4 Continuous maps 


In the previous section, we considered sheaves on a single locale X. We are now 
ready to investigate the functoriality of the construction X +> Sh(X). 

Let f: X — Y be a continuous map of locales. By 1.3.2(ii), we know the 
pullback functor f*: Loc/Y — Loc/X restricts to a functor (which we shall 
still denote by f*) from LH/Y to LH/X, and by 1.3.11 we can regard this as a 
functor f*: Sh(Y) > Sh(X), which we call the inverse image functor associated 
with f. 


Lemma 1.4.1 For any f, f*: Sh(Y) —> Sh(X) is cartesian. 


Proof Asa functor Loc/Y — Loc/X, f* preserves all limits, since it has a 
left adjoint (A1.2.8). And the inclusion LH/X — Loc/X creates finite limits, 
by 1.3.2(iv); so the restriction of f* preserves them. oO 


Of course, we have also used f* as the name for the frame homomorphism 
O(Y) — O(X) corresponding to f. This notation is consistent with that just 
introduced: if we identify open sublocales of Y with subterminal objects of 
LH/Y, as in 1.3.15, then 1.2.10 tells us that f*(U) is indeed the pullback of 
U along f. 


Remark 1.4.2 Before moving on, we pause to note a special case of the con- 
struction of 1.4.1. Suppose X is the terminal locale 1, and Y is a (sober) space; 
then continuous maps X — Y may be identified with points of Y. Moreover, the 
domain of any local homeomorphism p: E — Y is a space, by 1.3.2(v). Now the 
inclusion Sob — Loc does not preserve all pullbacks, as we saw in Section C1.2; 
but by 1.2.10 it preserves pullbacks of open inclusions, and from this one may 
easily verify that it preserves pullbacks of local homeomorphisms (essentially, 
because the proof of 1.3.2(ii) works in spaces as well as locales). Thus, for a 
point y: 1 — Y, the functor y*: Sh(Y) — Sh(1) = Set sends p: E — Y to the 
fibre E, = {e € E | p(e) = y} of p over y (which of course has the discrete 
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topology as a subspace of E). We call this set the stalk of p (or of the sheaf on 
Y which corresponds to it) at the point y. 


Next, we observe that f*: O(Y) — O(X) preserves covers and finite inter- 
sections — that is, those operations on open sublocales which are used in the 
definition of a sheaf (1.3.1). Thus it is immediate that if F is a sheaf on X, the 
composite 


oy) ws O(x)P sie 


is a sheaf on Y. And this construction is clearly functorial in F, so it defines a 
functor f,: Sh(X) — Sh(Y), which we call the direct image functor associated 
with f. 


Theorem 1.4.3 For any f: X > Y, the inverse image functor f*: Sh(Y) > 
Sh(X) is left adjoint to fx: Sh(X) > Sh(Y). 


Proof Let p: E—/Y be a local homeomorphism, and F a sheaf on X. 
By the proof of 1.3.11, we know that p is a colimit in Loc/Y of a diagram 
D(OT(p)) whose vertices are open inclusions U; >> Y, the legs A;: U; > E of 
the colimiting cone themselves being open inclusions. Since f* preserves unions 
of open sublocales, it follows that f*(p) is the union of the open sublocales 
f* (Ai): f*(Ui) — f*(E) , and hence (by 1.3.4) it is the colimit in Loc/X of the 
f*(U,). Thus a morphism f*(p) —> AO(F) in LH/X corresponds to a compat- 
ible family of morphisms U; — AO(F), i.e. to a compatible family of sections 
si E TAO(F)(f*(U;)) = F(f*(U;)). (Warning: ‘compatible’ here means that the 
restrictions of s; and sj agree on the open sublocale which is the intersection of 
f*(U;) and f*(U;) in f*(E), not on their intersection in X.) But precisely the 
same data defines a morphism p > AO(f,(F)) in LH/Y; so we have set up a 
bijection between these two homn-sets, which is clearly natural in both p and F. 

o 


Combining 1.4.1 and 1.4.3, we see that the pair of functors (fs, f*) forms 
a geometric morphism Sh(X) — Sh(Y), as defined in A4.1.1. (In A4.1.11 we 
gave an alternative construction of this morphism, in which both f, and f* were 
defined in terms of sheaves rather than local homeomorphisms; of course, by the 
uniqueness of adjoints, the functor f* defined there is naturally isomorphic to 
the one introduced in this section.) 

If g: Y — Z is a continuous map of locales composable with f, then the com- 
posite frame homomorphism (gf)*: O(Z) — O(X) is simply f*g*, from which 
it follows easily that the functor (gf): Sh(X) — Sh(Z) is the composite g, fx. 
Similarly, it is easy to see that (gf)*: LH/Z — LH/X is canonically isomorphic 
(though not necessarily equal) to the composite f*g*. Thus we have 


Proposition 1.4.4 The assignment X ++ Sh(X) is the object-map of a 
(pseudo-)functor Loc — Yop. o 
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But there is a further level of structure to be considered. Recall from 1.1.6 
that Loc is actually a 2-category (albeit a locally ordered one; i.e. its hom- 
categories are posets): given a parallel pair of continuous maps f,g: X 3 Y, we 
write f < g if f*(V) < g*(V) for all V € O(Y). (On the subcategory Sob, this 
structure is normally defined in terms of the ‘specialization’ partial ordering on 
points: given points x, y of a space X, we write x < y if every open set containing 
x also contains y — note that the Tọ axiom is equivalent to the assertion that this 
relation is a partial order rather than a preorder, but we are working under the 
assumption that all spaces are sober and hence Tọ — and then, given continuous 
maps of spaces f,g: X 3 Y, we set f < gif f(x) < g(x) for all x € X. It is not 
hard to verify that this definition is equivalent to saying that f~'(V) C g“'(V) 
for all V € O(Y).) 

Now we may regard an inequality f < g as a natural transformation f* — g*, 
when the latter are regarded as functors between the posets O(Y) and O(X); 
and if we regard them as functors O(Y)°? = O(X)°P, then the direction of 
the natural transformation is of course reversed. Thus we see that the inequality 
f < g induces a natural transformation g, > f+: Sh(X) — Sh(Y), and hence by 
adjointness a natural transformation f* — g*: Sh(Y) — Sh(X). Alternatively, 
we could have constructed the latter directly: if p: E — Y is a local homeomor- 
phism, then for each V € O(F) such that V — E — Y is an open inclusion, 
we have a morphism f*(V) — g*(V) in Loc/X, and these morphisms combine 
to give a natural transformation between the diagrams in Loc/X whose colim- 
its are f*(E) and g*(E) respectively. It is clear that this construction yields a 
functor from the ordered set Loc (X,Y) to the category Top (Sh(X), Sh(Y)) 
of geometric morphisms and geometric transformations between them; so the 
assignment X ++ Sh(X) is actually a 2-functor. Moreover, we have 


Proposition 1.4.5 Let X and Y be locales. Then the functor Loc(X,Y) > 
Top (Sh(X), Sh(Y)) gust described is (one half of) an equivalence of categories. 


Proof Let us, for the moment, write G(f) for the geometric morphism 
Sh(X) — Sh(Y) induced by a continuous map f: X > Y. If g: Sh(X) > Sh(Y) 
is an arbitrary geometric morphism, then since g* preserves 1 and monomor- 
phisms it restricts to a functor Subsn(y)(1) > Subsn(x)(1) — equivalently, by 
1.3.15, to an order-preserving map O(Y) —> O(X). Moreover, this map preserves 
finite intersections (since they are products in Sh(Y)); and it has a right adjoint 
(since g, also restricts to a functor between subobjects of 1) and hence preserves 
arbitrary unions. So it is a frame homomorphism, we let F(g): X — Y denote 
the corresponding locale map. Now we noted earlier that the restriction of G(f)* 
to subterminal objects is just f*, i.e. that FG(f) = f, for any locale map f. And 
for a geometric morphism g, we know that g* is determined up to isomorphism 
by its restriction to subterminal objects, since every object of LH/Y can be 
written as a colimit of a diagram in Sub(1), and g* preserves colimits; hence we 
have GF(g) © g. Similarly, if h is another geometric morphism Sh(X) > Sh(Y), 
then a natural transformation g* — h* is uniquely determined by its values at 
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subterminal objects; hence there is at most one such transformation, and it exists 
iff g*(V) < h*(V) for all V € O(Y). So (F is functorial, and) the isomorphism 
GF(g) = g is natural in g. O 


From now on, we shall usually not distinguish notationally between a con- 
tinuous map of locales and the geometric morphism of sheaf toposes which it 
induces; in other words, we shall identify Loc with the full sub-2-category of 
Top whose objects are toposes of the form Sh(X). In fact we can do slightly 
more than this: since Loc has a terminal object 1, and it is easy to see (using 
any of the three descriptions of the last section) that Sh(1) is equivalent to Set, 
we can in fact regard Loc = Loc/1 as a full sub-2-category of Top/Set. 

The next problem is obviously to identify which objects of Top/Set lie 
in this sub-2-category; i.e. which toposes over Set are equivalent to ones of 
the form Sh(X). One advantage of working with locales rather than spaces 
is that the answer to this question takes on a particularly simple form. First 
we recall from A4.1.9 that, for a topos E£, there is (up to canonical iso- 
morphism) at most one geometric morphism € — Set, and that such a 
morphism exists iff € is locally small and has set-indexed copowers. The lat- 
ter condition does not imply that € has all set-indexed coproducts, as we 
saw in Example A2.1.7, but it does imply the existence of some nontrivial 
coproducts: 


Lemma 1.4.6 If there exists a geometric morphism y: E — Set, then any set- 
indexed family of subterminal objects in E has a coproduct. 


Proof Let (U; | i € I) be such a family. For each i, let xi: 1 — Q be the 
classifying map of U; — 1; then since y*(J) is the J-indexed copower of 1, the 
x, induce a map x: y*(J) — Q. Let m: U — y*(I) be the subobject classified by 
x. Then for each 7 € J we have a pullback 

U 


eS ar) 


peas 


E 


where v, is the ith coprojection. But pullback along m has a right adjoint (A2.3.3) 
and so preserves any colimits which exist; so ly is the coproduct of the @,; in 
E/U, i.e. U is the coproduct of the U; in E. oO 


An easy generalization of the proof of 1.4.6 shows that, under the same 
hypothesis on €, we may form the coproduct of an arbitrary set-indexed family 
(A; |i € I) of objects of €, provided there exists some A € ob E which contains 
each A; as a subobject. 
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We recall that in A4.6.1 we defined a geometric morphism f: € — F to be 
localic if every object of E is expressible as a quotient of a subobject of an object 
of the form f*A, A € ob F. Now we can state 


Theorem 1.4.7 Let y: € — Set be a topos defined over Set. The following 
conditions are equivalent: 


(i) E is equivalent to a topos of the form Sh(X), for some locale X. 
(ii) The subterminal objects of E form a generating set. 


(iii) The geometric morphism y is localic. 


It was of course this theorem which inspired the use of the term ‘localic’ for 
geometric morphisms. 


Proof (i) = (ii): We already know that Sh(X) is generated by subterminal 
objects, since every object of LH/X is the colimit of a diagram whose vertices 
are subterminal objects. (Alternatively, we could argue that the representable 
functors form a generating set for [(O(X)°?, Set], and hence also for the full 
subcategory Sh(X) (since they lie in this subcategory); but they are precisely 
the subterminal objects of Sh(X), by 1.3.15.) 

(ii) > (iii): If the subterminal objects of € generate, then the family of all 
maps from such objects to a given object A of £ is jointly epic; and this family 
is a set, since E is locally small. So by 1.4.6 we can construct an epimorphism 
U — A where U is a coproduct of subterminal objects, and hence (by the proof 
of 1.4.6) a subobject of some y*(J). So y is localic. 

(iii) > (ii): Conversely, if y is localic, then for any object A we have an 
epimorphism U -» A where U is a subobject of some y*(J), and hence (by 
pulling back along the coprojections) an J-indexed coproduct of subterminal 
objects. So the family of all maps from subterminal objects to A is epic, i.e. the 
subterminal objects form a separating family for £ — and in fact they form a set, 
since we can identify Sube(1) with y,(Q¢). But E is balanced (A1.6.2), so any 
separating set is also a generating set. 

(ii) > (i): Suppose £ is generated by subterminal objects. The lattice Subg (1) 
of subterminal objects of E£ is a Heyting algebra, by A1.5.10 (or by A1.6.3). But 
it also has set-indexed unions by 1.4.6, since we can construct these as images 
of maps from coproducts to 1; so by 1.1.2 it is a frame, and we take it to be 
O(X). Now for any object A of E, the restriction of the functor €(—, A) to 
Sub(1) is a presheaf P(A) on X; and in fact it is a sheaf, since the construction 
of unions in O(X) just described enables us to show that they are colimits (in 
£) of diagrams like that described in 1.3.4. Clearly, given a morphism f: A > B 
of E, composition with f yields a natural transformation P(A) > P(B); so P 
is a functor E > Sh(X). In the opposite direction, given a sheaf F on X, we 
can regard the diagram D(O(F)) constructed in the last section as a diagram 
in € (whose vertices are subterminal objects) rather than in LH/X; and it has 
a colimit in € by 1.4.6, since we may construct colimits in the usual way from 
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coproducts and coequalizers. We take this colimit to be Q(F); then the proof in 
1.3.10 that A was functorial also shows that Q is a functor Sh(X) > £. 

Now if F is any sheaf on X, the proof given in 1.3.11 that TAO(F) S F can 
be carried through (with suitable changes of notation) in € rather than in LH/X, 
and it yields a proof that PQ(F) = F, naturally in F. So it remains to show 
that we have a natural isomorphism QP(A) = A, for A € ob €. We certainly 
have a natural transformation €: QP — leg, since the family of all maps from 
subterminal objects to A forms a cone under the diagram whose colimit is @ P(A). 
And, by the construction of this diagram, each morphism from a subterminal 
object to A factors through «4. Moreover, the factorization is unique, since the 
argument used to prove PQ(F) = F shows that any morphism U — QP(A), 
where U is subterminal, is one of the legs of the colimit cone. So the fact that 
the subterminal objects form a generating set implies that €4 is an isomorphism 
for any A. oO 


Remark 1.4.8 If we consider how much of the proof of (ii) > (i) above sur- 
vives without the assumption that € is generated by subterminal objects, we 
see that this assumption was only used in the proof that € is an isomorphism. 
Hence, for a general Set-topos y: E — Set, the functor Q is left adjoint to P, 
and identifies Sh( X) (where O(X) is the frame of subterminal objects of €) with 
a coreflective subcategory of €. In fact it can be shown that Q preserves finite 
limits; so the pair (P,Q) is a geometric morphism, and this yields another con- 
struction of the hyperconnected-localic factorization of the geometric morphism 
y, as originally considered in A4.6.5. In particular, we note once again that the 
sub-2-category LLop/Set of localic toposes over Set is reflective in Top/Set 
(cf. A4.6.12), and hence that the functor Loc — ZYop/Set which sends X to 
Sh(X) preserves limits (in an appropriate 2-categorical sense). This is one more 
advantage of working with locales rather than spaces: as we have already seen, 
the inclusion Sob — Loc fails to preserve even finite limits (the examples we 
saw involved intersections of subspaces, but there are also easy examples to show 
that a product of two spatial locales can fail to be spatial), and hence so does 
X ++ Sh(X) considered as a functor on Sp. 


Remark 1.4.9 If we wish to characterize toposes of the form Sh(X) for a 
space (as opposed to a locale) X, we may do so by adding to the conditions 
of 1.4.7(ii) or (iii) the requirement that the family of all inverse image functors 
E — Set should be jointly conservative. We know that this is true for the topos 
of sheaves on a space, by the description of these functors in 1.4.2. Conversely, if 
Y is a non-spatial locale, then any point of Y factors through its spatial part Yp, 
and so the corresponding inverse image functor Sh(Y) — Set inverts any inclu- 
sion between subterminal objects which is inverted by the frame homomorphism 
O(Y) > O(Y;). 


Before leaving this topic, we pause to note a very special case of 1.4.7, which 
fulfils a promise made at the end of Section A2.1. 
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Corollary 1.4.10 For aSet-topos y: E — Set, the following are equivalent: 


(i) y ts an equivalence. 


(ii) E is non-degenerate (i.e. 0 — 1 is not an isomorphism), and tts terminal 
object is a generator. 


Proof (i) > (ii) since we know that Set has these properties. 

(ii) > (i): If (ii) holds, then by 1.4.7 E€ is equivalent to Sh(X), where X is 
the locale defined by O(X) = Sube(1). But if U is any subterminal object of € 
such that U > 1 is not an isomorphism, then 0 — U must be an isomorphism 
(since there are no morphisms from 1 to either 0 or U); so, given that £ is non- 
degenerate, O(X) has just two elements, and X is the terminal object 1 of Loc. 

o 


C1.5 Some topological properties of toposes 


In this section, our aim is to show how some familiar topological properties 
of locales and continuous maps are reflected in categorical properties of the 
corresponding sheaf toposes and geometric morphisms. This subject will be 
investigated in much greater detail in Chapter C3 below; but it will be use- 
ful to introduce some of the definitions now, before we move on to considering 
sheaves on sites. In fact, despite the title of the section, almost all the proper- 
ties we study will be properties of geometric morphisms rather than properties 
of toposes: it turns out that many things which are traditionally thought of 
as properties of a space (or locale) X are actually properties of the geometric 
morphism Sh(X) — Set. Thus the act of translating these properties into their 
topos-theoretic form allows us to ‘relativize’ them; that is, to apply them to arbi- 
trary geometric morphisms and thence, by specializing to the case of morphisms 
Sh(X) — Sh(Y), to arbitrary continuous maps of locales. We shall investigate 
this point of view further in Section C1.6. 

First, and most importantly, we show how sheaf toposes motivate the defi- 
nitions of geometric inclusions and surjections, introduced in Section A4.2. One 
half of the following result was in fact proved there (see A4.2.12(c)), but the 
other was deliberately postponed until now. 


Proposition 1.5.1 Let f: X —Y be a continuous map of locales. 


(i) f is a sublocale inclusion (i.e. a regular monomorphism in Loc) iff the 
direct image functor f,: Sh(X) — Sh(Y) is full and faithful. 

(ii) f is an epimorphism in Loc iff the inverse image functor f*: Sh(Y) > Sh(X 
is faithful. 


Proof (i) f is a sublocale inclusion iff the frame homomorphism 
f*: O(Y) — O(X) is surjective. If this condition holds, then it is easy to see that, 
for functors F,G: O(Y)°? = Set, any natural transformation F o f* — Go f* 
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derives from a unique natural transformation F — G; so f+ is full and faithful. 
Conversely, if f, is full and faithful, then the counit of (f* 4 f.) is an isomor- 
phism; so any subterminal object U of Sh(X) satisfies U = f*f,(U), i.e. the 
restriction of f* to subterminal objects is surjective. 

(ii) This requires a little care, because in general epimorphisms are not stable 
under pullback in Loc, as we saw in 1.2.12. However, it follows from 1.2.11 that 
epimorphisms in Loc are stable under pullback along open inclusions. From here, 
a standard argument along the lines of 1.3.2(ii) shows that they are stable under 
pullback along local homeomorphisms. Now if 


h 
Be ee ee 
k 
p q 
Y 


is a parallel pair of morphisms in LH/Y having the same image under f*, then 
h and k are equalized by the projection f*(E) — E, and so if f is epimorphic 
they must be equal. 

Conversely, if f*: Sh(Y) — Sh(X) is faithful then it is also conservative, 
since Sh(Y) is balanced; so in particular it reflects isomorphisms between subter- 
minal objects, and hence f*: O(Y) — O(X) is injective, i.e. f is an epimorphism 
in Loc. oO 


Corollary 1.5.2 Let f: X —Y be a continuous map of locales. Then the 
image factorization of the induced geometric morphism Sh(X) — Sh(Y), as 
constructed in A4.2.10, corresponds to the factorization of f in Loc as an epi- 
morphism followed by a regular monomorphism. o 


For spaces, the analogue of 1.5.1 does not look quite so neat as it does for 
locales. If f: X — Y is a subspace inclusion (resp. a continuous surjection), 
then the corresponding map of locales is a regular monomorphism (resp. an 
epimorphism); but the converses do not hold without mild separation hypothe- 
ses on the spaces involved. For the inclusion case, all we require is that X 
should be a To-space, which is covered by our blanket assumption of sobriety 
(if f7}: O(Y) — O(X) is surjective, then two points of X with the same image 
under f must lie in exactly the same open sets of X); but in the surjection 
case, the most that we can deduce from knowing that f~! is injective is that the 
image of f is superdense in Y, i.e. that its subclosure (as defined in 1.2.5) is the 
whole of Y. If Y is a Tp-space, this forces f to be surjective; but in general it 
does not. Nevertheless, we shall commonly use the term surjection to refer to an 
epimorphism in Loc. 

Next, we consider open maps. Of course, we say a continuous map f: X — Y 
of locales is open if, for every open sublocale U of X, its image fi(U) in Sub(Y) 
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is an open sublocale of Y. Once again, this is not quite equivalent to the usual 
notion of openness for continuous maps of spaces; it is implied by the latter, but 
in the opposite direction it only implies that the set-theoretic image of each open 
subset of X is ‘almost open’ in Y, in the sense that its subclosure is open. Even 
when X and Y are both sober, this does not suffice to make all such images open 
(see [292]), but it obviously does so if Y is a Tp-space. 

We shall need the following result from locale theory. 


Lemma 1.5.3 For a continuous map f: X — Y of locales, the following are 
equivalent: 


(i) f is an open map. 
(ii) The frame homomorphism f*: O(Y) — O(X) has a left adjoint fi, which 
satisfies the Frobenius reciprocity condition that 


AUN FV) = f(U) nV 


for all U € O(X), V € O(Y). 


(ii) f*: O(Y) — O(X) is a homomorphism of complete Heyting algebras; i.e. 
it preserves arbitrary meets and the Heyting implication operation. 


Proof (i)e (ii): The assertion that f* has a left adjoint is equivalent to saying 
that, for every U € O(X), there is a unique smallest open sublocale V = f,(U) 
of Y such that U — X — Y factors through V — Y. For openness, we need to 
know in addition that f.(U) actually coincides with fi(U); equivalently, that the 
induced map f: U + f.(U) is a locale epimorphism. But if we identify the frames 
O(U) and O(f.(U)) with the appropriate principal ideals in O(X) and O(Y), 
then the frame homomorphism f* sends an open V < f.(U) to f*(V)NU, and 
this is easily seen to have a left adjoint, namely f, restricted to open sublocales 
contained in U. So the Frobenius reciprocity condition implies that the counit 
of this adjunction is the identity, and hence that f* is injective. Conversely, if 
f* is injective, then the Frobenius reciprocity condition holds for pairs (U, V) 
such that V < f.(U); to deduce the general case, we note that U < f* f.(U) by 
adjointness, and so 


SUNA FV) = FUNK FUN F(V)) = feU N FF (UJNV)) = f(U) OV, 


the last step by the special case already established. 

(ii) > (iii): f* preserves arbitrary meets iff it has a left adjoint, by the adjoint 
functor theorem; and then it also preserves implication iff the Frobenius reci- 
procity condition holds, by A1.5.8. CJ 


Theorem 1.5.4 Let f: X —Y be a continuous map of locales. Then f is 
open iff the inverse image functor f*: Sh(Y) — Sh(X) is a Heyting functor 
(i.e. commutes with universal quantification). 
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Proof Suppose f is open. Then the same is true of the pullback of f along 
any open inclusion V => Y (since, by 1.2.10, this is the restriction of f to an 
open sublocale of its domain), and hence it is also true for the pullback of f 
along any local homeomorphism E&E — Y: if U is an open sublocale of f*(£), 
we can express it as a union of open sublocales U N f*(V) where V = E is an 
open sublocale such that V — E — Y is an open inclusion, and the image of U 
under the morphism f*(E) — E is the union of the images of the U N f*(V), 
and so open in E. (In fact we shall see later that open maps of locales are stable 
under arbitrary pullback.) Thus, for any object A of Sh(Y), we may deduce 
from 1.5.3(ii) that the functor Subgp(y)(A) —> Subgnix)(f*(A)) obtained by 
applying f* to subobjects of A has a left adjoint fA. Moreover, the Frobenius 
reciprocity condition is tantamount to saying that these left adjoints commute 
with pullback along open inclusions (that is, monomorphisms in Sh(Y)); and a 
similar argument to the one above enables us to show that they commute with 
pullback along arbitrary local homeomorphisms, i.e. the square 


ie 
Sub(f*(B)) ——> Sub(B) 


(fh) h 
A 
Sub(f*(A)) —> Sub(A) 


commutes up to isomorphism for any h: A > B in Sh(Y). Taking right adjoints 
of all the functors in this diagram, we deduce that 


Sub(A) Ls Sub(f*(A)) 


Yh Veoh 


* 


Sub( B) — > Sub(f*(B)) 


also commutes up to isomorphism, i.e. f* is a Heyting functor. 

Conversely, suppose f* is a Heyting functor. Since the Heyting implication 
in the subobject lattices of a Heyting category is definable in terms of uni- 
versal quantification (A1.4.13), f* also preserves this operation; in particular, 
f*: O(Y) — O(X) is a Heyting algebra homomorphism. But if y: Sh(Y) — Set 
is the unique geometric morphism, then y*(J/) is the J-indexed copower of 1 in 
Sh(Y), and its subobjects correspond to J-indexed families of subobjects of 1, 
and universal quantification along y*(J) — 1 corresponds to taking intersections 
of such families. So f* preserves all such intersections, and we have verified con- 
dition (iii) of 1.5.3. g 
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In view of this result, we define an arbitrary geometric morphism f: E — F 
to be open if its inverse image is a Heyting functor. Open geometric morphisms 
will be studied in detail in Section C3.1. 


Next, we consider compactness. Of course, a locale X is said to be compact 
if the top element X of O(X) is inaccessible by directed joins: i.e. if, whenever 
we have a directed family (U; | i € I) of open sublocales of X whose union 
is the whole of X, then some U; is the whole of X. This translates easily into 
topos-theoretic terms: 


Lemma 1.5.5 A locale X is compact iff the direct image functor 
Yx: Sh(X) — Set preserves directed unions of subobjects. 


Proof We recall that y,. is (isomorphic to) the representable functor 
Sh(X)(1,—). Now, given an object A of Sh(X) which is the directed union 
of a family of subobjects (A; | i € J), for each (x: 1 — A) € 7,(A) we may form 
the pullback 

U; >———> 1 7 


A; >———> A 
and then the U; form a directed family of subterminal objects whose union is 1. 
So if X is compact, one of the U; must be the whole of 1; i.e. z must factor 
through some A; — A. So y,(A) is the union of the 7,(A;). The converse follows 
immediately from considering subobjects of 1. o 


We shall study morphisms whose direct image functors preserve directed 
unions of subterminal objects in Section C3.2, under the name proper maps; we 
shall see that there is an important sense in which the class of proper maps is 
‘dual’ to that of open maps. 

For not-necessarily-localic morphisms, the condition that a direct image func- 
tor should preserve directed unions seems less ‘natural’ than the condition that it 
should preserve all filtered colimits; however, the latter is strictly stronger (even 
in the localic case). We shall study geometric morphisms with this property, 
under the name tidy morphisms, in Section C3.4. 

We digress briefly to consider a still stronger form of compactness. By a focal 
point of a space X, we mean a point x whose only neighbourhood is the whole 
of X. (The name is justified by the fact that a point is focal iff every sequence 
converges to it.) Clearly, in a Tp-space (in particular, in a sober space) there 
can be at most one focal point. We say a space is local if it has a focal point; 
this name derives from the fact that the prime spectrum of a local ring is a local 
space (the focal point being that which corresponds to the unique maximal ideal 
of the ring). An equivalent frame-theoretic condition is that the top element of 
O(X) is inaccessible by arbitrary joins (i.e. that {X} is a completely prime filter 
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in O(X) — equivalently, that every open cover of X has a singleton subcover), 
and we take this as the definition of localness for a locale. 


Lemma 1.5.6 A locale X is local iff the direct image functor Yx: Sh(X) — Set 
has a right adjoint. 


Proof If +, has a right adjoint, then it must be the inverse image part of a 
geometric morphism c: Set — Sh(X), and the latter must correspond by 1.4.5 
to a point of the locale X. The completely prime filter in O(X ) corresponding to 
this point is the set {U € O(X) | y.(U) = 1}; but from the description of y, as a 
representable functor, this set is the singleton {X}. So if such a point exists then 
X is local. Conversely, if X is local, then from the description of stalk functors 
given in 1.4.2 we see that the stalk functor at the focal point is isomorphic to 
Sh(X) (1, —); so the latter has a right adjoint. o 


We call a geometric morphism f: € > F local if f, has a right adjoint f* 
which is full and faithful. (Note that the latter condition is automatic in the 
situation considered in 1.5.6, since the point ¢ must be an inclusion.) Local 
morphisms will be considered in Section C3.6. 


Next, we consider global and local connectedness. Traditionally, a space X is 
called connected if O(X) has precisely two complemented elements Ø and X. As 
our definition for locales, we shall take the classically equivalent, but construc- 
tively more sensible, condition that, if (U; | i € J) is a pairwise-disjoint family 
of open sublocales whose union is the whole of X, then exactly one U; must be 
the whole of X. (For the equivalence, note that for each i € I the open U; is 
complemented by the union of the rest of the U,; so it must be either @ or X.) 


Lemma 1.5.7 Fora locale X, the following are equivalent: 


(i) X is connected. 

(ii) The terminal object of Sh(X) is indecomposable (cf. A1.1.10). 
(iii) The inverse image functor y*: Set — Sh(X) is full and faithful. 
(iv) The direct image functor yx: Sh( X) — Set preserves coproducts. 


Proof (i) © (ii) is trivial, given the identification of subterminal objects of 
Sh(X) with elements of O(X) (1.3.15). 

(ii) > (iv): Since yẹ is the functor represented by the terminal object of 
Sh(X), an element of 7.([],<,; 4i) corresponds to a morphism 1 —> [],.; Ai, 
and hence (by pulling back the coprojections) to an J-indexed coproduct decom- 
position of 1 in Sh(X). So (ii) says that every element of 7,.([],<, Ai) derives 
from an element of 7.(A;) for some i; i.e. the former is the disjoint union of the 
latter. 

(iv) => (iii): If y,. preserves coproducts, then so does the composite Yy". 
But this composite also preserves 1; and every set J is an I-fold coproduct of 
copies of 1, so we have 7.7*(J) = [J fiez 1 = J. Thus the counit of the adjunction 
(y* 4+.) is an isomorphism; equivalently, y* is full and faithful. 
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(iii) > (ii): Suppose given a coproduct decomposition 1 = [ |,e; Ui in Sh(X). 
Then the inclusions U; — 1 combine to form a morphism 1 S 7*(1) — 7*(J). 
If this equals y*(i) for some i: 1 — J in Set, then the corresponding U; is the 
whole of X. g 


We therefore define a geometric morphism to be connected if its inverse image 
part is full and faithful, or equivalently if the unit of the adjunction is an iso- 
morphism. We note that hyperconnected morphisms, as defined in A4.6.6, are 
connected; and so are local morphisms as defined earlier in this section, since 
fa f* is isomorphic to the identity functor iff its right adjoint f, f# is. In 3.4.14 
below we shall prove that 1.5.7(iii) and (iv) are equivalent over an arbitrary 
base, provided we interpret the coproducts in (iv) as being indexed by objects 
of the base. 

We say a locale X is locally connected if every open sublocale of X is express- 
ible as a union of connected open sublocales. It is easy to see that, when restricted 
to spatial locales, this is equivalent to any of the usual definitions of local con- 
nectedness for spaces. Also, it is a local property: that is, not only is it inherited 
by open sublocales, but if X has an open cover by locally connected sublocales 
V;, then X is locally connected (since any U € O(X) is the union of the sublo- 
cales U N V;, and these are unions of connected open sublocales). Hence if X is 
locally connected, so is the domain of any local homeomorphism p: E — X. We 
shall also require 


Lemma 1.5.8 If X is locally connected, then any open sublocale of X has a 
pairwise-disjoint covering by connected open sublocales. 


Proof It suffices to construct such a covering of X itself. Let (U; | i € J) 
be an arbitrary family of connected open sublocales covering X; we define an 
equivalence relation R on J to be the transitive closure of the relation which 
relates i and 2’ precisely when U; N U; # 0. Let J be the set of R-equivalence 
classes, and for each j € J let Vj = U;e; Ui; then by construction (plus the 
infinite distributive law in O(X)) the V; are pairwise disjoint, and they cover X. 
We must show that they are connected. But if (Wp | k € K) is a pairwise-disjoint 
open cover of some V,, then for each i € j the sublocales (W N U; | k € K) 
form a pairwise-disjoint open cover of U;, and so one of them, say Wy(;) N Ui, 
must be the whole of U;. This defines a function f: 7 — K; but if U; nN Uy #0 
we must have f(i) = f(2’), since the Wọ are disjoint, and so f is constant on the 
equivalence class j. Hence there exists k € K such that Wp contains every U;, 
t € j, and so Wk = Vj. CJ 


It is easy to see that the disjoint decomposition of X into connected open 
sublocales, which we have constructed in the proof of 1.5.8, is unique; of course, 
we call its members the components of X. Although the proof of 1.5.8 appears 
to require classical logic, in that the equivalence relation R is defined in terms 
of an inequality, we shall see when we reach Section C3.3 that the definition of 
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local connectedness implies the (classically vacuous) property of openness (i.e. 
the property that the unique map from X to the terminal locale 1 is open), 
and for open locales we may use the notion of positivity developed in 3.1.17 
as a substitute for nonemptiness. Thus the constructively valid version of 1.5.8 
asserts that, for any locally connected locale X, there is a family of connected 
open sublocales {U; | i € I} which is ‘pairwise disjoint’ in the sense that U; N U; 
is positive iff 2 = j, and whose union is the whole of X. 


Proposition 1.5.9 For a locale X, the following are equivalent: 


(i) X is locally connected. 
(ii) The geometric morphism y: Sh(X) — Set is essential, i.e. y* has a left 
adjoint y. 
(iii) y*: Set — Sh(X) is a cartesian closed functor (i.e. preserves exponen- 
tials). 


Proof (i) = (ii): We define y in terms of local homeomorphisms; specifically, 
we define y:(p: E — X) to be the set of components of E (which we have already 
observed to be locally connected). This is functorial, because the image of a 
connected locale under a continuous map is readily seen to be connected, and 
so a map f: E — F between (the domains of) two local homeomorphisms over 
X maps each component of E into a component of F. Now suppose given a 
morphism f: (p: E > X) > y*(J) in LH/X. Since y*(J) is the disjoint union 
of the subobjects v;: 1 — y*(I), the inverse images of these subobjects along f 
form a disjoint decomposition of E, and so each component of E is contained 
in just one of them; this defines a function y(p) — J. Conversely, given such 
a function g, we may define a morphism p — y*(J) in LH/X, by setting its 
restriction to the kth component Ex of E equal to the composite 


Valk) 
Ey, ——> 1 — > (J). 


These two constructions are clearly inverse to each other, and define the required 
adjunction (y 4 y*). 

(ii) > (iii): Given that y* has a left adjoint, we know from A1.5.8 that it is 
cartesian closed iff the Frobenius reciprocity condition 


WE x *(1)) = y(E) x I 


holds for all objects E of Sh(X), J of Set. But E x y*(J) is the coproduct of I 
copies of E in Sh(X), since E x (—) preserves coproducts; and y also preserves 
coproducts since it is a left adjoint, so this is immediate. 

(iii) > (i): For any object E of Sh(X), morphisms E — y*(I) correspond to 
I-indexed coproduct (= disjoint union) decompositions of EF in Sh(X). So the 
assertion that 7* preserves the exponential J’ is equivalent to saying that, for any 
E, the map from the set of J!-indexed decompositions of E to the set of J-indexed 
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families of J-indexed decompositions of Æ, which sends a decomposition (Ep | 
f € J’) to the family of decompositions in which the jth member of the ith 
family is |] f=; Ey, is bijective. But if this holds for all J and J, then the lattice 
Sub‘(£) of complemented subobjects of E is complete, since given any family of 
complemented subobjects (E; | i € I) we can form the J-indexed family of {0, 1}- 
indexed decompositions consisting of the E; and their complements, and then one 
of the elements of the {0, 1}/-indexed decomposition corresponding to this must 
be the meet of the E; in Sub? (E). And a similar argument shows that Sub‘°(£) 
must satisfy the complete distributive law; but since it is a Boolean algebra, this 
forces it to be atomic, i.e. every object of Sh(X) has a coproduct decomposition 
into subobjects which admit no nontrivial decompositions. In particular, this 
holds for every subterminal object in Sh(X); but this is precisely the statement 
that X is locally connected. CJ 


Proposition 1.5.9 illustrates one of the problems we encounter in seeking to 
‘relativize’ topological properties: it is sometimes the case that there are several 
conditions which are equivalent for morphisms of the form Sh(X) — Set, but 
not for more general geometric morphisms. In this case, the conditions in (ii) 
and (iii) are not equivalent for a general f: F — E: the mere existence of a left 
adjoint for f* does not suffice to ensure that Frobenius reciprocity holds, and 
the mere preservation of exponentials by f* does not suffice to ensure that it has 
a left adjoint. The answer in this case (as in many others) is to demand that the 
appropriate condition should hold in the context of €-indexed categories (where 
E is the codomain of f): thus we define a geometric morphism f to be locally 
connected if f* has an €-indexed left adjoint (and this is equivalent to saying 
that it is cartesian closed as an €-indexed functor, as we shall see in 3.3.1 below). 

Before concluding this section, we feel bound to give one example of a topo- 
logical property which is ‘absolute’ in the sense that it really is a property of 
the topos Sh(X) rather than of the geometric morphism Sh(X) — Set. The 
best-known such property is extremal disconnectedness: recall that a topological 
space X is said to be eztremally disconnected if the closure of each open sub- 
set of X is open. An equivalent formulation, which makes sense for locales as 
well as spaces, is that the Heyting negation map 7: O(X) — O(X) takes values 
in the sublattice of complemented elements of O(X); equivalently again, the 
fixset O(X).. corresponding to the smallest dense sublocale of X (cf. 1.2.6(c)) 
coincides with the sublattice of complemented elements of O(X). 


Lemma 1.5.10 A locale X is extremally disconnected iff the canonical mor- 
phism (T, L): 1111 — Q~- is an isomorphism in Sh(X). 


Proof The object 1 II 1 of Sh(X) is the sheaf of sections of the codiagonal 
map X IX — X; so its sections over an open sublocale U correspond to comple- 
mented elements of O(U). Similarly, since elements of Q(U) are arbitrary open 
sublocales of U, the elements of Q~ (U) are the regular open sublocales of U, i.e. 
those which coincide with the interior of their closure in U. So the assertion that 
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1111 — Q- is an isomorphism is equivalent to saying that every open sublocale 
of X is extremally disconnected. But it is easy to verify that extremal discon- 
nectedness is inherited by open sublocales; so this happens iff X is extremally 
disconnected. CJ 


Toposes satisfying the condition in Lemma 1.5.10 are studied in greater detail 
in Section D4.6. 


Suggestions for further reading: Grayson [417, 418], Johnstone [520, 523, 
524], Joyal & Tierney [560]. 


C1.6 Internal locales 


The concept of frame, as defined in Section Cl.1, is clearly capable of being 
interpreted constructively; and so we can talk about internal frames and locales 
in any topos, in particular in a topos Sh(X) of sheaves on a locale. The main 
result to be proved in this section is that the category Loc(Sh(X )) of internal 
locales in Sh(X) is equivalent to the slice category Loc/X of locales over X; so 
any constructively valid result about locales, when interpreted in Sh(X), yields a 
result about locales over X. (This is, once again, an instance of the superiority of 
locales over spaces: we can also talk about internal topological spaces in Sh(X ), 
but these correspond to a rather restricted class of spaces over X, as we shall 
see in 1.6.7 below.) For this section, we shall assume some familiarity with the 
interpretation of logic in a topos, as set out in Chapter D1, and with the notion 
of completeness for internal posets (cf. B2.3.9). 

We recall first from A2.1.8 that the subobject classifier Qy in Sh(Y) is the 
sheaf given by Qy (U) = {V € O(Y) | V < U} for each U € O(Y). As it is in any 
topos (A1.6.3), Q is an internal Heyting algebra in Sh(Y); it is also complete as 
an internal poset in Sh(Y), by B2.3.8(a). Now suppose we are given a continuous 
map f: Y — X of locales, inducing a geometric morphism f: Sh(Y) — Sh(X) 
as in 1.4.4; then the direct image functor f, preserves Heyting algebras (because 
it is cartesian) and complete internal posets (by B2.3.7), and so the sheaf f, (Qy) 
on X (defined by f.(Qy)(U) = {V € O(Y) | V < f*(U)}) is an internal complete 
Heyting algebra, and hence an internal frame, in Sh(X). Let us write Z(f) for 
the corresponding internal locale in Sh(X), and call it the internalization of f. 


Proposition 1.6.1 The assignment f = T(f) is a functor Loc/X — 
Loc(Sh(X)). 
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Proof Suppose given a commutative triangle of locale maps 


Then for each U € O(X), h* restricts to a frame homomorphism 
hi: gx(Qz)(U) > fx(Qy)(U), since h*g*(U) = f*(U). Moreover, these maps 
form a natural transformation g,(Qz) — f,(Qy), since h* commutes with binary 
intersections. This natural transformation is clearly an internal lattice homomor- 
phism in Sh(X); to prove that it is an internal frame homomorphism, it is enough 
to show that the right adjoints of the hj, also form a natural transformation. But 
the right adjoint of hj, is easily seen to send V < f*(U) to hu(V)M g*(U), and 
this is natural in U since 


h, (V N f*(U’)) Ng*(U") = h, (V) N hy h*g*(U") N g*(U") 
= h, (V) N g*(U’) 
= (h.(V) N g*(U)) N g*(U’) 


whenever U’ < U in O(X). Thus we have an internal locale morphism 
T(h): I(f) — I(g); and it is clearly functorial in h. o 


In the opposite direction, suppose we are given an internal frame L in Sh(X). 
Since the global sections functor F +> F(X) is the direct image of the (unique) 
geometric morphism Sh(X) — Set (cf. A4.1.9). it follows from an argument 
we have already used in this section that L(X) is a complete Heyting algebra, 
and hence a frame, in Set. Moreover, since we may identify elements of O(X) 
with subterminal objects in Sh(X), and since elements of L(U) correspond to 
morphisms U — L in Sh(X), the completeness of L as an internal poset tells us 
that, for each V < U in O(X), the restriction map pẹ : L(U) — L(V) has a left 
adjoint of,: L(V) > L(U), and that the (Beck-Chevalley) square 


L(V;) +> L(Vi Va) 


LO Se 


commutes whenever Vi, V2 < U (cf. B1.4.2). (In particular, taking Vi = Vz, this 
says that pU¥.of, is the identity on L(V).) Since L is an internal Heyting algebra 
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in Sh(X), the maps p¥ are also Heyting algebra homomorphisms, from which 
we deduce by A1.5.8 that the Frobenius reciprocity law 


o% (y A py(z)) = oy(y) Ax 


holds for x € L(U), y € L(V). In particular, taking U = X and y to be the top 
element Ty of L(V), we obtain 


oX pf (2) = £ As(V) 


where s(V) = of (Ty) € L(X). It follows easily that L(V) may be identified, 
via oY, with the principal ideal {y € L(X) | y < s(V)} of L(X). 

We claim that s: O(X) — L(X) is a frame homomorphism. To see this, note 
first that if we chase the element Ty, around the Beck—Chevalley square above 
(with U = X) and then apply ae to the result, we obtain the equality 


s(ViN V2) = oy? p% (s(Vi)) = s(V;) A (V2), 


so s preserves binary meets (and clearly s(X) = Tx). Now suppose U = (J;e; Ui 
in O(X). Then Ty = Vier ou (Tu;) in L(U), since L is a sheaf and each side of 
the equation restricts to Ty, in L(U;), for each i. Applying o% to this equation 
(and noting that of, being a left adjoint, preserves joins), we obtain s(U) = 
Vie 8(Ui), so s preserves joins. 

Thus we have proved 


Proposition 1.6.2 For any internal frame L in Sh(X), there exists a locale 
morphism f: Y — X such that L = O(T(f)). 


Proof ‘Take Y to be the locale defined by setting O(Y) = L(X), and f to 
be the locale map defined by setting f* = s. Then for each U € O(X) we have 
LU) = {x € L(X) | x < f*(U)}, as already noted; it is easy to check that 


these isomorphisms are natural in U, and so define the required isomorphism of 
internal frames in Sh(X). o 


To complete the proof of the equivalence, we must make the assignment 
L= (f:Y — X) of Proposition 1.6.2 into a functor €: Loc(Sh(X)) — Loc/X 
(E stands for ‘externalization’, of course). But this is straightforward: an inter- 
nal frame homomorphism h*: L — L’ in Sh(X) will induce an external frame 
homomorphism h*(X): L(X) — L'(X); and because the diagram 


h*(U) 
L(U) ———> L'(U) 
oU o¥ 
X X 
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commutes (since h* preserves joins) and h*(U) preserves the top element, we 
have h*(X)(s(U)) = s’(U), i.e. h*(X) is a frame homomorphism under O(X). 
Once again, it is trivial to verify that this construction is functorial; so we have 


Theorem 1.6.3 For any locale X, the categories Loc(Sh(X)) and Loc/X are 
equivalent. 


Proof We have constructed the functors T and E, and we verified in 1.6.2 
that TE is isomorphic to the identity on Loc(Sh(X)) (we did not verify that 
the isomorphism is natural, but that is entirely straightforward); so it remains 
to show that €Z is naturally isomorphic to the identity on Loc/X. It is 
clear from the constructions that for any f: Y — X we have O(domET(f)) = 
(f«(Qy))(X) S O(Y), so we have only to verify that the frame homomorphism 
s: O(X) —> (f,(Qy))(X) coincides with f*. But this is easy, since if we identify 
f.(QY)(U) with {V € O(Y) | V < f*(U)} then its top element Ty is f*(U), 
and ø% is identified with the inclusion map. Once again, the naturality of this 
isomorphism €T(f) = f is easy to verify. 0 


We recall from 1.3.11 that Sh(X) itself is equivalent to a full subcategory of 
Loc/X, namely LH/X; so Theorem 1.6.3 identifies it with a full subcategory of 
Loc(Sh(X)). Of course, the objects of this subcategory are the discrete internal 
locales (that is, those whose frames are power objects 94): if f: E— X isa 
local homeomorphism, then Sh(E) ~ Sh(X)/T(f), where T(f) is the sheaf of 
sections of f, and so f(g) S f. f*(Qx) S OA by A2.3.2 and A1.5.2. More 
generally, it is straightforward to check that the functors Z and € both preserve 
local homeomorphisms, and so we obtain 


Scholium 1.6.4 For any locale morphism f:Y — X, the topos Sh(Y) is 
equivalent (as a topos defined over Sh(X)) to the topos of sheaves on the internal 
locale T(f) in Sh(X). o 


We say an internal locale Z is spatial if its frame O(Z) is (isomorphic to) a 
subframe of a power object 4 — equivalently, if there exists a locale epimorphism 
from a discrete locale A to Z. Translating this via the equivalence of 1.6.3, we 
obtain 


Proposition 1.6.5 Let f: Y — X be a locale morphism. The following condi- 
tions are equivalent: 
(i) f corresponds to a spatial internal locale in Sh(X). 
(ii) There exists a locale epimorphism h: E — Y such that fh is a local 
homeomorphism. 
(iii) The canonical map AOT (f) — f (the counit of the adjunction of 
1.3.12(ii)) is an epimorphism. 
(iv) The family of all partial sections of f over open sublocales of X is 
eprmorphic. 
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Proof (i)e (ii): Since Z and € form an equivalence of categories, they preserve 
and reflect epimorphisms (and so does the forgetful functor Loc/X — Loc). So 
this is immediate from the identification of discrete internal locales in Sh(X) 
with local homeomorphisms over X. 

(ii) < (iii) since any epimorphism in Loc/X from a local homeomorphism to 
f must factor through the counit. 

(iii) e (iv): By definition, AOT (f) is the colimit in Loc/X of a diagram 
whose vertices are open sublocales of X, and under which the partial sections 
of f form a cone. So the family of partial sections is epimorphic iff the induced 
morphism from the colimit (i.e. the counit map) is epic. O 


Corollary 1.6.6 Suppose f: Y — X corresponds to a spatial internal locale in 
Sh(X). Then 


(i) f is an open map. 
(ii) If X is spatial, so is Y. 


Proof (i) It is clear from the definition of open maps that if fh is open and 
h is epimorphic, then f is open. So this follows from 1.6.5(ii) and the fact that 
local homeomorphisms are open maps. 

(ii) similarly follows from 1.3.2(v) and the fact that an epimorphic image of 
a spatial locale is spatial. O 


From 1.6.6(ii), we see that if X is a sober space then the category 
Sob(Sh(X)) of internal sober spaces in Sh(X) (equivalent to internal spatial 
locales) may be identified with a full subcategory of Sob/X. But 1.6.6(i) shows 
that it is by no means the whole of this category. In fact, for Tp-spaces, the 
spaces over X which correspond to spatial internal locales are characterized by 
a familiar condition: 


Corollary 1.6.7 Let f: Y — X be a continuous map of sober spaces. If Y is 
a Tp-space, then f corresponds to a spatial internal locale in Sh(X) iff it is a 
submersion, i.e. every point of Y lies in the image of a continuous section of f 
over some open subset of X. 


Proof This is immediate from 1.6.5(iv), since a family of maps to a sober 
Tp-space is epimorphic in Loc iff it is surjective (cf. the remarks after 1.5.2). 
oO 


Example 1.6.8 To fulfil a promise made after Lemma 1.2.4, we now give an 
example to show that that result can fail in a non-Boolean topos. Let X be 
a Hausdorff space (in Set) which is not discrete, and let X be the internal 
locale in Sh(X) corresponding to the projection mı: X x X — X. mı is a sub- 
mersion, since any point (x,y) of X x X is in the image of the (global) section 
t+» (t,y); so X is a spatial internal locale in Sh(X). Indeed, since the identity 
map X x Xap — X x X (where Xap denotes X retopologized with the discrete 
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topology, cf. the remarks after 1.2.13) is a continuous surjection, we can regard 
O(X) as a topology on the sheaf X =T (1: X x Xap — X), i.e. the sheaf whose 
sections over U are all locally constant functions U — X. And since X is (exter- 
nally) Hausdorff, it is not hard to verify that this topology makes X into an 
internal Hausdorff space in Sh(X). But it is not sober: its sobrification is (the 
‘sheaf of points’ of) X, i.e. the sheaf T(t: X x X — X) whose sections over U 
are all continuous functions U — X. And this is strictly larger than X, since the 
identity function X — X is not locally constant. 


We may also exploit Theorem 1.6.3 along the lines foreshadowed at the 
beginning of Section C1.4, by establishing the equivalence of various topolog- 
ical properties of internal locales in Sh(X) and properties of the locale maps 
Y — X which correspond to them. However, we shall postpone this line of 
development until Chapter C3, when we shall investigate it in some particular 
cases. 

For future reference, we shall need a description of internal frames in a topos 
of the form [C°?, Set], where C is a small cartesian category, similar to the 
description of internal frames in Sh(X ) which we gave earlier in this section. For 
this, it is convenient to proceed by way of complete join-semilattices. 


Lemma 1.6.9 Let C be a small cartesian category, and let L: C°? — Poset be 
a functor, considered as an internal ordered object in [C°?, Set]. 


(i) L is an internal complete join-semilattice in [C°P, Set] iff it takes values 
in the category CLat of complete lattices (i.e. L(U) is a complete (semi-) 
lattice for each object U of C, and L(a) preserves arbitrary joins and meets 
- equivalently, has adjoints on both sides — for each morphism a), and the 
Beck-Chevalley condition holds, i.e., for each pullback square 


U—-->Vv 
d 
w——> x 


in C, the square 


L(d) 
L(X) ——> L(W) 


(where Xp» and Xe are the left adjoints of L(b) and L(c)) commutes. 
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(ii) L is an internal frame in [C°?, Set] iff it satisfies the conditions in (i), and 
additionally each L(U) is a frame and the Frobenius reciprocity condition 


Ya(L(a)(u) Av) =u Xa(v) 


holds for alla: V — U in C, u € L(U) and v € L(V). 


Proof (i) First, suppose that L is an internal complete semilattice; equiva- 
lently, that the [C°P, Set]-indexed category [L] defined as in B2.3.3 is cocomplete. 
By B2.4.8 (or by the remark after B2.3.9), it is also complete. To see that the 
individual posets L(U) are necessarily complete and cocomplete, we observe that 
F + F(U) is the direct image of a pre-geometric morphism; specifically, it is the 
composite 
Y(U) Ya 
[C°P, Set] —_—> [C°?, Set]/Y (U) —~> Set 

where Y denotes the Yoneda embedding and 7, is the direct image of the unique 
geometric morphism. So by B2.3.7 it preserves (co)completeness of posets. The 
Yoneda embedding C — [C°?, Set] preserves pullbacks, so by B1.4.9 this implies 
that the C-indexed category obtained by restricting [L] to representables is also 
complete and cocomplete. But this is just L itself regarded as a functor C°P — 
Cat; so we deduce that the transition maps L(a) have left and right adjoints, 
and that the Beck—Chevalley conditions hold. (Of course, either of them implies 
the other, as we observed in B1.4.6.) 

For the converse, we use the characterization of cocomplete internal posets 
provided by B2.3.9(ii): that is, we construct a join map V: PL — L left adjoint 
to the principal-ideal map |: L — PL. We recall that PL(U) may be identified 
with the set of all subfunctors of L x C(—,U); and |y sends an element u to the 
subfunctor whose value at V is the set of pairs (v, a) such that v < L(a)(u). Given 
an arbitrary subfunctor F — L x C (—,U), we define Vp (F) to be the join, over 
all V € obC and all (v,a) E€ F(V), of the elements ©,(v) of L(U). To see that this 
defines a natural transformation PL — L, we use the Beck—Chevalley condition: 
given b: W — U inC and (v,a) € F(V), we have L(b)a(v) = ©.L(d)(v), where 


x—^ >W 
d b 
vV—_>U 


is a pullback in C, and the pair (L(d)(v),c) belongs to PL(b)(F)(X). So nat- 
urality follows from the fact that L(b) preserves joins. It is also clear that 
Vu lu (u) < u for all u € L(V), since the inequality v < L(a)(u) is equiva- 
lent to Xa (v) < u (in fact we actually have Vy lu (u) = u, since the pair (u, ly) 


C1.6 INTERNAL LOCALES 535 


belongs to |y (u)(U)); and lu Vy (F) 2 F for all F € PL(U), by a similar 
argument. So we have the required left adjoint. 

(ii) Suppose L is an internal frame. Then by 1.1.2 it is an internal Heyting 
algebra, so each L(U) is a Heyting algebra and hence a frame. Also, each L(a) 
is a Heyting algebra homomorphism, so by A1.5.8 the Frobenius reciprocity law 
holds. Conversely, if each L(U) is a frame and Frobenius reciprocity holds, then 
L is an internal Heyting algebra and hence a frame. O 


We may easily extend 1.6.9(ii) from functor categories [C°?, Set] to toposes 
Sh(C,T) of sheaves for a coverage on C. We recall from B2.3.10(ii) that, if j is 
a local operator on a topos E, then the complete internal posets in sh,(&) are 
just those complete internal posets in € whose underlying objects are j-sheaves; 
since the same is clearly true for Heyting algebras (by D1.2.13, if you insist), 
we deduce that the internal frames in sh;(€) are simply those internal frames 
in € whose underlying objects are sheaves. Also, by A4.3.5 and A4.4.8, we know 
that every coverage on a small category C corresponds to a local operator on 
[C°P, Set]. So we may immediately deduce 


Corollary 1.6.10 Let (C,T) be a small site whose underlying category C is 
cartesian. Then internal frames in Sh(C,T) may be identified with functors 
L: C°? — CLat such that each L(U) is a frame, the underlying set-valued func- 
tor of L is a T-sheaf, and L satisfies the Beck-Chevalley condition of 1.6.9(i) 
and the Frobenius reciprocity condition of 1.6.9(ii). O 


Suggestions for further reading: Fourman & Grayson [363], Joyal & 
Tierney [560], Moerdijk [825], Stout [1122]. 


C2 
SHEAVES ON A SITE 


C2.1 Sites and coverages 


We have seen in the last chapter how we may regard many familiar topological 
properties (of spaces or locales) as properties of sheaf toposes. This in itself would 
not be a significant gain, were it not for the fact that there are other contexts in 
mathematics where ‘topological intuition’ seems to be present, and which cannot 
be adequately described by either spaces or locales — but these contexts do give 
rise to toposes. 

Such contexts first arose in algebraic geometry, in the study of cohomology 
theories such as étale cohomology (see Chapter E3): here one has a cohomology 
theory living on something which resembles a space in that it possesses ‘local- 
izations’ and ‘coverings’, but which has an additional feature, not present in 
topology, that its ‘points’ have nontrivial automorphism groups which form part 
of the structure of the ‘space’. It was A. Grothendieck [36] who first saw how to 
give a unified description of such contexts, via the notion of a site. 

Grothendieck’s idea was that, in the definition of a sheaf on a space (or 
locale) X, the fact that the category O(X) on which the presheaves are defined 
is a frame is incidental: all one needs is a good notion of ‘covering’ in the category, 
in order to be able to formulate the sheaf axiom. Grothendieck used the name 
‘topology’ for this notion of covering; but this name, and the rather rigid axiom- 
atization of the notion which he formulated, has unhelpfully tended to obscure 
the real magnitude of the generalization involved in passing from frames to sites 
(categories equipped with a notion of covering). In particular, the fact that in 
a frame the coverings are determined by the structure of the category (they are 
exactly the colimit cones) has no parallel in the more general context, where the 
notion of covering is something arbitrarily imposed on a given category. Thus we 
have chosen to adopt the name ‘coverage’, which carries less emotional baggage 
around with it than ‘topology’, for the structure which is needed in order to 
define sheaves. 

A further fruitful source of possible misunderstanding in Grothendieck’s 
account [36] is that his notion of ‘topology’ is defined by three closure condi- 
tions of which one plays a very different rôle from the other two: one (in fact 
the second, in the order in which they are usually stated) is essential to the 
definition of the category of sheaves (cf. 2.1.13 below), whereas the other two 


C2.1 SITES AND COVERAGES 537 


are ‘saturation conditions’ which might as well be there because one can always 
close off under them without affecting the notion of sheaf. In our account, we 
shall (we hope) keep the distinction between the former and latter conditions 
clear; in fact we start off by assuming only the former. 

The definition of a coverage has already been given in A2.1.9, but we shall 
repeat it here — with one minor change: in Section A2.1 we considered coverages 
only on small categories, but here we shall not assume anything (for the time 
being, at least) about the size of the underlying category. (We also make one 
notational change: we shall denote objects of the underlying category by letters 
like U,V, W,... rather than A, B,C,..., partly to maintain continuity with the 
previous chapter and partly because we want to reserve A, B,C,... for objects 
of the topos of sheaves.) 


Definition 2.1.1 Let C bea category. By a coverage on C, we mean a function 
assigning to each object U of C a collection T(U) of families (fi: U; + U |i € J) 
of morphisms with common codomain U (called T- covering families), such that 


(C) If (fi: U; — U |i € I) is a T-covering family and g: V — U is any morphism 
with codomain U, there exists a T-covering family (h;: V; => V | j € J) such 
that each gh, factors through some fi. 


A site is a category equipped with a coverage. By a small site, we mean one 
whose underlying category is small. 


As we have already indicated, the purpose of introducing sites is in order to 
be able to define sheaves. Once again, we restate the definition from Section A2.1; 
for convenience, we also recall the definition of separated functor from A2.6.4(d). 


Definition 2.1.2 Let C be a category. We say a functor A: C°P — Set satisfies 
the sheaf aziom for a family of morphisms (fi: U; — U | i € I) if, whenever we 
are given a family of elements s; € A(U;) which are compatible in the sense that, 
whenever g: V — U; and h: V > U; satisfy fig = f,h (here i and j need not 
be distinct), we have A(g)(s;) = A(h)(s;), then there exists a unique s € A(U) 
such that A(f;)(s) = s; for each i € I. We say A is separated for (fi |i € T) if it 
satisfies the above condition with ‘a unique’ replaced by ‘at most one’. If T is a 
coverage on C, we say A is a T-sheaf (resp. T-separated) if it satisfies the sheaf 
axiom (resp. is separated) for every T-covering family. And we write Sh(C, T) 
(resp. Sep(C, T)) for the full subcategory of the functor category [C°?, Set] whose 
objects are T-sheaves (resp. T-separated functors). 


The collection of all families of morphisms for which a given functor A satisfies 
the sheaf axiom has a number of important closure properties; so there is no harm 
in adding these closure properties to the condition (C) which we have assumed 
in 2.1.1. We begin with the simplest one: recall that a sieve on an object U 
of C is a family R of morphisms with codomain U which is a right ideal in 
C, ie. f € R implies fg € R whenever the composite fg is defined. In the 
terminology introduced in A4.5.2 and A4.5.5, sieves on an object U of C are 
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sieves in the slice category C/U. (If C is locally small - which it generally will be 
from now on — then sieves on U correspond bijectively to subobjects in [C°?, Set] 
of the representable functor C (—,U), the correspondence sending a sieve R to 
the functor V + {f € R | dom f = V}.) Any family (fj: U; ~>U |i € J) 
generates a sieve on U, consisting of all the morphisms with codomain U which 
factor through at least one fj. 


Lemma 2.1.3 Let (f;:U; ~U |i € I) be a family of morphisms of C with 
common codomain, and A: C°? — Set a functor. Then A satisfies the sheaf 
ariom (resp. is separated) for (f; |i € I) iff it satisfies the sheaf axiom (resp. is 
separated) for the sieve generated by this family. 


Proof From the very definition of a compatible family, it is easy to see that any 
compatible family with respect to (f; | i € I) extends uniquely to a compatible 
family with respect to the sieve generated by the f;. So this is immediate. O 


It is clear that a collection T of families of morphisms satisfies condition (C) 
iff the same is true for the collection T consisting of all the sieves generated by 
members of T; so we may replace any site (C, T) by a site (C,T) which has the 
same sheaves, in which all the covering families are sieves. We shall call a coverage 
sifted if it has this latter property. Note that the definition of compatibility with 
respect to a family of morphisms can be simplified when the family is a sieve: 
(sf | f € R) is compatible with respect to a sieve R iff, whenever f € R and 
dom f = cod g, we have ss, = A(g)(sy). A further simplification is possible in 
the case when C is locally small, so that we can identify sieves with subfunctors of 
representable functors: in this case, a compatible family (ss | f € R) of elements 
of A is exactly (the image of) a natural transformation R — A. Bearing in mind 
the Yoneda Lemma, which says that elements of A(U) correspond to natural 
transformations C (—,U) — A, we thus obtain 


Lemma 2.1.4 LetC be a locally small category, and R a sieve on an object U 
of C. Then a functor A: C°P — Set satisfies the sheaf axiom (resp. is separated) 
for R iff each natural transformation R — A factors uniquely (resp. in at most 
one way) through the inclusion R — C (—, U). o 


One further advantage of restricting our attention to sifted coverages is that 
the statement of axiom (C) can be simplified. Given a sieve R on U and a 
morphism f: V — U we define f*(R) to be the family (which is clearly a sieve 
on V) of all morphisms g with codomain V such that fg € R. (In the case 
when C is locally small and we identify sieves with subfunctors of representable 
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functors, this is simply the pullback 


F(R) ——> R 


C(-,V) —> C (—,U) 


where the bottom edge of the square is the natural transformation induced by f.) 
Then condition (C), for a sifted coverage T, is equivalent to saying that if R € T 
then f*(R) contains a sieve in T for every morphism f whose codomain matches 
that of R. 

However, there are also disadvantages in passing from arbitrary covering fam- 
ilies to sieves. The principal one is that, in many particular cases (see the list 
of examples of sites in A2.1.11), the covering families may be easier to specify 
explicitly than the sieves which they generate (for example, the generating fam- 
ilies may be finite); and the verification that particular functors are sheaves also 
tends to be easier when we work with the generating families, at least in the case 
when the underlying category C has pullbacks. For this reason, we shall state 
the next three lemmas in two different versions, one for generating families and 
the other for sieves. (They also have alternative versions for separated functors 
rather than sheaves, but we shall leave the reader to supply these.) 


Lemma 2.1.5 Let U be an object of C, and A: C°P — Set a functor. 
(i) A satisfies the sheaf axiom for the singleton family (1u). 


(ii) A satisfies the sheaf aziom for the sieve My of all morphisms with 
codomain U. 


Proof The two statements are equivalent, since the family (ly) generates the 
sieve My. And (i) is immediate from Definition 2.1.2. o 


Lemma 2.1.6 Let A: C°? — Set be a sheaf for a coverage T on C. 


(i) Let (fi: U; — U |i € I) be a T-covering family, and (gj: V; >U |j €J) 
a family of morphisms with the same codomain, such that every fi factors 
through at least one gj. Then A satisfies the sheaf axiom for (g; |j € J). 


(ii) If T is a sifted coverage, then A satisfies the sheaf axiom for any sieve 
which contains a T’-covering sieve. 


Proof It suffices to prove (i), since (ii) is a direct translation of it into ‘sifted’ 
language. For (i), let (t; € A(V;) | 7 € J) be a compatible family with respect 
to the g;. For each i € J, we can choose a factorization fi = gjah of fi 
through some gj, and define s; = A(h;)(tj(i)) € A(Ui). The compatibility of the 
t; ensures that s; is independent of the choice of j(i) and hi, and that the s; 
form a compatible family with respect to the f;; so there is a unique s € A(U) 
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such that A(f;)(s) = s; for each i € J. This is clearly the unique element of A(U) 
which could possibly satisfy A(g;)(s) = tj for all j € J; to see that it actually 
does so (for a particular j), we need to observe that if k: W — V; is a morphism 
such that g;k factors through some f; (say g;k = fil), then 


A(k)A(95)(s) = ADAC) (s) = AC) (si) = AM Alhi) (tja) = Alk) (ti) 


(where the last step uses the compatibility of the t;). But by condition (C) 
there is a T-covering family of morphisms k with this property, and so the result 
follows since A is a T-sheaf. o 


Lemma 2.1.7 Let A be a functor C°P — Set. 


(i) Let (fi: U; — U |i € I) be a family for which A satisfies the sheaf axiom, 
and suppose, for each i € I, there is a family (hij: Ui; > U; | j € Ji) for 
which A also satisfies the sheaf axiom. Then A satisfies the sheaf axiom for 
the family of all composites (fihi; |i € 1,7 € Ji). 

(ii) Suppose T is a sifted coverage for which A is a sheaf. Let R be a T-covering 
sieve on an object U of C, and S a sieve on U such that f*(S) is a T- 
covering sieve ondom f for each f € R. Then A satisfies the sheaf axiom 


for S. 


Proof (i) Suppose given a compatible family (s;; € A(Ui;) |i € I,j € Ji). 
For each fixed i € I, the family (sij | j € Ji) is compatible with respect to the 
family (hi; | J € Ji), and so defines a unique element s; € A(U;) such that 
A(hi;)(si) = sij for all j € Ji. Further, the compatibility of the s;; ensures that 
(si | i € I) is a compatible family relative to the f;. Since A satisfies the sheaf 
axiom for the f,, there is a unique s € A(U) satisfying A(f;)(s) = s; for all i. 

(ii) The second version of the statement is not quite a simple translation of 
the first into the language of sieves, since S may be strictly larger than the sieve 
of all composites fh with f € R and h € f*(S) — there may be morphisms in 
S which do not factor through any member of R. But Lemma 2.1.6(ii) provides 
the extra information we need. 


The next definition, like the last three lemmas, could be given separately in 
‘unsifted’ and ‘sifted’ versions, but from now on we shall leave to the reader the 
task of formulating the former. 


Definition 2.1.8 We define a Grothendieck coverage on a category C to be 
a function assigning to each object U of C a collection T(U) of sieves on U, 
satisfying the closure property (C) of 2.1.1 and also 


(M) For any object U, the maximal sieve My is in T(U). 
(L) If R € T(U) and S is another sieve on U such that, for each f € R, the 
sieve f*(S) belongs to T(dom f), then S also belongs to T(U). 


C2.1 SITES AND COVERAGES 541 


The condition (L) is sometimes referred to as the local character condition. 
We note that, if a sifted coverage T satisfies both (L) and (M), then any sieve 
which contains a T’-covering sieve must be T-covering; for if R C S, then f*(S) = 
Maom f for each f € R. Hence, in the presence of (L) and (M), (C) may be 
replaced by the simpler condition 


(C’) If R € T(U) and g is any morphism with codomain U, then g*(R) € 
T(dom g). 


(This is the form in which the definition of ‘Grothendieck topology’ is usually 
presented.) One further closure property which we could have demanded is that 
T(U) should be closed under finite intersections (and so form a filter in the lattice 
of subfunctors of C (—,U)); the fact that the sieves for which a given A satisfies 
the sheaf axiom are closed under finite intersections can be proved by similar 
means to the last three lemmas. However, there is no need to do so explicitly, 
since this closure property follows from (L) and (C’): for any f € R, we have 
F (RA S) = f*(S). 


From Lemmas 2.1.5, 2.1.6 and 2.1.7, we immediately deduce 


Proposition 2.1.9 For any coverage T on a category C, there is a 
Grothendieck coverage T having the same sheaves as T. 


Proof First replace T by a sifted coverage T as in 2.1.3, and then take T to be 
the intersection of all Grothendieck coverages which contain T. It is clear from 
the nature of the closure conditions (L) and (M) that T is itself a Grothendieck 
coverage. Moreover, if A is any T-sheaf then it is also a T-sheaf by 2.1.3; and so it 
follows from the last three lemmas that the collection of all sieves R such that A 
satisfies the sheaf axiom for every f*(R) (as f ranges over all morphisms with the 
same codomain as the members of R) is a Grothendieck coverage containing T. 
So A is a sheaf for T. The converse is immediate. o 


In the case when C is small, T can also be constructed from T by a transfinite 
iteration: first adjoin all the maximal sieves M4 to T, and then repeatedly 
adjoin those sieves whose presence is required by (L), until the process converges. 
However, if C is large, there may be a proper class of distinct sieves on a given 
object of C, and so there is no guarantee that this process will converge. 

For the rest of this and the next two sections, we shall assume that all cov- 
erages are Grothendieck coverages, unless explicitly stated otherwise. We shall 
also tend to denote Grothendieck coverages by letters like J and K rather than 
T: in addition to drawing attention to the fact that a particular coverage is a 
Grothendieck coverage, this notation is consistent with that which we used in 
Sections A4.4 and A4.5, in the following sense. 


Proposition 2.1.10 LetC be a small category, and J a function assigning to 
each object U of C a set J(U) of sieves on U. Then J satisfies condition (C’) 
iff it is a subfunctor of the subobject classifier Q of [C°?, Set]. Moreover, if this 
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condition is satisfied, then J is a Grothendieck coverage iff the classifying map 
3:20 of J > Q is a local operator in the sense of A4.4.1. 


Proof The first assertion is immediate from the definition of Q in A1.6.6. 
Given this, the natural transformation 7: Q — Q corresponding to J is easily 
seen to be defined by 


ju(R) ={f:V > U | f*Re J(V)}, 


so it is clear that J satisfies (M) iff ly € ju(Mv) for all U, iff jT = T. If 
condition (L) also holds, then we have jy(R) € J(U) iff R € J(U), so that 
jj = j, and from the fact, noted above, that the set of J-covering sieves is closed 
under finite intersections we similarly deduce that j satisfies the third condition 
of A4.4.1. Conversely, if j satisfies the second and third conditions of A4.4.1, 
and R and S are two sieves on U satisfying the hypotheses of (L), then we have 
RC ju(S), whence ju (S) = juju(S) 2 ju(R) = Mu, so S € J(U). g 


We may now deduce (at least for small categories) the result that a 
Grothendieck coverage is determined by its sheaves; this is sometimes known 
as the ‘little Giraud theorem’. 


Corollary 2.1.11 For a small category C, the assignment J +> Sh(C, J) is a 
bijection from the set of Grothendieck coverages on C to the class of reflective 
subcategories of [C°P, Set] with cartesian reflector. 


Proof In A4.4.8 we established a bijection between such subcategories and 
local operators on [C°?, Set]. So the only remaining ingredient is to show that if 
J and j are related as in 2.1.10 then sh; ([C°P, Set]) coincides with Sh(C, J). But 
this was essentially done in A4.3.5: for the operator d on subobjects defined there 
is easily seen to coincide with that obtained by composing classifying maps with 
j (and in particular it is always idempotent if J is a Grothendieck coverage). 

o 


We shall extend 2.1.11 to non-small categories in 2.2.5 below. 

We saw in the proof of 2.1.9 that, for a given functor A: C°P — Set, there is 
a unique largest sifted coverage for which A is a sheaf, and it is a Grothendieck 
coverage. By intersecting such coverages, we may construct the largest coverage 
for which each of a given family of functors is a sheaf. In particular, when C 
is locally small, we define the canonical coverage on C to be the largest sifted 
coverage for which all representable functors are sheaves. 

We say a sieve R on U is effecttve-epimorphic if it forms a colimit. cone under 
the (large!) diagram consisting of the domains of all the morphisms in R, and 
all the morphisms over U between them. And we say R is universally effective- 
epimorphic if f*(R) is effective-epimorphic for every f with codomain U. It is 
easy to see that a sieve is effective-epimorphic iff every representable functor 
satisfies the sheaf axiom for it; hence the universally effective-epimorphic sieves 
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are exactly the covering sieves for the canonical coverage. We say a coverage is 
subcanonical if all its covering sieves are effective-epimorphic, i.e. if it is contained 
in the canonical coverage; we say a site (C,J) is standard if C is small and 
cartesian, and J is subcanonical. We shall also call a coverage separated if all its 
covering sieves are epimorphic families; clearly, this is equivalent to saying that 
all the representable functors are separated in the sense of 2.1.2. 

We shall see in the next section that there is no loss of generality (at least 
if we are interested not in the sites themselves, but only in their categories of 
sheaves) in restricting our attention to standard sites. 


Examples 2.1.12 (a) The empty sieve on an object U of C is effective- 
epimorphic iff U is an initial object; it is universally effective-epimorphic iff 
U is a strict initial object (cf. A1.4.1). (And it is merely epimorphic iff U is a 
quasi-initial object as defined before A1.5.14.) 

(b) Let R be the sieve generated by a single morphism f: V — U. Then R is 
effective-epimorphic iff any morphism g: V — W which satisfies gh = gk when- 
ever fh = fk factors uniquely through f. If f has a kernel-pair (a,b): K 3 V, 
then g satisfies the above condition iff ga = gb; so R is effective-epimorphic iff f 
is a coequalizer of its kernel-pair, i.e. iff f is regular epic. If C has pullbacks, then 
R is universally effective-epimorphic iff every pullback of f is regular epic; in 
particular, we deduce that in a regular category, the sieve generated by a single 
morphism f is universally effective-epimorphic iff f is a cover (a fact which we 
used in Example A2.1.11(a)). If C is a regular category, we define the regular cov- 
erage on C to consist of all sieves which contain a cover (in the sense of A1.3.2), 
it is easy to see that this is in fact a Grothendieck coverage, and if C is small 
then the resulting site is standard. 

(c) We saw in A2.1.11(A) that the collection of all singleton families of mor- 
phisms defines a coverage on C iff C satisfies the ‘right Ore condition’ that any 
pair of morphisms with common codomain may be embedded in a commuta- 
tive square. The Grothendieck coverage generated by this coverage as in 2.1.9 
clearly consists of all inhabited (= nonempty) sieves on objects of C. Can it be 
the canonical coverage? Clearly, this will be the case if every sieve generated 
by a singleton is effective-epimorphic, and the empty sieve is not universally 
effective-epimorphic on any object. So, combining (a) and (b) above, we deduce 
that if C is a category satisfying the right Ore condition, in which every mor- 
phism is regular epic and no object is strict initial, then the canonical coverage 
on C consists of all inhabited sieves. Examples of such categories are the category 
of (finite) sets and epimorphisms, and the dual of the category of (finite) sets 
and monomorphisms; we shall meet others in 3.5.9 below. 

(d) Let C be a coherent category, and let (fi: V; ~ U|1<i<7n) bea finite 
covering family, i.e. one such that the union of the images U; — U of the f; is the 
whole of U. Then the sieve R generated by the f; is effective-epimorphic (and 
hence universally effective-epimorphic, since finite covering families are stable 
under pullback in C); for if (gi: Vi > W | 1 < i < n) is a compatible family of 
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elements of C (—,W) relative to the fi, then each g; factors through V; —> U; 
(since it coequalizes the kernel-pair of f;) and the induced morphisms U; > W 
can be combined, by repeated use of A1.4.3, into a unique morphism U — W. 
(Once again, this argument was sketched before in A2.1.11(b).) We define the 
coherent coverage on a coherent category C to consist of all sieves which contain 
a finite covering family; by the remarks above, this coverage is subcanonical. If 
C is a pretopos, then finite covering families are the same thing as finite (jointly) 
epimorphic families, by A1.4.9; the coherent coverage on a pretopos is sometimes 
called the pre-canonical coverage. 

(e) Similarly, if C is a geometric category (resp. an oo-pretopos), we may 
show that sieves generated by small covering families (resp. small epimorphic 
families) are universally effective-epimorphic. There is a converse result if C has 
a separating set of objects, as defined in Section A1.2: for then any epimorphic 
sieve (in particular, any effective-epimorphic sieve) on an object U must contain 
an epimorphic family of morphisms V — U whose domains are all in the separat- 
ing set, and there is only a set of such morphisms since a geometric category is 
locally small. Thus in an co-pretopos with a separating set of objects, the canon- 
ical coverage consists precisely of those sieves which contain small epimorphic 
families. This result will be of considerable importance in the next section. 


Before leaving this section, we give an example to emphasize the difference 
already mentioned between condition (C) and the other closure conditions that 
we have introduced in the course of the section: if we omit. the former from 
our definition of a coverage, we arrive at a notion of ‘category of sheaves’ very 
different from that which we wish to study. For simplicity, we formulate the 
definitions in this example in terms of posets, but they can in fact be considered 
for more general categories; cf. [540]. 


Example 2.1.13 Given a poset P, we define an interval in P to be a 
(nonempty) subset of the form 


[a,b] = {re Pla<x<b} 


with a < bin P. We write Int(P) for the set of all intervals in P, ordered by inclu- 
sion. Int is a functor Poset — Poset: given an order-preserving map f: P > Q, 
we define Int(f)([a, b]) = [f (a), f(b)]. We say that f: P — Q is an interval fibra- 
tion if, for every a < b in P, the restriction of f to [a,b] is an order-isomorphism 
[a,b] > [f(a), f(d)]; equivalently if, given u € Q with f(a) < u < f(b), there 
exists a unique c € P witha < c < band f(c) = u. It is not hard to verify that f is 
an interval fibration if and only if Int(f): Int(P) — Int(Q) is a discrete fibration 
in the sense of B1.3.11; that is, iff it corresponds to a functor Int(Q)°? — Set via 
the Grothendieck construction of A1.1.7. Moreover, if f: P — Q and f’: P’ > Q 
are both interval fibrations, then order-preserving maps P — P’ over Q (are nec- 
essarily interval fibrations, and) correspond bijectively to order-preserving maps 
Int(P) — Int(P’) over Int(Q). Thus the category IFib/Q of interval fibrations 
over Q is equivalent to a full subcategory of [Int(Q)°P, Set]. 
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An arbitrary discrete fibration d: E — Int(Q) need not be of the form Int(f). 
However, if it is, then we may clearly reconstruct the domain P of f by taking 
its underlying set to be 


{a € E | d(a) is a singleton }, 


with a < b in P iff there exists x € E with d(x) = [f(a), f(b)] and such that 
a <2,b < x both hold in E. In general, this definition yields a relation on P 
which is reflexive and antisymmetric, but not necessarily transitive; for transi- 
tivity, what we need is precisely that the functor Int(Q)°? — Set corresponding 
to d should satisfy the sheaf axiom for the family 


(lu, v] S [u, w] , [v, w] S [u, w) 


whenever u < v < w in Q. Thus we may conclude that IFib/Q is equivalent to 
‘Sh(Int(Q), 7)’, where T is the set of all such families. 

Now let Q be the four-element poset {0,u,v,1} withO<u<10<v<1 
but u and v incomparable. It is clear that, for this Q, the collection of covers T 
fails to satisfy condition (C): there is no T-covering family on |v, v] which factors 
through the covering ({0, u] € (0, 1], [u, 1] C [0, 1]) of [0, 1]. Moreover, in this case 
IFib/Q is very far from being a topos. To see this, note that monomorphisms in 
IFib/Q are just injective order-preserving maps, and so the subterminal objects 
of IFib/Q may be identified with those sub-posets of Q for which the inclusion is 
an interval fibration. Among these, we have the subsets {u}, {0, u}, {0, u,v} and 
{u, 1} as well as Q itself; but {0, u, 1} is not a subterminal object, and so the join 
of {0, u} and {u, 1} in Subypi4/Q(1) is the whole of Q. Thus the five subterminal 
objects mentioned above form a non-modular (and in particular non-distributive) 
lattice. It now follows from A1.4.2 that IFib/Q is not a coherent category, from 
A1.5.10 that it is not cartesian closed, and from A1.6.3 that it does not have 
a subobject classifier. (In fact it is not hard to show that IFib/Q is not even 
regular.) 


Suggestions for further reading: Artin et al. [36], Bunge & Niefield [204], 
Johnstone [540], Pedicchio & Vermeulen [946]. 


C2.2 The topos of sheaves 


In A2.1.10, we gave a sketch proof of the fact that the category of sheaves on 
an arbitrary small site is a topos. (A more detailed proof can be extracted from 
the results of Sections A4.3 and A4.4, in particular A4.3.5 and A4.4.5.) For 
technical reasons which will become apparent shortly, we now wish to prove 
a more general version of this result, in which the hypothesis of smallness is 
relaxed slightly. In order to do this, we shall first need to establish a result (the 
Comparison Lemma) of considerable importance in its own right, which provides 
a criterion for determining when two sites give rise to equivalent categories of 
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sheaves. We shall assume in this section that all coverages are Grothendieck 
coverages, unless stated otherwise. 


Definition 2.2.1 Let (C, J) be a site. We say a subcategory D of C is J-dense 
(or simply dense, if J is obvious from the context) if 


(i) every object U of C has a covering sieve R € J(U) generated by morphisms 
whose domains are in D (equivalently, the sieve generated by all morphisms 
V — U with V € ob D is J-covering); and 


(ii) for any morphism f: U — V in C with V € ob D, there is a covering sieve 
R € J(U) generated by morphisms g: W — U for which the composite fg is 
in D (equivalently, the family of all such morphisms generates a J-covering 
sieve). 


It is easy to see that, in the presence of condition (i) of 2.2.1, condition 
(ii) may be reduced to the special case in which the domain U as well as the 
codomain V of f is in D; hence if D is a full subcategory condition (ii) may be 
omitted altogether. (On the other hand, given condition (ii) as we have stated it, 
condition (i) becomes redundant provided every object of C admits a morphism 
to some object of D — for example if D contains a terminal object of C.) In 
practice, the Comparison Lemma is most often used for full subcategories, and 
many texts only define denseness in this case; however, the extra generality 
afforded by the definition we have given is occasionally useful — we shall see an 
instance of its use in 5.2.5 below. 

Given any subcategory D of C and a coverage J on C, we define the induced 
coverage Jp on D by setting Jp(V) to be the collection of all sieves RM mor D, 
where R € J(V). It is straightforward to verify that Jp satisfies the closure 
conditions (C), (Z) and (M) if J does. 


Lemma 2.2.2 Let (C,J) be a site, and let D be a subcategory of C satisfying 
condition (ii) of 2.2.1. Then 


(i) A sieve S on an object V of D is Jp-covering iff the sieve S in C generated 
by the members of S is J-covering. 


(ii) For any J-sheaf A on C, the restriction of A to D is a Jp-sheaf. 


Proof (i) The right-to-left implication is clear, since SM mor D = S. For the 
converse, suppose that S = RM mor D for some J-covering sieve R. Then, for 
any f: U — V € R, f*(S) contains all morphisms g: W — U with fg € mor D, 
and is therefore J-covering; so S is J-covering by (L). 

(ii) Suppose S € Jp(V), and let (sf € A(dom f) | f € S) be a compatible 
family of elements for S. By (i), it is sufficient to show that (sp | f € S) extends 
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to a compatible family for S, i.e. that if we have a commutative diagram in C 


WS 


92 fi 
fr 


Ug ———-> V 
with fi, fo E S then A(g1)(ss,) = A(g2)(sy,). But W can be J-covered by 
morphisms h: X — W such that the composites gh and gzh both lie in D, so 
that the images of the two elements above under A(h) both coincide with $f, 9,n- 
Since A is a J-sheaf, it follows that the two elements are equal. 


Thus, under the hypotheses of Lemma 2.2.2, the functor [C°P, Set] — 
[DOP Set] which sends A to its restriction to D itself restricts to a functor 
Sh(C,J) — Sh(D, Jp). To get a functor in the opposite direction, we shall 
need to introduce some smallness hypotheses. 


Theorem 2.2.3 (Comparison Lemma) Let (C,J) be a site such that C is 
locally small, and let D be a small dense subcategory of C. Then the restriction 
functor Sh(C, J) — Sh(D, Jp) just defined is (one half of) an equivalence of 
categories. 


Proof Let B: D°P — Set be a functor. For each object U of C, let (DLU) 
denote the category whose objects are morphisms f: V — U inC with V € ob D, 
and whose morphisms f — f’ are morphisms g: dom f — dom f’ in D such that 
f'g = f. The hypotheses on C and D ensure that (D | U) is a small category, and 
hence we can form the limit A(U) of the composite 


(D|U)°? ——> D? eos Set, 


where the first factor is the forgetful functor f + dom f. The assignment U +> 
A(U) becomes a functor C°P — Set in an obvious way: a morphism U — U’ 
induces a functor (D]U) — (D|U’) over D, and hence a comparison map 
between the two limits. Similarly, the assignment B + A itself becomes a functor 
[D°P , Set] — [C°P, Set]. (Of course, this is just the usual construction of the right 
Kan extension of B along the inclusion functor D°P — C°P.) 

Next, we note that the denseness of D ensures that any sheaf A on C is 
isomorphic to the Kan extension of its restriction to D. For if we are given 
an element of the above limit (with B taken to be the restriction of A), i.e. a 
family of elements (ss € B(dom f) | f € ob (D|U)) satisfying the appropriate 
compatibility conditions, then an argument like that in the proof of 2.2.2(ii) 
ensures that this family extends to a compatible family for the sieve (in C) 
generated by all morphisms from objects of D to U, and hence that there is a 
unique s € A(U) with A(f)(s) = sy for all f. 
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Similarly, any Jp-sheaf B on D is isomorphic to the restriction of its right Kan 
extension. If D is a full subcategory of C this is trivial, since ly is a terminal 
object of (D| V) for every V € ob D. In general, if we are given an element 
(sf € B(W) | (f: W — V) € ob (D| V)) of A(V) where V € ob D, then for each 
f the morphisms g: X — W in D for which fg € mor D generate a Jp-covering 
sieve, so that s¢ is uniquely determined by the sy, for all such g — and these are 
in turn determined by s,,,. So we have a canonical bijection from B(V) to A(V). 

Thus, in view of Lemma 2.2.2, it remains only to prove that the right Kan 
extension of a Jp-sheaf on D is indeed a J-sheaf on C. For this, we can conve- 
niently use the characterization of sheaves given by 2.1.4: suppose A is the right 
Kan extension of B, and we are given R € J(U) and a natural transformation 
a: R — A. We have to produce an element of A(U); but by definition this con- 
sists of a compatible family of elements (xf € B(V) | f: V > U,V € ob D). For 
each such f, the sieve f*(R) N mor D is Jp-covering on V, and a defines (by 
restriction) a compatible family of elements of B relative to this cover; so we 
use the Jp-sheaf axiom for B to construct the required element xp. The rest is 
straightforward verification. g 


We shall say a site (C, J) is essentially small if C is locally small and has a 
small J-dense subcategory. We note that, although we required the subcategory 
D to be small in the statement of 2.2.3, the result has an immediate extension 
to the case where D (with the induced coverage Jp) is an essentially small site: 
for if D’ is a small Jp-dense subcategory of such a D, it is easy to see that D’ is 
also J-dense in C, and hence both Sh(C) and Sh(D) are equivalent to Sh(D’). 

It is also possible to formulate a still more general version of the Comparison 
Lemma, for functors F: D — C between the underlying categories of sites which 
are not even faithful; but we shall not need this extra generality. 


Examples 2.2.4 (a) Let X be a locale, and consider the category LH/X, 
equipped with its canonical coverage J. By the proof of 1.3.11, any object of 
LH/X is a colimit of a diagram whose vertices are open inclusions U — X, 
and these colimits are stable under pullback; so the full subcategory (isomorphic 
to O(X)) whose objects are these inclusions is J-dense in LH/X. Moreover, 
the induced coverage Jox) on O(X) is readily seen to be the (canonical) cov- 
erage used in the original definition of a sheaf on a locale; so we deduce that 
Sh(LH/X, J) is equivalent to Sh(X) (and hence, by 1.3.11, to LH/X itself). 
This example is a special case of a general phenomenon which we shall observe 
in 2.2.7 below. 

(b) More generally, by a basis for a locale X we mean a subset B of O(X) 
which generates it as a complete join-semilattice, i.e. such that every element of 
O(X) is expressible as a join of members of B. Then the inclusion B — O(X) 
satisfies the conditions of 2.2.1 with respect to the canonical coverage J on O(X), 
so we may also regard Sh(X), up to equivalence, as Sh(B, Jg). For example, if 
O(X) = IA is the frame of ideals of a distributive lattice A (cf. 1.1.16(e)), and 
B is the subset of principal ideals (isomorphic as a poset to A), then the induced 
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coverage Jg is simply the coherent coverage, since any covering of a principal 
ideal has a finite subcover. Thus we obtain the result that the topos of sheaves 
on a distributive lattice A for tts coherent coverage is equivalent to the topos of 
sheaves on the locale corresponding to the frame of ideals of A. 

(c) Let G be a topological group, and consider the topos Cont(G) of contin- 
uous G-sets, defined in A2.1.6, as a site equipped with its canonical coverage C. 
It is easy to see that the full subcategory consisting of the transitive G-sets is 
dense, since any continuous G-set is the disjoint union of its G-orbits; so we may 
make the transitive continuous G-sets into the objects of a site (C, J) such that 
Sh(C, J) ~ Sh(Cont(G), C). (As with example (a) above, we shall see in 2.2.7 
below that the latter is equivalent to Cont(G) itself.) 

Clearly, any transitive continuous G-set is isomorphic to the set G : H of 
left cosets of some open subgroup H of G, with G acting by left translation; 
so we may take these G-sets to be the objects of our category C. Alternatively, 
we can take the objects to be the open subgroups themselves, with morphisms 
H — K labelled by left cosets gK of K such that H C gKg™! — note that this 
makes sense, since the conjugate gKg~! depends only on the coset gK, and not 
on the choice of representative g. (The idea is that gK: H — K corresponds to 
the unique G-equivariant map G : H — G : K sending H to the coset gK.) The 
composite of 

gk hL 

H ——+> K ——> L 

is ghL; again, we need to check that this is well-defined, but the inclusion K C 
hLh—, or equivalently h~! Kh C L, implies that KhL = hL, and hence ghL = 
gKhL depends only on gK and hL. And since every morphism between transitive 
G-sets is epic in Cont(G), the coverage J on this category has the property that 
every inhabited sieve is covering (cf. 2.1.12(c)). We leave to the reader the task 
of verifying that C satisfies the right Ore condition. 

As in (b), we may replace the set of all open subgroups in this example by a 
basis for the open subgroups, i.e. a family B of open subgroups such that every 
open subgroup contains a member of B. And in 5.3.9 below we shall see that this 
site can still be used in the case of a non-spatial localic group G (that is, a group 
object in Loc); we already knew that the continuous actions of such a group 
formed a topos, by B3.4.14(b), but we did not previously have a description of 
it as a category of sheaves on a site. (Nontrivial examples of localic groups with 
no points other then the identity element do exist: for a simple one, revisit the 
non-spatial locale constructed as an inverse limit in 1.2.9, and replace the sets 
X; appearing in the inverse system by the free groups which they generate.) 

(d) Let C be any locally small category, D a small full subcategory of C, 
and define a coverage J on C by saying that a sieve R on U is J-covering iff it 
contains all the morphisms from objects of D to U. It is easily verified that this 
is indeed a (Grothendieck) coverage, that D is a J-dense subcategory, and that 
the induced coverage Jp is the trivial one whose only covers are the maximal 
sieves My, V € ob D. Thus 2.2.3 yields an equivalence Sh(C, J) ~ [D°?, Set]. 
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In the case when C is also small, this equivalence identifies the geometric inclusion 
Sh(C, J) — [C°P, Set] with that induced as in A4.2.12(b) by the full embedding 
DP — CP, (We shall return to this example in 2.2.19 below.) 

(e) Let (C, J) be a site, and let C’ be the full subcategory on those objects 
of C which are not J-covered by the empty sieve. Then it is clear that C’ is 
J-dense in C. Moreover, it follows easily from (L) that the induced coverage J’ 
on C’ has all its sieves nonempty; so (provided we assume classical logic in our 
base topos Set — a point to which we shall return in Section C3.1 below) we 
see that every category of sheaves on a site is equivalent to one on a site where 
every covering sieve 1s nonempty. This condition on a site (C, J) correponds to 
saying that Sh(C, J) is a dense subtopos of [C°?, Set], i.e. that the initial object 
of [C°P, Set] (the constant functor with value 0) is a sheaf; so we can regard the 
italicized statement above as an analogue for toposes of the fact, noted in 1.2.7, 
that any locale is a dense sublocale of a space. 


Remark 2.2.5 We may now prove, as promised in the last section, that if J 
and K are two Grothendieck coverages on a category C such that the sites (C, J) 
and (C, K) are both essentially small, and Sh(C, J) coincides with Sh(C, K) as a 
subcategory of [C°P, Set], then J = K. For we may find a small (full) subcategory 
D of C which is dense for both J and K; then Sh(D, Jp) and Sh(D, Kp) coincide 
as subcategories of [D°P, Set], and so Jp = Kp by 2.1.11. But, given that J 
satisfies (L), it is easily seen that we may recover it from Jp, as follows: a sieve 
R on an object U of C is J-covering iff, for each f: V — U with V € ob D, the 
sieve f*RN mor D is Jp-covering. Hence Jp = Kp implies J = K. 


We next recall 


Proposition 2.2.6 For any small site (C, J), Sh(C, J) is a topos, and has all 
small limits and colimits. Moreover, it is reflective in [C°P, Set]. 


Proof As we remarked earlier in this section, we have already proved this in 
Part A, but we shall repeat the proof here, expanding on the brief outline given 
in A2.1.10. 

The first step, the construction of small limits in Sh(C, J), does not in fact 
require any smallness assumption; it is immediate from the definition that the 
class of functors which satisfy the sheaf axiom for a given family is closed under 
all (pointwise) limits in [C°P, Set], and hence that it has all small limits. 

From now on, we assume that C is small. In A1.5.5, we saw that [C°P, Set] is 
cartesian closed; we shall show that Sh(C, J) is an exponential ideal in [C°?, Set], 
and hence cartesian closed in its own right. Suppose given a J-sheaf A, an arbi- 
trary functor B: C°P — Set, and a natural transformation 6: R — A? where 
R e J(U) for some U € ob C. Then extending 0 to a natural transforma- 
tion ¢:C (—,U) — AP is equivalent to extending 0: Rx B — A to a natural 
transformation ¢: C (—,U) x B — A; we shall show that the latter is uniquely 
possible. Given any f: V — U and any x € B(V), the sieve f* R is J-covering, 
and the pairs ((fg, B(g)(x)) | g € f* R) form a compatible family of elements of 
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R x B, so their images under @ form a compatible family of elements of A. We 
define ġy (f, £) to be the element of A(V) obtained by applying the sheaf axiom 
to this family; it is easy to check that ¢ is natural, and that it is the unique 
natural transformation extending 0. 

We saw in A1.6.6 that the subobject classifier Q in [C°P, Set] is given by 
Q(U) = {sieves on U} with Q(f)(R) = f*(R). We define a sieve R € Q(U) to 
be J-closed if every f: V — U such that f*(R) € J(V) belongs to R, i.e. if 
f*(R) € J(V) implies f*(R) = My. It is clear from the definition that the 
assignment U ++ {J-closed sieves on U} defines a subfunctor Ny of Q; moreover, 
if A is a sheaf, then a subfunctor A’ of A is a sheaf iff, for any x € A(U), the 
sieve {f: V >U | A(f)(x) € A’(V)} is J-closed, i.e. iff the classifying map of 
A’ — A factors through 0; — Q. So, provided we can show that Q; itself is 
a J-sheaf, it will serve as a subobject classifier for Sh(C, J). But, if R € J(U) 
and we are given a compatible family (Sf | f € R) of J-closed sieves on the 
domains of the members of R, then there is a unique closed sieve S on U with 
f*(S) = Sp for all f, namely the sieve of all g: V — U such that Sys = Mw for 
all f: W — V € g*(R). 

The existence of all small colimits in Sh(C, J) may be shown either by appeal- 
ing to the Monadicity Theorem (A2.2.7) or by showing that Sh(C, J) is reflective 
in [C°P, Set]. We did the latter in A4.4.4, but we shall give a different construc- 
tion of the reflection here. Given a functor A: C°P — Set, we define At(U) to 
be the colimit, over the ordered set J(U) of J-covering sieves on U, of the sets 
of morphisms R — A; equivalently, an element of A+(U) is an equivalence class 
of pairs (R, s) where R € J(U) and s = (sf € A(dom f) | f € R) is a compatible 
family of elements of A relative to R, the equivalence relation identifying (R, s) 
and (R’, s’) iff there is a J-covering sieve R” C RN R' on which the restrictions 
of s and s’ agree. Given f: V — U in C, it is easy to see that we obtain a well- 
defined mapping At(U) — At(V) sending the equivalence class of (R, s) to that 
of (f*(R), (sfg | g € f*(R))); thus At becomes a functor C°P — Set. Moreover, 
the assignment A+» A? is itself functorial in an obvious way, and we have a 
natural transformation 74: A — At sending s € A(U) to the equivalence class 
of (Mu, (A(f)(s) | f € My)), which is an isomorphism iff A is a J-sheaf. Hence 
every morphism from a general A to a sheaf factors uniquely through na. 

For a general A, At need not be a sheaf, but it is at least J-separated. To see 
this, let (R, s) and (R’,s’) represent two elements of At(U) which agree when 
restricted to the domain of every morphism in a third J-covering sieve R” on U. 
Thus, for every f: V — U in R”, we have a J-covering sieve Sy on V, contained 
in f*(R)N f*(R'), such that sfg = sp, for every g € Sp. But then, by (L), the 
sieve consisting of all the composites fg, as f ranges over R” and g over Sf, is 
J-covering on U; so (R, s) and (R’,s’) represent the same element of At (U). 

To complete the construction of the left adjoint, it suffices to show that if A 
is separated then At is a sheaf; for then we shall know that A** is a sheaf for 
any functor A, and every morphism from A to a sheaf factors uniquely through 
the composite 74+ o ņa: A— At — Att. So suppose A is separated; then we 
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see that if (R, s) and (R’, s’) represent the same element of A*(U), then s and s’ 
must agree on the whole of RNR’ (rather than on some covering sieve contained 
in RO FP’), since for any f € ROR’ the elements s+ and s} agree on a J-covering 
sieve. Thus each element of At(U) has a canonical representative, namely the 
union of all the families which represent it. So if we now have a covering sieve R on 
U and a compatible family of elements (tf € A+ (dom f) | f € R), the canonical 
representatives (Sf, ssf) of the tf can be combined into a single compatible family 
of elements of A indexed by the sieve {fg | f € R,g € Sy}, which is again 
J-covering by (L). In this way we obtain an element t of A*(U) such that 
At(f)(t) = tp for each f € R; so At satisfies the ‘existence’ as well as the 
‘uniqueness’ part of the definition of a sheaf. oO 


We recall that in Section A4.4 we not only constructed a left adjoint for the 
inclusion Sh(C, J) — {C°?, Set], but also proved that it is cartesian (i.e. preserves 
finite limits). In terms of the construction given in the proof of 2.2.6, the same 
result may be obtained by observing that the poset J(U) of J-covering sieves on 
U is codirected, since it has finite intersections; so the fact that filtered colimits 
and finite limits commute in Set implies that the functor A +> A?* is cartesian. 
As we remarked at the end of Section A4.4, there is also an ‘internal’ version of 
this construction of the associated sheaf functor, applicable to any local operator 
in an elementary topos; cf. [501]. 

We note that a topos of the form Sh(C, J), as well as having all small colimits, 
is locally small; hence by A4.1.9 it admits a geometric morphism to Set. It is 
also well-powered (since subobjects of an object A correspond bijectively to 
morphisms A — Q) and hence an oo-pretopos as defined in Section A1.4. In the 
converse direction, we have 


Proposition 2.2.7 Let C be an oo-pretopos with a separating set. Then the 
canonical coverage J makes (C,J) into an essentially small site, and every 
J-sheaf on C is representable. 


Proof By 2.1.12(e), we know that the canonical coverage consists of all sieves 
which contain small epimorphic families. But the definition of separating set 
says that, for any U € ob C, the family of all morphisms from objects in the 
separating set to U is (small and) epimorphic, so the full subcategory D on these 
objects is J-dense. Thus (C, J) is essentially small. 

Let l: C — Sh(C,J) be the factorization through Sh(C,J) of the Yoneda 
embedding U +» C(—,U). By the argument of A2.1.11(a) and (b), | preserves 
covers and small unions, i.e. it is a geometric functor. We must show that every 
object of Sh(C, J) is isomorphic to one in the image of l. Given a sheaf A, let 
S be the set of pairs (V, x) with V € ob D and x € A(V), and let W be the 
coproduct (in C) of the S-indexed family (V | (V,x) € S). From the proof of 
2.2.3, we know that any J-sheaf B is isomorphic to the right Kan extension of 
its restriction to D, and so an element y € B(U) (for any U € ob C) is uniquely 
determined by the family of elements (B(f)(y) | f: V => U,V € ob D). It follows 
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that two natural transformations A =3 B which agree at all objects of D must be 
equal, and hence the natural transformation (W) = [ [ev sjes I(V) — A whose 
(V,x)th component is (induced by) x is epimorphic in Sh(C, J). 

Now form the kernel-pair R — I(W) x l(W) of (W) > A. By the same 
argument, we can find an epimorphism /(X) —> R. But l (is full and) preserves 
images, so we have R = I(J), where J is the image of the corresponding pair 
X — W xW in C. And since epimorphisms in Sh(C,J) are regular, and l 
preserves coequalizers of equivalence relations, we now have A = 1(Q), where Q 
is the coequalizer of J = W in C. So the result is proved. oO 


Summarizing the results of this section so far, we now have the characteriza- 
tion of sheaf toposes known as Giraud’s Theorem [405]. 


Theorem 2.2.8 For a category E, the following are equivalent: 


(i) There exists an essentially small site (C, J) such that E ~ Sh(C, J). 
(ii) There exists a small site (C, J) such that E ~ Sh(C, J). 
(iii) There exists a standard site (C, J) such that E ~ Sh(C, J). 
(iv) E is a cocomplete, locally small topos with a separating set of objects. 


(v) E is a topos, and there exists a bounded geometric morphism E — Set 
(cf. B3.1.7). 


(vi) E ~ Map(A), where A is a tabular effective positive geometric allegory 
with a strong separator (cf. A3.4.11). 


(vii) E is an oo-pretopos with a separating set of objects. 


(viii) E€ is locally small and has a small subcategory which is dense for 
the canonical coverage, and every sheaf for the canonical coverage is 
representable. 


(ix) E is locally presentable (cf. D2.3.4), locally cartesian closed and balanced, 
and is effective as a regular category. 


Proof (i)> (ii) follows from 2.2.3, and (iii) > (ii) is obvious. (ii) > (iv) was all 
proved in 2.2.6, except for the separating set of objects; but it is well known that 
the hom-functors C (—,U),U € ob C, form a separating set for (C°P, Set] if C is 
small, and since Sh(C, J) is reflective in (C°P , Set] their images under the reflector 
form a separating set for Sh(C, J). Now (iii) > (ix) because D2.3.7 shows that 
any category satisfying (iii) is locally presentable, and the other hypotheses of 
(ix) are satisfied by any topos. The equivalence of (iv) and (v) was established in 
B3.1.8(b) (cf. also A4.1.9), and that of (v) and (vi) in B3.1.9. (iv) > (vii) because 
a locally small topos is well-powered, and a cocomplete well-powered topos is 
an oo-pretopos. Similarly, (ix) > (vii) because the infinitary analogue of A1.5.13 
shows that E is a geometric category (it is well-powered by A1.4.17) and hence 
an oo-pretopos by the infinitary analogue of A1.5.14, and it has a generating 
set by the definition of local presentability. (vii) > (viii) by 2.2.7 and the fact 
(which follows from 2.1.12(e)) that any separating set for an oo-pretopos defines 
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a dense full subcategory for the canonical coverage. Assuming (viii), let J be the 
canonical coverage on E; then, since the Yoneda embedding l: E — Sh(€, J) is 
full and faithful, (viii) says that l is a weak equivalence, and so (with choice) we 
have an equivalence E ~ Sh(€, J). Hence (i) holds. 

It remains to show that (iii) is implied by the other conditions. First, we note 
that if D is a dense subcategory then so is the full subcategory D of € on the 
same objects; and the induced coverage Jp on the latter will be subcanonical, 
since all the functors D(—,V) (V € ob D) are sheaves for it. Moreover, D is 
small if D is, since E is locally small; and we may further close it under finite 
limits in € without destroying its smallness (and clearly without destroying its 
denseness either). o 


Definition 2.2.9 A category satisfying the equivalent conditions of 2.2.8 is 
called a Grothendieck topos. 


Remark 2.2.10 We have seen that a Grothendieck topos is complete, cocom- 
plete, locally small and well-powered, and it has a separating set. It is also 
well-copowered (because isomorphism classes of epimorphisms with domain A 
correspond bijectively to equivalence relations on A), and has a coseparating set 
(in fact a single coseparator), by B3.1.13. Thus we are always free to appeal to the 
Special Adjoint Functor Theorem (B2.4.6) when considering functors between 
Grothendieck toposes: that is, a functor between Grothendieck toposes has a left 
(resp. right) adjoint iff it preserves all small limits (resp. colimits). In particular, 
we note that if F: E — F is a geometric functor between Grothendieck toposes, 
then it preserves all small colimits (by the argument of A1.4.19), and so has a 
right adjoint; thus it is the inverse image of a geometric morphism F — E. (This 
is essentially a restatement of something we observed in A4.1.18.) 


Another consequence of the ‘smallness condition’ built into the definition of a 
Grothendieck topos is also worth stating explicitly. Although the hom-categories 
of the 2-category BLop/Set of Grothendieck toposes need not be small, they 
do have an important ‘smallness property’: 


Lemma 2.2.11 Let € and F be Grothendieck toposes. Then there exists a set 
K of geometric morphisms F — E which is ‘relatively conservative’: that is, uf 
u: A’ >> A is a monomorphism in E such that f*(u) is an isomorphism for all 
f € K, then g*(u) is an isomorphism for all geometric morphisms g: F — €E. 


Proof Let (C,J/) be a small site such that € ~ Sh(C, J), and as usual let 
l: C — E denote the composite of the Yoneda embedding and the associated 
sheaf functor. For each U € ob C and each subobject m: R — I(U) in E, if there 
exists a geometric morphism f: F — E such that f*(m) is not an isomorphism, 
then choose one and call it fm; let K be the set of all such fm, as m ranges 
over (a representative set of) subobjects of objects of the form [(U). Now let 
u: A’ >> A be an arbitrary monomorphism in E, and suppose g*(u) is not an 
isomorphism for some g: F — E. Since A can be expressed as a colimit of objects 
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of the form [(U), and since g* preserves colimits and pullbacks, there must exist 
x: l(U) — A in E for which the pullback 


R ———> A’ 


(a= =A 


is such that g*(m) is not an isomorphism. But then we have fm € K such that 
fù (m) is not an isomorphism, and hence f (u) is not an isomorphism either. 
o 


Lemma 2.2.11 is most often of use in the particular case F = Set. By analogy 
with the case of sheaves on a locale, we define a point of a Grothendieck topos 
E to be a geometric morphism Set — E, and we say that E has enough points if 
the class of all inverse image functors € — Set is jointly conservative. We may 
thus deduce 


Corollary 2.2.12 A Grothendieck topos E has enough points iff there exists a 
surjection Set/K — E for some set K. 


Proof By B3.4.1 and the remarks following it (plus A1.1.6, if you insist), spec- 
ifying a geometric morphism Set /K — € is equivalent to specifying a K-indexed 
family of points of E. Moreover, the former is surjective iff the inverse images of 
the latter are jointly conservative. So the result is immediate from 2.2.11. oO 


In D2.3.12 we prove a stronger version of 2.2.11: namely, that 
BTop/Set (F, E) contains a set of objects K such that every object is express- 
ible as a filtered colimit of members of K. (We recall from B3.4.8 - though it 
also follows easily from 2.2.10 — that filtered colimits in BYop/Set (F, E) are 
computed ‘pointwise’ at the level of inverse image functors, so 2.2.11 does indeed 
follow from this assertion.) 

Next, we digress to consider the analogue for quasitoposes of Giraud’s 
Theorem. By a bisite, we mean a category C equipped with two coverages J 
and K, such that J C K. We say a functor A: C°P — Set is biseparated for 
the pair (J, K) if it satisfies the sheaf axiom for all covers in J and is separated 
for all covers in K; and we write Bisep(C,J, K} for the category of all such 
functors (and natural transformations between them). If C is small, it is easy to 
see that these functors are exactly the objects of Sh(C, J) which are separated 
(in the sense of A4.3.4) for the local operator k corresponding to the subtopos 
Sh(C, K) C Sh(C, J); hence by A4.4.5 Bisep(C, J, K) is a quasitopos. Moreover, 
it is reflective in Sh(C, J) and hence in (C°?, Set]; so it is complete and cocom- 
plete, and has a generating set consisting of the reflections of the representable 
functors. 
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One more definition: we say an equivalence relation R 3 A in a regular 
category is strong if the corresponding monomorphism R — A x A is a cocover 
(i.e. a strong monomorphism). Clearly, every effective equivalence relation is 
strong, since the indicated monomorphism is an equalizer. We say that a regular 
category is quasi-effective if every strong equivalence relation is effective. The 
proof given in A2.4.1 that a topos is effective as a regular category may easily 
be adapted to prove that a quasitopos is quasi-effective. 


Theorem 2.2.13 Fora category E, the following conditions are equivalent: 


(i) There exists a Grothendieck topos F and a local operator k on F such that 
E ~ sep, (F). 

(ii) E ts equivalent to Bisep(C, J, K) for some small bisite (C, J, K} (whose 
underlying category may be taken to have finite limits). 


(iii) E is a locally small, cocomplete quasitopos with a generating set. 


(iv) E is locally presentable, locally cartesian closed and quasi-effective. 


Proof The proof of (i) = (ii) => (iii) > (iv) is mostly contained in the dis- 
cussion before the statement of the theorem; the only thing missing is the local 
presentability, which may be established as in D2.3.7. So it remains to prove 
(iv) = (ii). 

Suppose € is locally «-presentable. Then by D2.3.4 we can represent it as 
the category of «-continuous functors C°P — Set, where C is a small category 
(equivalent to the full subcategory of K-presentable objects of €) with «-small 
colimits. We shall take this C as the underlying category of our bisite, and 
regard it as a full subcategory of €. The coverages J and K are defined as those 
induced on C by the coverages on E consisting of sieves containing small effective- 
epimorphic families, and small epimorphic families, respectively: note that, since 
E€ is a geometric category by A1.5.13, these families are indeed stable under 
pullback. Also, since every object of € is a («-filtered) colimit of objects of C, 
the inclusion C — E is J-dense and hence also K-dense; and every representable 
functor E°P — Set is a J-sheaf and K-separated, so we have E C Bisep(C, J, K) 
as subcategories of [C°?, Set]. 

For the reverse inclusion, we need to show that every (J, ’)-biseparated 
functor C°P —+ Set preserves «-small limits. Consider first a coproduct U = 
Xer Ui: by (the infinitary analogue of) A1.5.14, this coproduct is quasi-disjoint, 
i.e. the coprojections v;: U;  U are monic and U; N U; is a quasi-initial object 
for each i # j. In particular, each U; N U; is K-covered by the empty sieve; so if 
A: C°P — Set is (J, K)-biseparated, then each F(U; N U,) is either empty or a 
singleton. Now the v; generate a J-covering sieve; the sheaf axiom for this sieve 
says that A(U) is the limit of a diagram whose vertices are the A(U;) and the 
A(U; NU,), but the triviality of the latter means that we can omit them from 
the diagram without changing its limit. (Note that if any A(U; N U;) is empty, 
then both A(U;) and A(U,) must be empty too.) So A(U) — [[;er A(U;) is an 
isomorphism, as required. 
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It remains to consider coequalizers. First we note that the Yoneda embedding 
h: C — Bisep(C, J, K) preserves regular epimorphisms: since a regular epimor- 
phism q: V =» W generates a J-covering sieve, h(q) is the coequalizer of its 
kernel-pair in Bisep(C, J, K). Now consider an arbitrary coequalizer diagram 


W 


eV 


g 


in C; let q': h(B) > Q be the coequalizer of (h(f), h(g)) in Bisep(C, J, K), and 
(a,b): R 3 A(V) the kernel-pair of q’ in this category. We note that h(q) factors 
through q’ (so that h(q)a = h(q)b), and that (h(f), h(g)) factors through (a, b). 

We claim next that R lies in E. We have a monomorphism 
(a,b): R= h(V) x h(V); but h(V) x A(V) lies in £, so if we are given any 
other coequalizer diagram U’ 3 V’ > W’ in C and a morphism h(V') — R 
having equal composites with h(U’) 3 A(V’), then the composite h(V’) — R — 
h(V) x h(V) factors uniquely through h(V’) > h(W’), and from the fact that 
h preserves regular epimorphisms we get a unique lifting of this factorization 
through R — h(V) x h(V). So R preserves equalizers as a functor C° — Set; 
but by the argument already given we know that it preserves K-small products, 
and hence it lies in €. 

In fact (a,b): R= A(V) x h(V) is a cocover in Bisep(C, J, K}, since it is 
regular monic; but the inclusion E€ — Bisep(C, J, K) is easily seen to pre- 
serve epimorphisms, so it is also a cocover in E. Hence R 3 A(V) is a strong 
equivalence relation in E, and so it is the kernel-pair of its coequalizer there; 
but the latter may be identified with h(q), by the existence of the factoriza- 
tions mentioned earlier and the fact that h preserves coequalizers as a functor 
C — E. So h(q) is isomorphic to q’, since both are regular epimorphisms in 
Bisep(C, J, K) and they have the same kernel-pair. Thus we have shown that the 
inclusion C — Bisep(C, J, K) preserves coequalizers; equivalently, every object 
of Bisep(C, J, K), considered as a functor C°P — Set, preserves equalizers. 


oO 


A category satisfying the equivalent conditions of 2.2.13 is called a 
Grothendieck quasitopos. 


Examples 2.2.14 (a) We saw in A2.6.4(a) that any Heyting algebra is a qua- 
sitopos. Clearly, a Heyting algebra is a Grothendieck quasitopos iff it is complete, 
i.e. iff it is a frame. Given a frame L, we know from 1.3.15 that L may be 
recovered (up to equivalence) as the lattice of subterminal objects of Sh(L, J), 
where J is the canonical coverage on L; equivalently, it can be represented as 
Bisep(L, J, K) where K is the coverage (sometimes called the chaotic coverage) 
in which every sieve is covering, and so the only sheaf is the terminal object. 
(b) The quasitopos Fre of Fréchet spaces (A2.6.4(c)) was identified with a 
full subcategory of the topos Sh(M, J) of A2.1.11(j), where M is the monoid of 
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continuous endomorphisms of the space N+ and J is (the Grothendieck coverage 
generated by) the coverage defined in A2.1.11(j). It is not hard to see that 
Fre consists precisely of the ~—-separated objects of this topos; equivalently, 
we may identify it with Bisep(M,J, K), where K is the coverage consisting 
of all jointly-surjective sieves of endomorphisms of N+ (equivalently, of those 
sieves which contain all the constant maps Nt — Nt). So it is a Grothendieck 
quasitopos. 


The assumption ‘E has a generating set’ in 2.2.13(iii) cannot be weakened (as 
in 2.2.8(iv)) to ʻE has a separating set’. The quasitopos Choq of Choquet spaces, 
considered in A2.6.4(b), has a single separator, namely 1; but it does not have 
a generating set, by an argument like that given for Sp in A1.2.5. Similarly, 
the assumption of quasi-effectiveness cannot be omitted from 2.2.13(iv) (and 
effectiveness cannot be omitted from 2.2.8(ix)). We shall give a counterexample 
in 4.2.4 below. 

We now revert to the study of toposes. Given a (Grothendieck) topos €, 
by a site of definition for E we mean a site (C, J) such that E ~ Sh(C, J). Of 
course, there are many possible sites of definition for a given topos; but, at least 
if we restrict our attention to separated sites, it is possible to characterize them 
via Proposition 2.2.16 below. Given a small site (C, J), we shall normally write 
l: C — Sh(C, J) for the composite of the Yoneda embedding h: C — [C°P, Set] 
with the associated sheaf functor L: [C°P, Set] — Sh(C, J). 


Lemma 2.2.15 Let (C,J) be a small site. Then the functor l: C — Sh(C, J) 
is faithful (resp. full and faithful) iff the coverage J is separated (resp. 
subcanonical). 


Proof If J is subcanonical, then the Yoneda embedding h factors through the 
inclusion Sh(C, J) — [C°?, Set], and we may identify l with this factorization; 
so the fact that it is full and faithful follows from the corresponding property of 
h. If J is merely separated, then for each U € ob C the unit map h(U) — [(U) 
is monic, and so composition with it induces an injection 


C (V,U) = [c°?, Set] (A(V), h(U)) —> [C°?, Set] (A(V), 1(U)) 
= Sh(C) (1(V), (U)) 
which is clearly the effect of the functor l on morphisms V > U. 


Conversely, suppose L is faithful. Then for any two objects U and V of C, we 
have an injection 


h(U)(V) =C (V.U) —> Sh(C) (U(V), I(U)) = [C°P, Set} (h(V),1(U)) = (U)V); 


and these injections are natural in V, so they define a monomorphism h(U) > 
I(U) in [C°P, Set]. Hence by A4.3.6(a) h(U) is separated for any U, ie. J is a 
separated coverage. If l is full as well as faithful, then the above monomorphism 
is an isomorphism; so the h(U) are all sheaves, i.e. J is subcanonical. o 
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It follows from 2.2.15 that, given E, every small separated (resp. subcanonical) 
site of definition for € occurs, up to equivalence, as a subcategory (resp. a full 
subcategory) of € itself. In the subcanonical case, we can delete the word ‘small’ 
from this assertion: although the inclusion Sh(C, J) — [C°P, Set] may not have 
a left adjoint if C is not small, it is still true that if J is subcanonical then 
the Yoneda embedding factors through the inclusion, and so we may identify 
C with a full subcategory of Sh(C, J). Moreover, it is not hard to see which 
(full) subcategories of E give rise to such sites: they are precisely those which 
satisfy the hypotheses of the Comparison Lemma 2.2.3, when we equip E with 
its canonical coverage. Finally, the assertion that a full subcategory C is dense 
for the canonical coverage is clearly equivalent to saying that ob C is a separating 
family for €. Thus we have established 


Proposition 2.2.16 Let E€ be a Grothendieck topos. Then, up to 
equivalence, 


(i) the small sites of definition (C,J) for E such that J is separated are pre- 
cisely the sites obtained by equipping small dense subcategories of E with 
the coverages induced by the canonical coverage on E; and 


(ii) the essentially small sites of definition (C, J) for E such that J is subcanon- 
ical are precisely the sites obtained by equipping full subcategories C of E 
such that ob C is a separating family for E with the coverages induced by 
the canonical coverage on E. oO 


The ‘fundamental theorem of topos theory’ (A2.3.2) says that if E is a topos 
then so is €/A for any object A of E€. Moreover, it is easy to see that if E is 
a Grothendieck topos then so is €/A. The following result is often useful in 
obtaining a site of definition for €/A. 


Lemma 2.2.17 Let (C, J) be an essentially small subcanonical site, U an object 
of C. Define a sieve R on an object (f: V — U) of C/U to be Jy-covering iff the 
sieve {Xu(g) | g € R} is J-covering on V = Ly (f). Then (C/U, Ju) is an 
essentially small subcanonical site, and Sh(C/U, Ju) = Sh(C, J)/1(U). 


Proof It is clear that Jy satisfies conditions (C), (L) and (M) if J does; 
and if D is a small dense full subcategory of C, then the full subcategory of 
C/U on all objects (f: V = U) with V € ob D is (small and) dense. More- 
over, Jy is subcanonical: given a Jy-covering sieve R on an object f: V — U 
of C/U and another object g: W — U, a compatible family of morphisms 
(hy: dom k +g | k e R) is in particular a compatible family for the sieve 
{Xu(k) | k € R}, and hence induces a unique morphism h: V > W, and the 
latter is a morphism f — g in C/U (that is, gh = f) since C(—,U) is a J-sheaf. 
The final assertion now follows from 2.2.16(ii), since if we identify C with a dense 
full subcategory of Sh(C, J) then C/U will be identified with a dense full sub- 
category of Sh(C, J)/I(U) (and the coverage induced on it will be exactly Jy, 
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since the forgetful functor Sh(C, J)/1(U) — Sh(C, J) preserves and reflects epi- 
morphic families). o 


Before leaving this section, we revert briefly to the consideration of those 
coverages which arise as in 2.2.4(d). 


Definition 2.2.18 Let (C, J) be a site. 


(a) We say an object V of C is (J-)irreducible if the only J-covering sieve on V 
is the maximal sieve My. 


(b) We say J is rigid if, for every U € ob C, the family of all morphisms from 
J-irreducible objects to U generates a J-covering sieve. 


If J is rigid, then for every U € ob C the sieve described in 2.2.18(b) must 
be the unique smallest J-covering sieve on U; in particular, every object has a 
smallest J-covering sieve (cf. A4.5.2). (But the latter condition does not imply 
rigidity: for example, on any dense totally ordered set, such as Q or R, we 
have a (Grothendieck) coverage in which each object U has just two covering 
sieves, namely My and the sieve of all morphisms V — U with V < U, and 
this clearly has no irreducible objects.) Clearly, a coverage which arises as in 
2.2.4(d) is rigid, the irreducible objects being (retracts of) objects of the full 
subcategory D; conversely, if J is a rigid coverage on C, then (provided the full 
subcategory D of J-irreducible objects of C is small) the Comparison Lemma 
yields an equivalence Sh(C, J) ~ [D°?, Set}. 


Example 2.2.19 Let C be a small cartesian category, and Reg(C) the regu- 
larization of C, as defined in A1.3.9. Then the regular coverage J on Reg(C) is 
rigid; for we saw in the proof of A1.3.9 that objects of the form (A), A € ob C, 
are projective in Reg(C) (and hence J-irreducible, since any cover C —p J(A) 
is a split epimorphism); on the other hand, as we noted in A1.3.10(b), every 
object of Reg(C) can be covered by one of the form J(A). So in this case the 
Comparison Lemma tells us that Sh(Reg(C), J) is equivalent to the functor cat- 
egory [C°P, Set]. More generally, if D is any small regular category with enough 
projectives (i.e. such that every object can be covered by a projective), then the 
topos of sheaves for the regular coverage on D is equivalent to [C°?, Set], where 
C is the full subcategory of projective objects of D. 


As an application of the notion of rigidity, we may establish a useful 
companion for A1.1.10: 


Lemma 2.2.20 For a Grothendieck topos E, the following are equivalent: 
(i) There exists a small category C such that E ~ [C°P, Set]. 
(ii) The canonical coverage on E is rigid. 


(iii) E has a separating set of indecomposable projective objects. 


Proof (i) = (iii) follows easily from (the remarks before) A1.1.10, since the 
representable functors form a separating set for [C°?, Set]. 
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(iii) => (ii): Suppose A is an indecomposable projective in €. Then, given 
any epimorphic family (f;:B; > A | i € J), at least one f; must be a split 
epimorphism; for projectivity allows us to find a splitting s: A — [ [;er Bi for the 
induced epimorphism, and if we pull back the coprojections along s we obtain a 
coproduct decomposition of A, which must be trivial. So A is irreducible for the 
canonical coverage on €; thus (iii) says that every object of E can be covered by 
irreducible objects. 

(ii) > (i) will be immediate from the Comparison Lemma, provided we can 
show that the full subcategory of irreducible objects for the canonical coverage 
is equivalent to a small category. But, if G is any separating set for €, then for 
each irreducible object A the family of all morphisms from members of G to A 
must contain a split epimorphism; hence A is a retract of some member of G, 
and there are (up to isomorphism) only a set of such retracts. O 


In passing, we remark that Grothendieck toposes with a separating set of 
projective (but not necessarily indecomposable) objects have been characterized 
in [531]. 

Rigid coverages also arise from consideration of finite categories: 


Lemma 2.2.21 If C is a finite Cauchy-complete category, then every 
(Grothendieck) coverage on C is rigid. 


Proof Let J be such a coverage. Since J-covering sieves are closed under 
finite intersections by the remarks after 2.1.8, any object U of C has a small- 
est J-covering sieve, which we shall denote by Ry. We must show that Ry is 
generated by morphisms whose domains are J-irreducible. 

We shall say that a morphism f: V — U is essential in a sieve R if (f € R 
and) it does not factor nontrivially through any other morphism in R, i.e., when- 
ever we have a factorization f = gh with g € R, then h is asplit monomorphism. 
We claim first that if f is essential in Ry, then V is J-irreducible. For the sieve 
consisting of all composites gh with g € Ry and h € Radom g is J-covering by 
(L) and so contains (in fact equals) Ry; hence we have a factorization f = gh 
with g € Ry and h € Raom g, whence h is split monic (with splitting k, say). 
Now h € k*(Ry) since the latter is a J-covering sieve on dom k = dom g; so 
kh= ly € Ry, 1.e. Ry = My. 

Thus we are reduced to showing that Ry is generated by its essential mem- 
bers. In fact this is true for any sieve R on an object U of a finite Cauchy-complete 
category: to prove it, let fo: Vj ~ U be a member of R. We may choose a 
sequence of morphisms f,:V, > U in R, such that fm factors through fn 
whenever m < n, by the following rules: 


(a) If fn can be factored as g,h, where h, is split epic but not an isomorphism, 
choose such a factorization and set fri: = gn. (Note that any such gn is 
necessarily in R.) 
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(b) If (a) is impossible but f» can be factored as g,h, where gn € R and hy is 
not split monic, choose such a factorization and set fn41 = Jn- 


(c) If both (a) and (b) are impossible, then stop. 


Clearly, if the process stops then it does so at a morphism f, which is essential 
in R. But since R is finite, it must either stop or repeat itself; i.e., if it does 
not stop, we can find n < m with fn = fm. Composing the h; forn <j <m 
in the appropriate order, we thus obtain for each j in this range a morphism 
k,:V; > V, with f,k, = fj. Now k; cannot be an isomorphism, for if it were 
then h, would be a split monomorphism. However, since V, has only finitely 
many endomorphisms some power of k; must be idempotent; since C is Cauchy- 
complete this idempotent splits, and hence f, factors through some nontrivial 
split epimorphism with domain V;, so that h; must be such an epimorphism. 
Since this is true for every j, n < j < m, it follows that kn is a (split) epimor- 
phism; but this is impossible, since we have just seen that some power of kn is a 
non-invertible idempotent. So the process described above must stop; hence fo 
factors through an essential member of R. O 


The proof of 2.2.21 easily extends to any category C such that each slice 
category C/U, U € ob C, is equivalent to a finite category (cf. A2.1.5). 


Corollary 2.2.22 Every bounded Set ;-topos is (equivalent to one) of the form 
[D°P, Sets] where D is a finite category. 


Proof By the analogue of 2.2.8 with ‘small’ replaced by ‘finite’, every such 
topos E is equivalent to one of the form Sh;(C,J), the category of sheaves of 
finite sets on a finite site (C, J). Moreover, since idempotents split in Sets, we 
may assume by A1.1.9 that C is Cauchy-complete. Then by 2.2.21 J is rigid, 
so by (the finite analogue of) 2.2.4(d) E is equivalent to [D°?, Set] where D is 
the full subcategory of J-irreducible objects of C. (Note, incidentally, that any 
retract of a J-irreducible object is J-irreducible, so D is also Cauchy-complete.) 

O 


There is a stronger ‘functorial’ version of 2.2.22. By the finite analogue of 
2.2.10, any cartesian functor between bounded Set s-toposes has a left adjoint, 
and hence any geometric morphism between such toposes is essential. But by 
the finite version of A4.1.5, an essential geometric morphism between toposes 
of the form |C, Sets] is induced by a functor between the corresponding finite 
categories, provided they are Cauchy-complete, and so we have 


Corollary 2.2.23 The assignment C +> [C,Set;] is an equivalence of 
2-categories from the 2-category Cau; of finite Cauchy-complete categories, func- 
tors and natural transformations between them, to the 2-category BYop/Set s 
of bounded Set ;-toposes. Oo 
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Thus, if we replace the base topos Set by Sets, the study of ‘Grothendieck 
toposes’ reduces entirely to the study of finite (Cauchy-complete) categories. If 
we view toposes as generalized spaces, this result may be seen as the natural 
generalization of the well-known fact that the category of finite Ty-spaces is 
isomorphic to the category of finite posets. 


Suggestions for further reading: Artin et al. [36], Borceux & Pedicchio [156], 
Bunge [182], Carboni & Mantovani [228], Pedicchio [942]. 


C2.3 Morphisms of sites 


In this section we investigate those functors between (the underlying 
categories of) sites which give rise to geometric morphisms between their toposes 
of sheaves. Such functors are traditionally called ‘continuous functors’, but this 
name is unsatisfactory, for two reasons. First, it clashes with the common use 
of ‘continuous’ in category theory to mean ‘limit-preserving’ (continuity in this 
sense is actually more akin to a colimit-preservation property); and secondly, 
whilst the traditional justification for the name is that continuous functors 
between sites are the natural generalization of continuous maps between spaces, 
they actually go in the ‘algebraic’ rather than the ‘geometric’ direction, i.e. 
the opposite direction to the geometric morphisms which they induce. We have 
therefore chosen to abandon the traditional name, and use the slightly more 
cumbersome but less misleading term ‘cover-preserving’; we shall also use the 
term ‘morphism of sites’ for a functor which is cartesian and cover-preserving. 

Unless otherwise stated, all coverages in this section will be Grothendieck 
coverages. For most of the section, we shall also assume that the underlying 
categories of our sites are cartesian; we shall discuss in 2.3.7 below how the 
theory needs to be adapted if this hypothesis is dropped. 


Definition 2.3.1 Let (C,J) and (D, K) be (cartesian) sites, and F:C > Da 
functor. We say F is cover-preserving if, for any U € ob C and any covering sieve 
R € J(U), the family (F(f) | f € R) generates a K-covering sieve on F(U). And 
we shall say that F is a morphism of sites (C, J) — (D, K) if it is cartesian and 
cover-preserving. 


Examples 2.3.2 (a) If A and B are frames, made into sites via their canonical 
coverages, then a morphism of sites A — B is exactly the same thing as a frame 
homomorphism, i.e. a function preserving finite meets and arbitrary joins. 

(b) If C and D are regular categories equipped with their regular coverages, 
then a morphism of sites C — D is the same thing as a regular functor, i.e. one 
preserving finite limits and covers. Similarly, coherent functors between coherent 
categories are the same thing as morphisms of sites for their coherent coverages. 

(c) Given any small cartesian site (C,J), let 1: C => Sh(C,J) denote the 
composite of the Yoneda embedding h: C — [C°P, Set] and the associated sheaf 
functor, as in 2.2.15. Then l is cartesian, since both h and the associated sheaf 
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functor preserve finite limits; and it preserves covers if Sh(C, J) is equipped with 
its canonical topology, since if R € J(U), then R (regarded as a subobject of 
h(U) in [C°?, Set]) is the union of the images of the morphisms h(f), f € R, and 
the associated sheaf functor maps the inclusion R — h(U) to an isomorphism. 
So l is a morphism of sites. If C is not small, then we do not in general have an 
associated sheaf functor [C°?, Set] — Sh(C, J); but if (C, J) is essentially small 
and J is subcanonical then we shall still write l: C — Sh(C, J) for the factoriza- 
tion of the Yoneda embedding through the inclusion Sh(C, J) — [C°P, Set]; and 
in this case also it is easily seen to be a morphism of sites. 


Lemma 2.3.3 Let F:(C,J)—(D,K) be a morphism of sites. Then the 
induced functor F*: [D°P, Set] — [C°P, Set] which sends B to Bo F restricts 
to a functor Sh(D, K) > Sh(C, J). 


Proof We have to show that if B is a K-sheaf on D then F*(B) is a J-sheaf 
on C. So let R be a J-covering sieve on an object U of C, and suppose we have 
a compatible family of elements (ss € B(F(dom f)) | f € R). Let S be the 
(K-covering) sieve on FU generated by the images under F of the members 
of R; it is enough to show that there is a unique compatible family (t, | g € 
S) of elements of B such that trp = sy for all f € R. The uniqueness is 
clear since the F(f) generate S; for the existence, note that if a morphism 
g: W — FU of S factors through two different morphisms F'(f): FV — FU and 
F(f'): FV’ — FU in the image of R (say g = F(f)h = F(f')h’), then it also 
factors through F(f”) where 


V =’ yV 
4 
Il 


/ 
vi su 
is a pullback in C, since F preserves this pullback. So the compatibility of the 
family (ss | f € R) implies that we have B(h)(s¢) = B(h”)(s-) = B(h’)(s p); 
hence we can unambiguously define t, to be B(h)(sp) for any factorization g = 
F(f)h of g through a morphism in the image of R. 0o 


The proof of 2.3.3 should be compared with that of 2.2.2(ii): in the earlier 
result, we did not assume that the inclusion functor was cartesian (although 
2.2.2(i) implied that it was cover-preserving), and to compensate for this we 
needed the hypothesis that its domain was a dense subcategory of its codomain. 
On the other hand, even without the assumption of cartesianness, the proof of 
2.3.3 shows that composition with a cover-preserving functor maps sheaves to 
separated functors — a fact which we shall use in the proof of 3.1.21 below. 

Note, incidentally, that we did not use the full strength of cartesianness of F 
in 2.3.3, but only the preservation of pullbacks (equivalently, by A1.2.9, of finite 
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connected limits). If we were to weaken the definition of ‘morphism of sites’ in this 
way, then the argument of 2.3.4 below would show that every such morphism 
yields a pre-geometric morphism of sheaf toposes, as defined in A4.1.13. For 
example, in the situation of 2.2.17, the forgetful functor Xy: C/U — C induces 
the pre-geometric morphism Sh(C) — Sh(C)/I(U) whose inverse image is Eyy) 
(and this example is ‘generic’, in that any ‘pre-morphism of sites’ with codomain 
C factors as a morphism of sites followed by a functor of the above form). 


Corollary 2.3.4 In addition to the hypotheses of 2.3.3, assume that C is 
small and that (D, K) is essentially small. Then there is a geometric morphism 
Sh(D, K) — Sh(C, J) whose direct image is F*. 


Proof If Dis also small, it is easy to see that F* has a left adjoint Fi, namely 
the composite 


lim p a 
Sh(C) —> [C°?, Set] ——> [D°?, Set] —> Sh(D) 


where a is the associated sheaf functor: and this functor is cartesian, since the left 
Kan extension lim p is cartesian by the argument of A4.1.10, and a is cartesian 
by the argument sketched after 2.2.6 (or by A4.3.1 and A4.4.6). (In fact, the 
existence of a left adjoint for F* could have been deduced immediately from 
2.2.10, since F* preserves all small limits as a functor [D°?, Set] — [C°?, Set] 
and hence also as a functor Sh(D) — Sh(C).) 

If D is not small, we do not in general have the associated sheaf functor 
available; but we may reduce to the case just considered on replacing D by a 
small dense full subcategory D’ which is closed under finite limits and contains 
the image of F. For if we equip this subcategory with the induced coverage Ky, 
then F is still a morphism of sites (C, J) — (D’, Kp), and so we get a cartesian 
left adjoint for F*: Sh(D’) — Sh(C). But Sh(D') is equivalent to Sh(D) by 2.2.3. 

E 


Remark 2.3.5 We note that 2.3.4 can be made functorial: given two mor- 
phisms of sites F,G: (C, J) 3 (D, K), every natural transformation F — G 
induces a natural transformation G* — F* (since sheaves are contravariant func- 
tors), and hence a geometric transformation (F*, F) — (G*, G). The converse 
holds if K is subcanonical; for then l: D — Sh(D) is full and faithful, and it 
follows easily from the adjunction that we have Fi(l(U)) (FU) for any object 
U of C. So any natural transformation F) — G, induces a natural transformation 
F — G by restriction along l. 


Remark 2.3.6 There is also a converse to 2.3.4: if (C, J) and (D, K) are small 
(cartesian) sites, and F:C — D is a cartesian functor such that the geometric 
morphism (F*, lim f): [D°P, Set] — [C°P, Set] of A4.1.10 restricts to a geometric 
morphism Sh(D, K) — Sh(C,J), then F' is cover-preserving. For, by A4.3.12, 
such a restriction exists iff F* maps K-sheaves to J-sheaves, iff lim ys sends 
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j-dense monomorphisms to k-dense monomorphisms (where j and k are the local 
operators corresponding to J and K as in 2.1.10). Now if R is any J-covering 
sieve on an object. U of C, we may regard it as a j-dense subobject of the 
representable functor C (—,U); but lim; sends the latter to D (—, FU), and it is 
easy to see that lim (A) is (the subfunctor corresponding to) the sieve generated 
by {F(f) | f € R}. So this sieve must be K-covering. 


Remark 2.3.7 If we wish to work with (small) non-cartesian sites, then 
the argument of 2.3.4 can still be made to work provided the functor 
lim p: [C°P, Set] — [D°P, Set] is cartesian — for which it suffices that, for each 
object V of D, the comma category (V | F) is cofiltered (cf. A4.1.10 and 
B3.2.8(c)). A functor with this property is commonly called flat; in the case 
when C has finite limits, the condition is equivalent to saying that F preserves 
them, since we can recover F as the restriction of lim p to the subcategories of 
representable functors. Moreover, if this condition is satisfied, then the argument 
of 2.3.3 can also be made to work: in the notation of the proof of 2.3.3, we simply 
take (V”, h”) to be an object of (W | F) admitting morphisms in this category 
to (V,h) and (V’,h’), whose composites with f and f’ are equal. Accordingly, we 
define a morphism of sites F: (C, J) — (D, K) between not-necessarily-cartesian 
sites to be a functor which preserves covers and is flat; and the results of this 
section can all be made to work in this more general context. Nevertheless, for 
the time being we shall continue to restrict our attention to cartesian sites in 
what follows. 


Given two sites (C, J) and (D, K), we cannot in general expect every geomet- 
ric morphism f: Sh(D, K) — Sh(C, J) to be induced as in 2.3.4 by a morphism 
of sites (C, J) — (D, K): clearly, a necessary condition for this is that the inverse 
image functor f* should map objects in the image of l to objects in the image 
of |. However, this is essentially the only obstruction: 


Lemma 2.3.8 Let (C,J) and (D, K) be cartesian sites such that C is small 
and (D,K) is essentially small and subcanonical. Then a geometric mor- 
phism f:Sh(D, K) — Sh(C,J) is induced by a morphism of sites (C,J) —> 
(D,K) iff the composite f*l:C — Sh(D, K) factors through the embedding 
l: D — Sh(D, K). 


Proof [It suffices to show that the resulting factorization F: C — D is a mor- 
phism of sites: it will then follow immediately that f* is isomorphic to Fi, since 
the two functors agree on the image of l and both preserve colimits. Clearly, 
F is cartesian since f*l is. Also, f*l preserves covers (that is, maps them to 
epimorphic families in Sh(D, K)); and since K is subcanonical it follows from 
2.2.16 that the coverage K is that induced on D by the canonical coverage on 
Sh(D, K), i.e. that l: D — Sh(D, K) reflects covers as well as preserving them. 
Hence F preserves covers. 0o 


Corollary 2.3.9 For any small cartesian site (C, J) and any Grothendieck 
topos E, the category Sop (€, Sh(C, J)) of geometric morphisms E — Sh(C, J) is 
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equivalent to the category Gite ((C, J),(E,C)) of morphisms of sites and natural 
transformations between them, where C is the canonical coverage on E. 


Proof Since l: E + Sh(E€,C) is an equivalence by 2.2.7, every geometric 
morphism € — Sh(C,J) satisfies the hypothesis of 2.3.8. The fact that the 
assignment F +> (F*, Fi) yields an equivalence of categories then follows 
from 2.3.5. O 


If we wish to consider only small sites, then we have an alternative to 2.3.9: 


Corollary 2.3.10 For any geometric morphism f:F—E between 
Grothendieck toposes, there exist (small) standard sites of definition (C, J) and 
(D,K) for E and F respectively, such that f is induced by a morphism of sites 
(C, J) > (D,K). 


Proof First choose a standard site (C, J) for E, as in 2.2.16; then choose a 
similar site for F ensuring that ob D contains, in addition to a separating set 
for F, all objects of the form f*(l(U)), U € ob C. The result is then immediate 
from 2.3.8. O 


Remark 2.3.11 In general, the site (D, K) in 2.3.10 will depend on the mor- 
phism f; we cannot expect to find a single (small) site which will suffice for all 
geometric morphisms F — €E. However, there is an important special case in 
which this is possible: namely, when the topos € is localic (over Set). For in 
this case we can take Sub¢(1), with its canonical coverage, as our site for €, and 
since any inverse image functor preserves 1 and monomorphisms we can take D 
to be any small subcategory of F containing a separating set and (representa- 
tives of) all subobjects of 1 in F. In the particular case when F is also localic, 
we recover the result of 1.4.5 that Top (F,E) is equivalent to the poset of frame 
homomorphisms Sub¢(1) — Sub#(1), by 2.3.2(a). 


Given a functor F: C — D between two categories and a coverage on one of 
them, it is natural to ask whether there is a ‘best possible’ coverage on the other 
which makes the functor cover-preserving. In one direction this is easy; in the 
other rather less so. 


Lemma 2.3.12 Given a functor F:C — D and a coverage J on C, there is a 
unique smallest coverage Fa J on D for which F is cover-preserving. If C and D 
are small and cartesian and F preserves finite limits, then this coverage makes 


the square 
Sh(D, F.J) ——> Sh(C, J) 


[DP Set] = [c°P, Set] 


a pullback in Yop, where f is the morphism induced by F as in A4.1.10. 
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Proof Since an intersection of (Grothendieck) coverages is a coverage, we 
simply define F,J to be the intersection of all coverages containing the sieve 
generated by {F(f) | f € R} for every J-covering sieve R. For the second part, 
we note that Sh(D, F.J) is the largest subtopos of [D°?,Set] for which the 
composite Sh(D) — [DP , Set] — [C°P, Set] factors through Sh(C, J), by 2.3.6; 
but this was how we defined pullbacks of inclusions in Top, in A4.5.14(e). O 


Lemma 2.3.13 Suppose given a functor F:C — D and a coverage K on D. 
IfC has and F preserves pullbacks, then there is a unique largest coverage F° K 
on C for which F is cover-preserving. If C and D are small and cartesian and F 
preserves finite limits, then Sh(C, F*K) — [C°P, Set] is the image (in the sense 
of A4.2.10) of the composite Sh(D, K) — [D°?, Set] — [C°P, Set]. 


Proof We define F*K(U) to consist of all sieves R on U for which the set 
{Ff | f € R} generates a K-covering sieve on FU. Clearly, if this is indeed 
a coverage then it is the largest coverage for which F is cover-preserving. It is 
straightforward to verify that F° K satisfies (M) and (L) if K does; the difficulty 
lies in verifying (C) (or equivalently (C’)), since if g: U’ — U is a morphism of 
C then the sieve generated by {F f’ | f’ € g*R} is in general smaller than the 
pullback along Fg of the sieve generated by {Ff | f € R}. However, under 
the hypothesis that F preserves pullbacks, the two are equal, since both are 
generated by the left vertical morphisms in the pullback squares 


Fy' ——> FV 


Ff 


Fg 
FU' ———> FU 
as f ranges over the morphisms in R. The second assertion again follows from 
2.3.6, since Sh(C, F° K) is the smallest subtopos of [C°?, Set] through which the 
given composite factors. 0o 


Combining 2.3.9 and 2.3.12, we may obtain a description of weighted limits 
in BTop/Set (cf. Section B4.1) in terms of sites. 


Lemma 2.3.14 Suppose given a small weighted diagram (D,W) in 
BTop/Set; suppose further that we have chosen cartesian sites (C,, Ji) for each 
topos €, appearing as a vertex of this diagram, in such a way that each geometric 
morphism fa: E, — E, appearing as an edge of the diagram is induced by a mor- 
phism of sites Fa: (Ci, Ji)  (C).J,). Then the weighted limit of the diagram may 
be identified with Sh(C,., Jæ), where C~ is the weighted colimit of the diagram 
formed by the C; and the Fa in the 2-category Catt of small cartesian categories 
and cartesian functors, and J, is the smallest coverage on Cx for which each 
leg F;: C, + Cœ of the colimit cone is cover-preserving (equivalently, Jæ is the 
join of the coverages (F,)eJ, in the lattice of (Grothendieck) coverages on Ca). 
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Proof By 2.3.9, a weighted cone over D with vertex F corresponds to a 
weighted cone in Cart under the diagram formed by the C; and Fy, all of whose 
legs are cover-preserving functors. If we impose on Cæ the coverage J, described 
in the statement, then a cartesian functor G: Cæ — F is cover-preserving (for 
the canonical coverage C on F) iff Jæ C G*(C), iff each composite GF; is cover- 
preserving. So the result is immediate. 0o 


Lemma 2.3.14 could be formulated for sites whose underlying categories 
are not cartesian, provided we consider weighted colimits in an appropriate 
2-category Flat of small categories and flat functors between them. Of course, 
the usefulness of the result depends on our ability to give explicit descriptions 
of the required weighted colimits, and this tends to be simpler in Cart than in 
Flat. There are a number of particular cases where we can do so relatively easily. 


Examples 2.3.15 (a) Finite coproducts in Cart coincide with products, in 
much the same way that they coincide in an abelian category: given carte- 
sian categories Cı and C2, the projections P;: Cı x C2 — C; have (automatically 
cartesian!) right adjoints R; (defined by Ri(U) = (U,1) and R2(V) = (1,V)), 
which have the universal property of a coproduct cone. (Given cartesian functors 
Gı: Cı — D and G2: C2 — D, the induced cartesian functor Cı x Co ~ D sends 
an object (U,V) to GU x GaV.) So 2.3.14 yields a description of the product of 
two Grothendieck toposes Sh(C, J1) and Sh(C2, J2) in the form Sh(C; x C2, J3) 
for a suitable coverage J3. In particular, restricting to the case when J; and J2 
are trivial, we recover (part of) the result of B3.2.13 that [C;?, Set] x [C3 , Set] is 
equivalent to [(C; xC2)°?, Set]. (In B3.2.13 we proved this without the restriction 
that Cı and Cz should be cartesian; but we cannot do the same here, because 
the coincidence of finite products and coproducts fails to hold in §lat.) 

(b) There are other instances of limit—colimit coincidence in Catt, which can 
be exploited in the same way. For example, suppose given a cartesian functor 
FC, — C2; then the comma category GI(F) = (C2|F) (cf. A2.1.12) has the 
property that its two projections P;: GI(F) — C; have right adjoints R; (specif- 
ically, R,(U) = (U,FU,1lrv) and Rə(V) = (1,V,(V — 1))), and there is an 
obvious natural transformation 3: R,; — R2F (the mate of the universal 2-cell 
a: Pa — FP) giving Gl(F) the universal property of a cocomma object (C1 f? F) 
in Cart. Given cartesian functors G;: C; — D (i = 1,2) and a natural transfor- 
mation y: Gı > G2F, the induced cartesian functor GI(F) —> D sends (U,V, f) 
to the pullback of 


GU 


Yu 


Gof 
GoV ——> GFU 
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(Observant readers will recognize that we have made use of this property of 
GIl(F’) several times in earlier parts of this book, when applying the glueing 
construction to toposes.) Thus, if we wish to construct a site of definition for 
a comma object of the form (£, | f) in BTop/Set, for a geometric morphism 
f: E2 — E, we need only find cartesian sites of definition (C;, J;) for the E; such 
that f is induced by a morphism of sites F: (C1, J1) — (C2, J2), and then impose 
the appropriate coverage on GL(F). 

(c) Coinverters in Cart may be constructed using categories of fractions in 
the sense of Gabriel and Zisman [387]. To form the coinverter of a natural trans- 
formation a: F — G between cartesian functors B = C, let X be the set of those 
morphisms of C which can be expressed as composites of pullbacks of components 
ay: FU — GU of a. Then © admits a calculus of right fractions, so that the cat- 
egory of fractions C[X~*] is cartesian, and the canonical functor P: C — C[X7*] 
preserves finite limits; on the other hand, if H: C — D is any cartesian functor 
such that H oq is an isomorphism, then H must send all morphisms in © to iso- 
morphisms, and so it factors uniquely through P. Thus we obtain a description 
of inverters in BTop/Set in terms of sites (though we could alternatively have 
done this by retaining the same category C but imposing a larger coverage on it; 
cf. B4.1.7). 

(d) By putting together the last three constructions, we may obtain a 
description of general cocomma objects in Cart, as follows. Suppose given 
a span 


Se es ee 


in Cart. We first form the comma category GI(F,G) = (C2 x C3 | (F,G)), whose 
objects are quintuples (U,V,W, f,g) such that f: V — FU and g: W — GU. 
As in (b), this has the universal property of a cocomma object (C, ?(F,G)) in 
Cart, but since C2 x C3 is also a coproduct in Cart we may regard this as a 
‘lax pushout’, that is a normalized lax colimit for the given span. Explicitly, this 
means that we have functors R;: C; — GI(F,G) (i = 1, 2,3) together with 2-cells 
a: Ry — RF and 8: Rı — R3G, which are universal in the obvious sense. (The 
word ‘normalized’ signifies the fact that, in comparison with the lax and oplax 
limits considered in B1.1.6, we have not required the presence in the lax cone 
of 2-cells corresponding to the identity morphisms in the diagram — or, if you 
prefer, that we have required these 2-cells to be identities. ) 

Now, if we wish to construct (FTG) in Cart, we have simply to 
form the coinverter L: Gl(F,G) — £ of the 2-cell a, since then the 2-cell 
(Lo 8)(L 0 a)~! will have the appropriate universal property. As we observed 
in (c), we may do this by forming a category of fractions; in the present 
case, we have to form GI(F,G)[=~'], where © consists of all morphisms 
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(u,v, w): (U, V, W, f,g) — (U', V’, W’, f’, 9’) such that v is an isomorphism and 


w= Gi 


w Gu 


1 
w —— > Gu’ 
is a pullback. For every such morphism may be expressed as a pullback of the U- 
component ((U — 1), 1, (GU — 1)): (U, FU,GU, 1,1) > (1, FU,1, (FU — 1),1) 
of œ; and the class of such morphisms is stable under composition. 
Using the explicit description of cocomma objects in Cart provided by 
2.3.15(d), we may prove a result promised in Section B4.1. First we need 


Lemma 2.3.16 Suppose given a cocomma square 


eee 
G | P 


C3 ———> D 


in Cart, where F has a (not necessarily cartesian) left adjoint L. Then Q has 
a C3-indexed left adjoint M, and the Beck-Chevalley natural transformation 
MP — GL is an isomorphism. 


Proof By 2.3.15(d), we may identify D with a category of fractions 
Gl(F,G)[=~*]; but in the present case we may also identify it with a subcate- 
gory of GI(F,G), namely the full subcategory on those objects (U, V, W, f, g) for 
which the transpose f: LV — U of f is an isomorphism. For this subcategory is 
coreflective in GI(F, G); the coreflector sends an arbitrary object (U,V, W, f, 9) 
to (LV, V, W,nv,9), where 7) is the unit of (L 4 F) and 


A 


wlw 


{> 
Q 


GF 
GLV ——> GU 

is a pullback. The counit (F, 1, f): (LV, V,W ‚nv, ĝ) — (U,V, W, f.g) is easily 

seen to belong to ©, and the coreflector maps all morphisms in X to isomor- 

phisms, from which it follows easily that it has the universal property of the 

canonical functor GI(F, G) — GI(F,G){=7"]. 
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We may further simplify our description of D by considering its objects to 
be triples (V, W, g) where g: W — GLV in C3; in other words, it is simply the 
comma, category (C3) GL). In terms of this description, the functors P and Q 
are given by 


P(V) =(V,GLV,letv) and Q(W) = (1,GL(1) x W,m). 


It is now easy to see that both P and Q have left adjoints given by the two 
projections, and that the left adjoint M of Q satisfies MP(V) = GL(V) for any 
V € obC2. Moreover, the product of two objects (V, W, g) and (V’,W’, 9’) of D 
is of the form (V x V’,W”, 9”), where W” is the limit of the diagram 


GL(V x V’) 
aN ral Ne A 
GLV” 
from which it follows easily that the Frobenius reciprocity condition 
M((V, W, g) x Q(W')) = M(V, W, g) x W" 


holds. Using this (and the analogous result for pullbacks in D), we may easily 
verify that the left adjoints MW for the functors Q/W : C3/W — D/QW, defined 
as in B1.2.4, fit together to form a C3-indexed left adjoint for Q. O 


Theorem 2.3.17 Suppose given a comma square 


Gig se 
p > g 
f 


Pe 


in BTop/Set, where f is essential. Then q is G-essential (that is, q* has a 
G-indezed right adjoint), and the Beck-Chevalley natural transformation qmp* > 
g* fı is an isomorphism. 


Proof We need to choose small full subcategories A, B, C of E, F, G respec- 
tively, such that each is closed under finite limits and contains a generating set 
for the relevant topos, and such that g*, f* and fı all restrict to functors between 
the corresponding subcategories. To do this, we take A and B to be the unions of 
increasing sequences An and B, respectively, chosen such that f* maps An into 
Bn and fı maps Bn into An+ı for each n; then we choose C to be any small full 
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subcategory of G which contains the image of A under g*, as well as a generating 
set, and is closed under finite limits. 

Now let F, G and L denote the restrictions of f*, g* and f, to the relevant 
small subcategories, and form the cocomma object D = (F 7G) in Cart as in 
2.3.16 — that is, we take D to be the comma category (C | GL) in the classical 
sense. By 2.3.14, the comma object (f |g) in BYop/Set may be taken to be 
Sh(D, K), where K is the smallest coverage on D for which the functors P and 
Q are cover-preserving (relative to the coverages H and J on B and C induced 
by the canonical coverages on F and G). 

We claim that K may be described explicitly as follows: a family 


((vi, wi): (Vi, Wi, gi) —> (V,W, g) li € I) 


generates a K-covering sieve iff (i) (w; | ¿ € I) generates a J-covering sieve, 
and (ii) if J is inhabited, then (v; | i € I) generates an H-covering sieve. For 
the generating covers all satisfy this condition (note that GL is cover-preserving, 
since it is the restriction of a functor F — G which preserves epimorphic families); 
the covers satisfying the condition form a Grothendieck coverage; and every cover 
satisfying the condition may be obtained by pulling back and composing covers 
which are the images of H-covering or J-covering families. In particular, it follows 
that the projection M : D — C (the left adjoint of Q) is cover-preserving. 

Also, although K is not subcanonical (the object (1,0,0 — GL(1)) is covered 
by the empty sieve, though it is not strict initial unless F is degenerate), it is 
easy to see that any representable functor of the form D(—,QW) is a K-sheaf. 
Hence we may simplify the description of q*, at least on objects of G which 
belong to the subcategory C (equivalently, are representable as sheaves on C): 
by definition, g*(C (—,W)) is obtained by computing the left Kan extension of 
C (—,W) along Q and then applying the associated sheaf functor, but the Kan 
extension is simply D(—,QW) = C (M—, W), so it is a sheaf already. It follows 
that if, for an arbitrary sheaf D on (D, K), we define q:(D) to be the associated 
J-sheaf of the left Kan extension lim m (D), then morphisms qD — C correspond 
naturally to morphisms D — q*C for all representable sheaves C on (C, J). But 
since every object of G ~ Sh(C, J) can be J-covered by representables (or, if you 
prefer, because we could have included any given object of G in our site — indeed, 
we could really have ignored questions of size and taken A, B and C to be the 
whole of €, F and G respectively), it follows that q is the desired left adjoint 
to q*. Further, the Beck—Chevalley condition qmp* = g* fi, and the fact that the 
adjunction (q! 4 q*) is G-indexed, follow from the corresponding parts of 2.3.16 
and the fact that any given object of F (resp. any given morphism of G) could 
have been included in our generating subcategory B (resp. C). g 


The arguments in the proof of 2.3.17 are all constructive, and so can be 
applied to comma squares in @Yop/S for an arbitrary topos S with a natural 
number object - provided we strengthen the hypothesis ‘f is essential’ to ‘f is 
S-essential’ (cf. B3.1.1). Thus we may obtain Pitts’s theorem as stated in B4.1.8. 
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Given sites (C, J) and (D, K), we say a functor L: D — C is cover-reflecting 
if, given any object V of D and any J-covering sieve R on LV, there exists 
a K-covering sieve S on V such that L(f) € R for all f € S — equivalently, 
the sieve of all f: U — V such that L(f) € R is in K. (Authors who use the 
term ‘continuous’ for cover-preserving functors normally call functors with this 
property ‘cocontinuous’; the term ‘comorphism of sites’ is also in use.) 


Proposition 2.3.18 Let (C,J) and (D,K) be small sites, and L: DC a 
functor. The following conditions are equivalent: 


(i) L is cover-reflecting. 
(ii) There is a geometric morphism g: Sh(D, K) > Sh(C, J) whose inverse 
image is the composite 


Sh(C, J) —> [c°?, Set] “> [D° Set] ——> Sh(D, K) 


(where i is the inclusion functor and a is the associated sheaf functor). 
Gii) The right Kan extension functor lim,: [D°?, Set] — [C°P, Set] maps 
K -sheaves to J-sheaves. 
(iv) L*: [C°P, Set]  [D°P, Set] maps j-dense monomorphisms to k-dense 
monomorphisms, where j and k are the local operators corresponding to J 
and K as in 2.1.10. 


Proof The composite in (ii) is cartesian, since each of the three factors is; 
so it is an inverse image functor iff it has a right adjoint, which can only be 
the restriction of lim, to sheaves. Thus (ii) and (iii) are equivalent; and the 
equivalence of (iii) and (iv) was shown in A4.3.12. So it suffices to show that (i) 
and (iv) are equivalent. 

First assume (i); let A’ — A be a j-dense monomorphism in [C°P, Set]. We 
have to show that, for each x € L*A(V) = A(LV), the sieve of morphisms 
f:V’ — V such that L* A(f)(x) € L*A’(V’) is K-covering. But this is exactly 
{f | L(f) € R}, where R is the J-covering sieve {g: U — LV | A(g)(x) € A’(U)}. 

Conversely, assume (iv), and let R be a J-covering sieve on an object of the 
form LV. We may regard R as a j-dense subobject of C(—, LV), and form the 
pullback 

5S —————> L’*R 


n 
D(-, V) Sa L*(C (=, LV)) 
where 7 is the unit of (lim, 4 L*), in other words ny (f: V'’ + V) = L(f). 
Then S — D(-—,V) is k-dense; but the sieve corresponding to S is precisely 
{f | L(f) € R}. So L is cover-reflecting. Oo 
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There is a ‘cover-reflecting’ version of 2.3.10: for any geometric morphism 
f: F — E between Grothendieck toposes, we can find sites of definition for € and 
F such that f is induced by a cover-reflecting functor between them. However, 
the proof of this fact requires consideration of internal sites, and so we defer 
it to 2.5.5 below. On the other hand, we may immediately establish the cover- 
reflecting analogues of 2.3.12 and 2.3.13: 


Lemma 2.3.19 Let L: D —C be a functor between small categories. 


(i) For each coverage J on C, there is a unique smallest coverage L* J on D 
for which L is cover-reflecting. 


(ii) For each coverage K on D, there is a unique largest coverage Ly K on C 
for which L is cover-reflecting. 


Proof (i) One might be tempted to define L*+J(V) to consist of all sieves of 
the form {f: V’ + V | Lf € R} as R ranges over J(LV). It is easy to verify that 
this definition satisfies (C’) and (M) if J does; but in general it fails to satisfy 
(L), so we actually have to take Lt J to be the Grothendieck coverage generated 
by this collection of sieves, as in 2.1.9. Alternatively, we could define Lt J to be 
the unique (Grothendieck) coverage on D making the square 


Sh(D, L+ J) —> Sh(C, J) 


[DP Set] ——> [C°?, Set] 


a pullback (where the bottom edge is the geometric morphism induced by L as 
in A4.1.4), and then appeal to 2.3.18. 

(ii) Here we can give an explicit description of L} K (though, once again, 
we could also define it as the coverage corresponding to the image of the com- 
posite Sh(D, K) — [D°?, Set] — [C°P, Set]): a sieve R on an object U of C is 
L,K-covering iff, for all f: LV — U in C, the sieve {g: V' => V | Lg € f*R} 
is in K(V). Clearly, if J is any coverage making L cover-reflecting, then any 
J-covering sieve must satisfy this condition; so L} K will be the largest such 
coverage provided it is indeed a coverage. The verification of conditions (C’) and 
(M) is easy; let us verify (L). Suppose given an L+K-covering sieve R on U, 
and another sieve S on U such that f*S is Li K-covering for each f € R. For 
any h: LV — U, let Ra denote the sieve {g | Lg € h* R} on V, and similarly let 
Sn = {g | Lg € h* S}. Then Rp is K-covering, and for each g: V’ > V in Rp the 
composite ho Lg is in R, so (LG)*h*S is in L4} K(LV"). But this implies that 
{g': V" + V’ | Lg’ € (Lg)*h*S} is in K(V’); and this sieve is exactly g* (S). 
So by condition (L) for K we deduce Sp € K(V); since this is true for all h, we 
deduce S € Li K(U). oO 
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As an application of 2.3.19, we give a description of the localic reflection 
(cf. A4.6.12) of a Grothendieck topos in terms of sites: 


Example 2.3.20. Let (C, J) be an arbitrary small site, and let Q: C — P be the 
preorder reflection of C, i.e. the quotient of C by the congruence which identifies 
all parallel pairs of morphisms in C. Let K be the coverage Q,J on P. Since Q 
is full and bijective on objects, it is not hard to see that the explicit description 
of Q,J given in the proof above may be simplified in this case: a sieve R on U 
in P is K-covering iff the sieve {f | Q(f) € R} is J-covering. Now the composite 
Sh(C, J) — [C°P, Set] — [P°?, Set] satisfies the hypotheses of A4.6.10, since its 
first factor is an inclusion and the second is hyperconnected by A4.6.9; hence 
Sh(P, K) is the image of this composite in the hyperconnected-localic sense as 
well as the surjection—inclusion sense. Thus, by A4.6.2(e), Sh(P, K) is also the 
image in the hyperconnected—localic sense of the unique geometric morphism 
Sh(C, J) — Set. 


Remark 2.3.21 In connection with Example 2.3.20, it now seems appropriate 
to extend the notion of ‘ideal for a coverage’, which we defined for posites in 
1.1.16(e), to arbitrary small sites. If (C,T) is a small site (where T is, for the 
moment, not necessarily a Grothendieck coverage), we define a subset I of ob C 
to be a T-ideal if it is a sieve and, whenever we are given an object U of C and 
a cover (U; -> U | j € J) such that all the U; lie in J, then necessarily U € J. It 
is easy to see that the subterminal objects of Sh(C, T) correspond bijectively to 
T-ideals in C (and the direct proof in 1.1.16(e) that the T-ideals form a frame 
still works without the assumption that C is a poset); and, in the context of 
2.3.20, we have a bijection between J-ideals in C and Q, J-ideals in its preorder 
reflection. This provides an alternative proof that the site constructed in 2.3.20 
yields the localic reflection of Sh(C, J). 


There is an important special case in which the description of the localic 
reflection provided by 2.3.20 may be appreciably simplified. 


Lemma 2.3.22 LetC be a small regular category, and let J be a coverage on C 
which contains the regular coverage. Then the localic reflection of Sh(C, J) may 
be identified with Sh(Subc(1), J’) where J’ is the coverage induced on Subc(1) 
by J, as in 2.2.2, or equivalently the coverage I°J where I: Subc(1) > C is 
the inclusion functor. Moreover, under this identification, the hyperconnected 
geometric morphism from Sh(C, J) to its localic reflection is identified with that 
induced by I as in 2.3.4. 


Proof First we recall that the support oA of an object A of C is defined to 
be the image of the unique morphism A — 1; we may regard o as a functor 
C — Subc(1), left adjoint to the inclusion (cf. A1.3.1(ii)). We note that, since 
J contains the regular coverage, the composite Jo is cover-preserving; for if R 
is a J-covering sieve on an object A, and S denotes the sieve on oA generated 
by {of | f € R}, then the pullback of § along A —> c A contains R, and so S is 
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covering by axiom (L). Hence ø is cover-preserving when we equip Subc(1) with 
the induced coverage. 

Let (P, K) be the site constructed in 2.3.20 for the localic reflection. Since 
the codomain of ø is a preorder, it factors uniquely through Q: C — P, say by 
a: P — Subc(1). From the description of K given in 2.3.20, it is clear that 7 
is cover-preserving. Moreover, the regularity of C implies that o preserves finite 
products; it is easy to see that Q also preserves them, and hence @ is a cartesian 
functor (even though ø itself is not, in general!). So by 2.3.3 composition with 
g yields a functor o*: Sh(Subc(1), J’) — Sh(P, K). But since J contains the 
regular coverage, we know that for any object A of C the singleton {A ->a A} 
generates a J-covering sieve; hence its image under Q generates a K-covering 
sieve, and so any K-sheaf F on P necessarily satisfies F(Q.A) = F(QoA) for all 
A. It now follows easily that g* is full as well as faithful, and that it is essentially 
surjective on objects; so the geometric morphism induced by @ is an equivalence. 

Finally, the inverse image functor Sh(P, K) — Sh(C, J) was described in the 
proof of 2.3.18 as ‘compose with Q, and then apply the associated sheaf func- 
tor’; hence, if we regard its domain as Sh(Subc(1), J’), the description becomes 
‘compose with g, and then apply the associated sheaf functor’. So the direct 
image of the geometric morphism is given by composition with the right adjoint 
I of o, as in 2.3.4. 


If a cartesian functor F:(D,K)—(C,J) between small cartesian 
sites is both cover-preserving and _ cover-reflecting, then the functor 
F*: Sh(C, J) => Sh(D, K) is both the direct image and the inverse image of 
geometric morphisms, which form an adjoint pair in Top. This occurs, of course, 
in the situation considered in 2.2.3, where F* is an equivalence; but there are 
other examples of interest. 


Example 2.3.23 Let (C,J) be the site considered in A2.1.11(d): that is, C 
is a small full subcategory of Sp closed under passage to open subspaces, and 
a sieve R on an object X is J-covering iff it contains an open covering of X 
in the traditional sense, i.e. a jointly-surjective family of open inclusions with 
codomain X. Let € = Sh(C, J). Then, for any object X of C, we may identify the 
topos €/1(X) with Sh(C/X, Jx) as defined in 2.2.17. Now we may identify O(X) 
with a full subcategory of C/X in the obvious way; the inclusion O(X) — C/X 
preserves finite limits, and it both preserves and reflects covers when O(X) is 
equipped with its canonical coverage. Thus we obtain geometric morphisms 


Sh(X) > E/I(X) 


such that 7* and r, are both the functor obtained by restricting sheaves along 
the inclusion functor O(X) — C/X. In particular, r is right adjoint to i in Yop, 
since r* is left adjoint to 7*. We note also that i, is full and faithful (i.e. i is an 
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inclusion), since it is obtained by right Kan extension along a full and faithful 
functor; hence r is a local geometric morphism in the sense of 1.5.6. 

Similar examples may be constructed using the sites of A2.1.11(e) and (f): 
if X is a smooth manifold (respectively, an affine scheme of finite type over a 
field k) then O(X) embeds in Mf/X (resp. in the category of affine schemes 
over X) in a way which preserves and reflects covers; so we get an inclusion from 
Sh(X) to a slice of the topos of A2.1.11(e) (resp. (f)) which has a left adjoint 
in Yop. We shall have more to say about these examples, and others like them, 
in Section C3.6 below. 


Suggestions for further reading: Artin et al. [36], Mac Lane & 
Moerdijk [751]. 


C2.4 Internal sites and pullbacks 


As with the notion of frame (cf. Section C1.6), the notion of (small) site can be 
‘internalized’ in any topos S: formally, a site is simply a model of a certain higher- 
order theory (cf. Section D4.1) and we can consider models of this theory in S. 
Further, we may study the category of (S-valued) sheaves on any such internal 
site (C,T) (for which our notation will be Shs(C,T)); this category is a topos, 
and in fact a subtopos of the internal diagram category [C°?, S] (cf. B2.3.11), by 
the appropriate S-indexed analogue of the arguments in Section C2.2. And the 
Giraud theorem (2.2.8) ‘relativizes’ to the assertion that every bounded geomet- 
ric morphism with codomain S is equivalent to one of the form Shs(C,T) — S 
for some internal site (C, T) in S; we proved this, in slightly different language, 
in B3.3.4. 

We need to introduce some standard notation for internal sites. (We shall 
assume familiarity with the notation developed in Sections B2.3 and B2.5 for 
internal categories.) It is convenient to think of an internal coverage on an 
internal category C as being given by a span 


EEEE ee T 


where Co and C, are the object of objects and object of morphisms of C; T is the 
‘object. of all covers’, b assigns to each cover the object which it covers (the base 
of the cover), and c assigns the set of morphisms which is the covering family. 
The usual definition of a coverage would demand that the pair (b,c) should be 
jointly monic, but it clearly does no harm if we omit this requirement, that 
is if we allow covers to ‘occur with multiplicity’ as members of T. Since this 
relaxation has the effect of simplifying some of the arguments relating to change 
of base, we shall henceforth allow it. 
On the other hand, we do require the ‘book-keeping’ condition 


(Vt: T)(Va: C,)((a € c(t)) > (do(a) = b(t))) 
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which says that the codomains of all the members of a cover coincide with the 
base of the cover; we shall denote this condition by (B). In ‘diagrammatic’ terms, 
it says that if we form the pullback 


P 
Ws ee 


q n 


| ane PO 


(where (n,e):€c,>> PC; x C; is, as usual, a tabulation of the universal relation 
PC, + Cı), then the square 


commutes. 

We shall also require our coverages to satisfy the diagrammatic form of the 
coverage axiom (C), of course; and in this section we shall generally require 
them to be sifted in the sense defined after 2.1.3 — we denote the latter condition 
by (S). In our present context, a sifted coverage is simply one for which the 
morphism c factors through the subobject Sv(C) — PC, of sieves on C, where 
a subobject S — Cı is called a sieve if it is closed under right multiplication, 
i.e. the composite 


d 
S Xo, Ci >> Cı PO 2 = SC 


factors through S — Cı. (Recall the convention of Section B2.5 regarding the 
use of the symbol xc,.) 

One reason for working with sifted coverages is that it makes the formulation 
of axiom (C) rather simpler: to say that (T,b, c) satisfies (C) means that, if we 
form the subobject Q — T xc, Ci Xc T which consists of ‘triples (t',a, t) such 
that aa’ € t for all a’ € t”, then the composite 


UPX 
Q >—— T xq, C1 sa Ts Xo, T 


is an epimorphism. We may similarly formulate the axioms (M) and (L) of 
2.1.8 in diagrammatic form; we leave this to the reader. Note that we shall not 
automatically assume in this section that our coverages satisfy axiom (L); the 
reason for this will become apparent in 2.4.9 below. 
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When dealing with coverages on an internal poset C, we shall (as in 1.1.16(e) 
and elsewhere) tend to replace the span Co «+ T — PC; by one of the form 
Co — T — PC; that is, we shall ‘identify the morphisms in a cover with their 
domains’. We again leave to the reader the task of reformulating the axioms (B), 
(C), (S), (M) and (L) in terms of coverages presented in this way. 

We shall be concerned in this section mainly with ‘change of base’; that 
is, with the effect on internal sites in S of a geometric morphism f: 7 — S. 
Of course, given a site (C,T) in S, we know from B3.3.6 that the pullback of 
(Shs(C,T) — S) along f (exists and) is a bounded T-topos; indeed, from the 
proof of that result, we know that it can be represented as Shr(f*C, fFT) for 
some coverage f#T on f*C. The problem, therefore, is to identify f*T. 

Before tackling the general question, we shall deal with the problem of change 
of base for locales. Clearly, given a geometric morphism f: T — S and a locale Y 
in TJ, we may construct the hyperconnected-—localic factorization of the composite 


f 


Shy 37S Ss 


in order to obtain a locale fi(Y) in S. We can be more explicit: O( f:(Y)) is simply 
f.(O(Y)), since the latter is the direct image under the composite above of the 
subobject classifier of Sh7(Y). (Recall from B2.3.7 that f, preserves complete- 
ness of internal posets; it also preserves Heyting algebras, since the latter can be 
defined equationally (A1.5.11), and thus induces a functor Frm(T) —> Frm(S), 
whose ‘dual’ we denote henceforth by fı: Loc(T) — Loc(S).) 

Since pullbacks of localic morphisms (exist and) are localic, we also have a 
functor f*: Loc(S) — Loc(7) defined by saying that the diagram 


Sh7(f*(X)) —~ Shs(X) 


T— -s 


should be a pullback for each locale X in S. The universal property of pullbacks, 
and functoriality of the hyperconnected-localic factorization, tells us immedi- 
ately that f* is right adjoint to fı; equivalently, the corresponding functor f* 
on frames (defined by f#(O(X)) = O(f*(X))) is left adjoint to f,. However, 
f# is not simply f* applied to frames (indeed, the latter does not in general 
preserve completeness of internal posets), which is why we have adopted a dif- 
ferent notation for it. (On the other hand, if X is a discrete internal locale, that 
is O(X) is a full power object PX, then f#(O(X)) is simply P(f*X), since we 
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know that 
T/F X — S/X 


f 


T ————— sS 
is a pullback. Thus our use of the notation f* for locales is consistent with its 
use for objects, considered as discrete locales.) 

To obtain an explicit description of the functor f#, it is actually convenient 
to extend it to a larger category, namely that of internal complete semilattices 
in S (cf. Section C1.1). It is of course obvious that f. extends to a functor 
CjSLat(T) — CjSLat(S) (the fact that it preserves morphisms of complete 
semilattices follows from the fact that it preserves adjunctions), so the problem 
is to define a left adjoint to this functor. But in fact this is easy, because the 
categories of complete semilattices are monadic over the underlying toposes; so 
we can simply use A1.1.3(i) to lift the left adjoint of f,: T — S to one at the 
level of complete semilattices. Explicitly, suppose we have a free presentation of 
a complete semilattice A in S, that is a coequalizer diagram 


a 
PR——3 PG 
b 


A 


in CjSLat(S). (Recall from the proof of 1.1.3 that free complete semilattices 
are simply power objects; the argument given there was constructive, and so 
valid in any topos.) Then, since we necessarily have f#(PX) = P(f*X) for 
any object X, and f# must preserve coequalizers, there will be a coequalizer 
diagram P(f*R) 3 P(f*G) > f#A in CjSLat(7). And the two morphisms 
P(f*R) 3 P(f*G) are easily described: since a and b correspond to morphisms 
R = PG, or equivalently to a pair of relations R % G, we simply apply f* to 
these to obtain a pair of relations f*R % f*G, and then convert them back into 
complete semilattice morphisms P(f* R) 3 P(f*G). 


Lemma 2.4.1 


(i) The adjunction (f# 4 f,) is enriched over CjSLat(S), i.e., for complete 
semilattices A,B in S,T respectively, we have [A, fB] = f,[f* A, B], 
where the square brackets denote the ‘internal homs’ of 1.1.5. 

(ii) f# is a strong monoidal functor, that is, we have f#(Qs) S Qr and 
f#(A@ B) = f*#A® f*B, compatibly with the coherence isomorphisms. 

(iii) f* restricts to a functor Frm(S) — Frm(T), left adjoint to the 
corresponding restriction of fs. 


Proof (i) First we note that the adjunction (f # 4 f,) extends in an obvi- 
ous way to an S-indexed adjunction; hence it is enriched over S, i.e. [A, f,B] 
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and f,[f# A, B] are isomorphic as objects of S. To see that this isomorphism is 
an isomorphism of internal posets in S, we observe that the order-relation By 
on B is itself a complete join-semilattice, and a parallel pair u,v: A 3 f,B 
of complete join-homomorphisms satisfies u < v iff (u,v) factors through 
f.(Bi) — f,(B x B). Hence the partial order on [A, f,B] may be identified 
with [A, fa Bı]; and similarly f,[f#A, Bı] is the partial order on f,[f# A, B]. So 
[A, f.B] and f,[f# A, B] are isomorphic as internal posets, and hence as complete 
join-semilattices. 

(ii) The identity f# (Qs) = Qr is immediate from the fact that Qs is the 
free complete semilattice on 1 (and f* preserves 1). The preservation of binary 
tensor products follows immediately from (i), since for any complete semilattice 
C in T we have natural bijections 


f#(A@B) — C 
ASB —> fC) 
A_— BAO] 
FA — _[fFB,C] 
fFAQ fFB — C . 


Alternatively, we could deduce the result from the explicit description of f# 
above, by showing that f* preserves the presentation of A® B given in the proof 
of 1.1.5. 

(iii) It follows immediately from (ii) that f# lifts to a functor 


CMon(CjSLat(S)) — CMon(CjSLat(7)) . 


The functor f,: CjSLat(7) — CjSLat(S) is not in general strongly monoidal, 
but as the right adjoint of a strong monoidal functor it is automatically lax 
monoidal, i.e. we have canonical maps Qs — f,(Q7) and f,(A) @ f.(B) —> 
f.(A ® B) satisfying the appropriate compatibility conditions; so f, does lift 
to a functor between the categories of commutative monoids, and the adjunc- 
tion (f# 4 f,) also lifts to this level. But f, also maps Frm(T) into Frm(S), 
as we observed earlier; and from the description of the coreflection functor 
CMon(CjSLat(S)) — Frm(S) given in the proof of 1.1.10, it is easily seen 
that f, commutes up to isomorphism with these coreflections. So by the dual of 
A1.1.3(ii) the left adjoint of f, may be lifted to a functor Frm(S) — Frm(T), 
as required. o 


As in 2.2.4(b), by a basis for a locale X (or for the corresponding frame 
O(X)), we mean a subobject B — O(X) which generates it as a complete join- 
semilattice, i.e. such that the induced CjSLat homomorphism PB — O(X) is 
an epimorphism. We saw in 2.2.4(b) that if we regard B as a poset, with the 
ordering it inherits from O(X). and equip it with the coverage T such that T(U) 
consists of all subsets of | (U) whose join (in O(X)) is U, then the T-ideals (in 
the sense of 1.1.16(e)) in B are exactly the subsets {U : B | U < V} where 
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V €e O(X), and hence Sh(B,T) ~ Sh(X). But we can also express this by 


saying that 
PO 222-2 PB 


is a coequalizer in CjSLat, where C is the set of those subsets of B whose join 
in O(X) lies in B, and the two complete semilattice morphisms PC <3 PB are 
induced by the inclusion C > PB and the mapping S + {V S}. 

The arguments in the preceding paragraph are constructive, and so make 
sense in an arbitrary topos. Hence we may obtain an explicit description of 
f#(L), for a frame L in S, in terms of a basis: 


O(X) 


Lemma 2.4.2 Let f: 7 — S be a geometric morphism; let L be an internal 
frame in S, and let m: B — L be a basis for L. Form the pullback 


t= = Y 


By seen eae ey 


Then f*#L may be presented as the coequalizer in CjSLat(T) of the pair of 
complete-semilattice morphisms P(f*T) 3 P(f*B) which correspond to the 
morphisms 


*e Q 
pr -ËS pp) 28s PUB) 
and 
PE yerat ; 
f*T ——> f*B ——> P(f*B). 
(In particular, the coequalizer thus described is a frame.) CO 


Equivalently, f#L may be presented by the internal site (f*B, f#T) in T, 
where f#T is the coverage given by the span 


*b *e 
f*B TES FT See P(f* B). 
We note that, since ¢g is not a monomorphism in general (cf. 3.1.7 below), 
the pair (f*b,ġBf*c) will not in general be monic even if (b,c) is. This is the 
reason why we chose to relax our definition of coverage at the beginning of this 
section; had we not done so, we should have had to replace f*T by its image in 
J*B x P(f*B) at this point. 

Note in particular that we may always take B to be the whole of L; in 
this case C is simply PL, but we note that the covers which we obtain in our 
internal site (f* L, f# PL) for f#L are indexed by the object f*(PL) rather than 
P(f*L). Thus, unless the comparison map ¢z is an isomorphism, we will not 
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simply obtain the ‘canonical coverage’ on f*L. This is another reflection of the 
fact that f#L is not simply f*L in general. 

On the other hand, let us note a particular case of this construction where 
things do work rather simply. By a coherent frame, we mean one of the form 
IB, where B is a distributive lattice (cf. 1.1.3). For such a frame, we may take 
B itself (or rather the set of principal ideals) as a basis, as we noted in 2.2.4(b); 
and we may take the coverage on it to be indexed by the set K B of finite subsets 
of B. But K does commute with inverse image functors (cf. D5.4.12), so we 
deduce 


Corollary 2.4.3 If L= TIB is a coherent internal frame in S, and f:T 3S 
is any geometric morphism, then f#L = I(f* B). o 


We now revert to arbitrary sites (C,T) in S. The foregoing provides the 
motivation for considering the span 


f*b bc, f*e 
Fois => IARR O 


as our candidate for the coverage f*T that we require to form the pullback 


Sh7(f* C ,ftT) — > Shs(C, T) 


| 


Lemma 2.4.4 Let C be an internal category in a topos S, and suppose given 
a span (Co + T — PC). If T satisfies any of the conditions (B), (C), (S) or 
(M), then so does f*T for any geometric morphism f: T > S. 


Tee ere 


Proof This is mostly straightforward verification, using the diagrammatic 
forms of the conditions which we described above and the fact that 


f*(€c,) —— Eji 
f*(n) n 
oc, 
F (PCi) —> P(f*C:) 


is a pullback by the definition of @c,. For example, it is easily seen that if 
a subobject S — Ci is a sieve, then f*S is a sieve in f*C; hence there is a 
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commutative square 


f*(Sv(C)) —> Sv(f*C) , 


* Pc, * 
f*(PC\) —> P(f*C) 
from which it follows that f#T inherits siftedness from T. o 


In contrast to 2.4.4, the ‘local character’ condition (L) is not preserved under 
change of base; we shall see a counterexample in 2.4.9 below. 

In proving that f#T is indeed the ‘pullback coverage’ that we require, the 
key ingredient is the following: 


Lemma 2.4.5 Let f:T — S be a geometric morphism, (C,T) an internal 
site in S, and E a cocomplete T-indexed category. Then a T-indered functor 
f*C — E sends covers in f#T to epimorphic families in E iff the corresponding 
S-indered functor C — (f*)*(E) (cf. B2.3.14) sends covers in T to epimorphic 
families. 


Proof This is really just a matter of unravelling the definitions. Let 


p 
Ws 


q n 


be a pullback in S; then in the diagram 


f*p 


FW A f” (€c) — > Efc 


f*q fin n’ 


*e 7 
pr ZS pea) Ss pio) 
both squares are pullbacks (the right-hand one by the definition of ¢c, ), i.e. f*W 
is the pullback of € f-c, along the morphism c’ of the coverage f #T. Thus to say 
that F: f*C — E sends covers in f#T to epimorphic families is to say that, if we 
form the object A = F(f*b: f*T — f*Co) of the fibre EFT and the morphism 
P(e'p!: f*W = f*Cy): B > (f*a)*(A) in EFW (where p' is the top composite 
in the second diagram above), then the transpose © f-g(B) — A of the latter is 


586 SHEAVES ON A SITE C2.4 


an epimorphism in €f°7. But this condition involves only the fibres of E over 
objects of the form f*J, so it is easily seen to be the same as the condition 
that F sends covers in T to epimorphic families when considered as a functor 


C — (f*)*E. | o 


Theorem 2.4.6 Let(C,T) be an internal site in a topos S (where the coverage 
T is assumed to be sifted, but not necessarily a Grothendieck coverage), and 
f:T — S a geometric morphism. Then there is a pullback square 


Sh7(f*C, f#T) —> Shs (C,T) 


T ——— & 
in Top. 


Proof Let g: E — T be a T-topos. We shall show that geometric morphisms 
E — Shr(f*C, f#T) over T correspond to geometric morphisms € — Shs (C, T) 
over S; this is clearly sufficient. But the latter correspond to S-indexed functors 
C — (f*)*(g*)*(E) (where E denotes the canonical indexing of E over itself) 
which are C°P-torsors in the sense of B3.2.3 and send covers in T to epimorphic 
families, by the S-indexed version of 2.3.9. The passage between such functors 
and J-indexed functors f*(C) — (g*)*(E) preserves and reflects the property 
of being a torsor, and we have seen in 2.4.5 that it preserves and reflects the 
condition on covers. So the result is established. o 


Remark 2.4.7 By analogy with 2.4.5, one might hope to show that the cor- 
respondence of B2.3.14 between T-indexed functors f*C°? — T and S-indexed 
functors C°P — (f*)*(T) identifies the f#T-sheaves on f*C with those functors 
CoP — (f*)*(T) which satisfy the sheaf axiom for covers in T. Unfortunately, 
this is not true in general. The reason is that, for a contravariant F: f*C°? — T, 
we obtain (in the notation of 2.4.5) a morphism (f*q)*(A) — B in T/f*W, 
which transposes to a morphism A — I]y-,(B) in T/f*T, which must be the 
equalizer of two other morphisms (also defined using H-functors) if F is to be 
an f#T-sheaf. But f* does not commute with IJ-functors in general, so this is 
not equivalent to the condition for F (in its other incarnation as an S-indexed 
functor) to be a T-sheaf. However, there are two particular cases in which we 
can make this identification: one is when the change-of-base morphism f is 
locally connected, as defined in 3.3.1 below, and therefore does commute with 
II-functors. The other is when C is cartesian (or at least has pullbacks) and 
every cover in T is finite (that is, indexed by a finite cardinal); for in this case 
we only need to consider I]-functors along morphisms q: W — T which are finite 
cardinals in S/T, and since inverse image functors preserve exponentiation to 


C2.4 INTERNAL SITES AND PULLBACKS 587 


the power of a finite cardinal (D5.2.11), they also commute with such I-functors 
(cf. D5.2.13). 

Thus, in either of these cases, we may interpret 2.4.6 as saying that the 
pullback of Shs(C,7) along f is nothing other than the category of ‘T-sheaves 
on C with values in T’. Note in particular that the second case includes the case 
where C is a coherent internal category and T is its coherent coverage; thus we 
may view this result as the generalization of 2.4.3 above to the non-localic case. 


Next, we establish a criterion in terms of internal sites for surjectivity of 
bounded geometric morphisms, which we shall use frequently in the next chapter. 


Lemma 2.4.8 Let (C, J) be an internal site in S, such that C has a termi- 
nal object t: 1 — Co and J is a Grothendieck coverage. Then the following are 
equivalent: 


(i) The canonical geometric morphism g: Shs(C, J) — S is a surjection. 
(ii) ‘Every cover of t is inhabited’; that is, if we form the pullback 


CS 


1 ————> Co 


then the composite Q — J — PC, factors through the object P*Ci of 
inhabited subobjects of Cı, i.e. the image of n:€c,— PC). 


Proof Since hyperconnected morphisms are surjective, the morphism of (i) 
is surjective iff its localic part is surjective; but by 1.5.1(ii) this happens iff the 
unique frame homomorphism à: Qs — g4(Qsn(c,7)) is monic. We recall from 
2.3.21 that we may identify the codomain of this morphism with the poset of 
J-ideals of C; its right adjoint À, sends a J-ideal J — Co to the truth-value 
[J = Co], or equivalently to ft € I], and so A(p) must be the largest J-ideal J 
for which (t € I) =p is valid. Consider the set 


I(p) = {u: Co | (Bj: Jb) = u) A (a(i) =p))} 


where o(j) denotes the assertion that c(j) is inhabited. Using the local character 
axiom, it is easily verified that I(p) is a J-ideal, and using axiom (M) we see 
that if p holds then I(p) is the whole of Co, i.e. p= A./(p). On the other hand, 
for any J-ideal K we have 


(Vu: Co)((u € I(.(K))) > (u € K)) 


since the hypothesis implies that u has a cover all of whose members lie in K. 
So the mapping p++ I(p) is the required left adjoint to A.. 
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Thus Sh(C, J) — S is surjective iff we have 


(Vp: Q)((t € I(p)) =p) - 
But this is exactly the statement that every cover of t is inhabited. o 


A site satisfying the hypotheses of 2.4.8 is called consistent. In particular, a 
frame O(X) (or the corresponding locale X) is called consistent if every covering 
of its top element is inhabited; since the canonical coverage on a frame clearly 
has local character, this is equivalent to saying that the geometric morphism 
Shs(X) — S is surjective. (Note that, in a non-Boolean topos S, it is strictly 
stronger than saying that O(X) is nontrivial, i.e. that its top and bottom ele- 
ments are disjoint; the latter is equivalent to saying that Shs(X) — S has dense 
image.) Note also that the local character condition (L) really is needed in the 
proof of 2.4.8: if we are given a general coverage T and we merely know that every 
cover of some particular object t is inhabited, this tells us nothing about ‘covers 
of covers of t’, and so the Grothendieck coverage generated by T' (defined as in 
2.1.9) may fail to have the corresponding property. (But compare 3.1.18 below.) 


Example 2.4.9 We may now, as promised, give an example to show that 
condition (L) is not stable under change of base. For the condition ‘every cover 
of t is inhabited’ is clearly stable under change of base, as are all the other 
hypotheses of 2.4.8; hence if (L) were also stable, the class of bounded surjections 
would be stable under pullback in Top. But we know that this is not so; for we 
saw in 1.2.12 that epimorphisms are not stable under pullback in Loc, and 
by 1.5.1(ii) epimorphisms of locales correspond to surjections between localic 
toposes (and pullbacks of locales correspond to pullbacks of toposes, by 1.4.8). 


In view of 2.4.9, we often need to replace the coverage f#T obtained after a 
change of base by its ‘Grothendieck completion’ as in 2.1.9. In general, it is hard 
to give an explicit description of the latter; but in certain cases we can deduce 
properties of it from the fact that it has the same sheaves as the coverage from 
which we start. A particular example, which will be of use in Section C3.2, is 
the following: 


Lemma 2.4.10 Let (C,T) be an internal site in a topos S, and U an object 
of C (that is, a morphism 1 — Co in S). If U ıs T-irreducible (in the sense that 
‘the only T-covering sieve on U is the maximal sieve My’ is true in the internal 
logic of S), then it remains irreducible in the Grothendieck coverage T generated 
by T. 


Proof If U is irreducible, then the (S-indexed) functor C (U, —): C — S maps 
covers in T to epimorphic families in S; hence, for any object A of S, the functor 
AC (U=); CP — S is a T-sheaf. So by 2.1.9 it is also a T-sheaf; in particular, if 
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(U; ~U|ieljisa T-covering family, then the induced map 


AC WU) —> [p40 œ Al. CU,U,) 
icl 


is monic. Putting A = Q (so that A‘): S°P — S is faithful by A2.2.3, and 
hence reflects monomorphisms), we deduce that Į fiez C(U,U:) — C(U,U) is 
epic in S. Pulling the latter back along the morphism 1 — C (U, U) which names 
the identity morphism on U, we deduce that (it is true in the internal logic of S 
that) some U; — U must be split epic; equivalently, if the U; — U form a sieve, 
then that sieve must be My. 0 


Our next aim in this section is to establish a Beck-Chevalley condition for 
internal frames in toposes. As we shall see, this is closely related to the pullback- 
stability of the hyperconnected-localic factorization of Section A4.6. 


Theorem 2.4.11 Suppose given a pullback square 


E E, 


H 
J 
Fi Se 


of toposes and geometric morphisms, and suppose either that f is bounded or 
that g is bounded and E has a natural number object. Then 


(i) The square 


k# 
Frm(G) —> Frm(H) 


UP h, 


f# 
Frm(€) —> Frm(F) 


commutes; more precisely, the canonical natural transformation f#g, — 
h,k is an isomorphism. 


(ii) If g is hyperconnected, so is h. 


Proof We note first that the two parts of the theorem are equivalent. If (ii) 
holds then the hyperconnected-localic factorization is stable under pullback, 
since we know that pullbacks of localic morphisms are localic. So, using the 
descriptions of f# and g, in terms of their effects on localic toposes, with which 
we began, we see that both ways round the square produce the same effect 
when applied to a localic topos over G. On the other hand if (i) holds and g is 
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hyperconnected, then on chasing the subobject classifier of G around the Beck- 
Chevalley square we see that h, must also preserve Q, i.e. h is hyperconnected. 
In the case when we assume f to be bounded, we have already proved (ii), 
in B3.3.7. So we need only consider the case when g is bounded, but f need not 
be; in this case we shall prove (i). 
If g is bounded, then we may factor it as a composite 


Gs 10 6) — Ss 


where C is an internal category in € (which may be assumed to be cartesian) 
and 7 is an inclusion; and it clearly suffices to prove the result for the two factors 
separately. In fact we shall prove it when g is of the form [C°?, €] — E and when 
g is localic; this also suffices, since every inclusion is localic (A4.6.2(a)). But in 
the case when g (and hence also h) is localic, there is nothing to prove; for in 
this case, the effect of g, on localic toposes over G is simply that of composition 
with g, and similarly for h,. So the assertion reduces, as in A2.2.4, to the fact 
that a composite of two pullback squares is a pullback. 
We are thus reduced to showing that the square 


(cer, fy l 
Frm([C°?, €]) oe Frm([f CEF] 


f* 
Frm(€) ————————> Frm(F) 
commutes, where C is an internal cartesian category in € and the vertical arrows 
are obtained by applying the direct image functors (‘evaluate at 1’, where 1 is 
the terminal object of the appropriate category) to internal frames. In fact we 
show that the diagram commutes when we replace Frm by CjSLat through- 
out; since we know that the functors on Frm are the restrictions of those 
on CjSLat, this will suffice. Now in 1.6.9 we gave a description of internal 
complete (semi)lattices in [C°?, Set] where C is a small cartesian category; the 
description was constructive, and so remains valid when Set is replaced by an 
arbitrary topos. That is, an object of CjSLat([C°?, €]) is just an €-indexed func- 
tor from C°P to the €-indexed category CLat(€) of internal complete lattices 
in E, satisfying the Beck—Chevalley condition for pullback squares in C. Given 
such a functor, we may compose it with f*: CLat(€) — CLat(F) - note that, 
although f# was defined as a functor on complete semilattices, it in fact pre- 
serves morphisms of complete lattices since these are exactly the morphisms of 
CjSLat which have left adjoints in CjSLat - to obtain an €-indexed functor 
CP — (f*)*(CLat(F)), which by B2.3.14 is essentially the same thing as an 
F-indexed functor f*C°P — CLat(F). It is straightforward to verify that the 
Beck—Chevalley condition of 1.6.9 is preserved by this construction, and so we 
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have defined a functor CjSLat((C°?,€]) — CjSLat({f*C°?, F]). Moreover, it 
is easy to verify that this is left adjoint to the functor induced by f,, so it is 
the functor [C°P, f]#* by uniqueness of adjoints. And since, as already indicated, 
the vertical arrows in the square above are given by evaluation at the terminal 
objects of C and of f*C, the commutativity of the square is now immediate. O 


The assumption, in the case when g is bounded, that € has a natural number 
object is needed in order to justify restricting our attention to cartesian internal 
categories in the course of the proof: we need to be able to replace an arbitrary 
site of definition for G over E£ by one whose underlying category is cartesian, as in 
the proof of 2.2.8 (and cf. also B3.2.6). The same assumption is of course needed 
for the second case of the following corollary: 


Corollary 2.4.12 The hyperconnected-localic factorization of an arbitrary 
(resp. bounded) geometric morphism is stable under bounded (resp. arbitrary) 
pullback. O 


In fact the Beck—Chevalley square of 2.4.11(i) still commutes if we replace 
frames by complete join-semilattices throughout. We omit the details of the 
proof, which may be found in [560]. 

Another consequence of 2.4.11(ii) is worth stating explicitly: 


Corollary 2.4.13 For any topos S, the localic reflection functor BYop/S — 
LTop/S (cf. A4.6.12) preserves finite products. 


Proof It clearly preserves the terminal object. Let pj: E; > S (i = 1,2) be 
bounded morphisms, and let L; — S denote their localic reflections. Then we 
may form the diagram 


E xs E: —> Lı xs Eg ——> Ez 


E xs Lı —> Lı xs La ——> Lz 


E& ————> Lı ———_ > S 


in which all the squares are pullbacks; so 2.4.11(ii) implies that all four edges, 
and hence the diagonal, of the upper left square are hyperconnected. Similarly, 
the diagonal of the lower right square is localic; so Lı Xs £2 is the reflection of 
E xs Ein LTop/S. Q 
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We may now prove a long-promised result, namely the converse of B3.3.8(ii). 


Proposition 2.4.14 Letp: E — S be a bounded geometric morphism such that 
the diagonal Ap: E.— E xg E is an inclusion. Then p is localic. 


Proof Let A be an object of E. Since E/A xg E/A ~ E/Ax A (cf. B3.2.14), 
we have a pullback square 


JAKA cE 
Ap 


EJA xs EJA — E xs E 


and hence the left vertical edge of this square is an inclusion. So by A4.6.12 
and 2.4.13 its reflection in £Top/S is an inclusion into a product; equivalently, 
by 1.5.1(i) and 1.4.8, we have an inclusion g: Y — X x X in Loc(S), where X 
and Y are the internal locales in S defined by O(X) = p.(Q4) and O(Y) = 
p.(QAXA), 

We now assume for notational simplicity that S is the classical topos of sets. 
Identifying O{ X xX) with the CjSLat tensor product O(X)®O(X), asin 1.1.11, 
we see that g* sends a typical element W = V;ez(U: 8 Vi) of O(X) ® O(X) to 
Uier(Ui x Vi). In particular, taking W to be the element g,(A), where A is 
the diagonal subobject A — A x A regarded as an element of O(Y), we have 
an expression for A = g*g,(A) as a union of ‘rectangles’ U; x V;. Applying the 
inverse image of the continuous map X — Y corresponding to A, we obtain 
X =Uie (Ui N Vi); but the fact that U; x Vi < A in O(Y) clearly implies that 
U,V; is a subterminal object. So we have expressed X as an J-indexed union of 
open sublocales which are subterminal objects; equivalently, we have constructed 
a diagram of the form 


B A 
p* (I) 
in E. But we can do this for any object A of £; so p is localic. o 


For an alternative proof of 2.4.14 using descent theory, see 5.3.10 below. Next, 
we note one further consequence of 2.4.11: 


Corollary 2.4.15 A geometric morphism f: F — € is hyperconnected iff the 
functor f#: Frm(€) — Frm(F) is full and faithful. 
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Proof First suppose f is hyperconnected. Then for any localic €-topos 
g: €' — E, the top and right edges of the pullback square 


F ———> €&' 


F ———>E 


form the hyperconnected—localic factorization of its diagonal; so the counit of 
the adjunction (fi 4 f*) between categories of internal locales, or equivalently 
the unit of the adjunction (f# 4 f,) between categories of internal frames, is an 
isomorphism. Hence f* is full and faithful. The converse follows from considering 
the component of the unit at the initial object Qe of Frm(€). oO 


In A4.1.16, we introduced the Beck-Chevalley condition for objects of toposes 
(in what was then merely assumed to be a commutative square of geometric 
morphisms — though in practice the squares for which we are able to verify the 
condition are usually pullbacks). Of course, objects may be regarded as discrete 
internal locales; but, although functors of the form f* preserve discrete locales, 
as we noted earlier, those of the form f, usually do not. We cannot therefore 
hope to deduce a Beck-Chevalley condition for objects from the result of 2.4.11; 
and indeed it does not hold in general. 

We need to introduce some terminology to describe those morphisms for 
which it does hold: 


Definition 2.4.16 Let f: F — E be a geometric morphism. 


(i) We say f is a right (weak) Beck-Chevalley morphism if, for every bounded 
morphism g: G — E with the same codomain, the pullback square 


Sot Rs 


H G 
pattie 


satisfies the (weak) Beck—Chevalley condition as defined in A4.1.16, i.e. the 
canonical natural transformation 0: g* f, — k.h* is an isomorphism (resp. 
monic). 

(ii) We say f is a left (weak) Beck-Chevalley morphism if, for every bounded 
g as above, the pullback square satisfies the ‘opposite-handed’ Beck- 
Chevalley condition that f*g. — h,k* is an isomorphism (resp. monic). 
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(iii) We say f is a stable left/right (weak) Beck-Chevalley morphism if the 
pullback of f along any bounded morphism is a left/right (weak) Beck- 
Chevalley morphism. 


The assignment of the adjectives ‘right’ and ‘left’ to the first two parts of the 
above definition is more or less arbitrary. The convention that we have adopted 
is justified by the fact that, if f is a right Beck—Chevalley morphism, then it is 
the right adjoint f, which appears as one of the functors involved in the Beck- 
Chevalley condition. We shall often abbreviate the properties defined in 2.4.16 
by their initials; thus ‘SLBC morphism’ means ‘stable left Beck—Chevalley mor- 
phism’, ‘RWBC morphism’ means ‘right weak Beck-Chevalley morphism’ and 
so on. Fortunately, we shall see in the next chapter that the stable properties, 
at least, turn out to be equivalent to other properties that have less cumber- 
some names. (It would be possible to introduce yet another variable into 2.4.16, 
besides the choices of left or right, weak or strong Beck—Chevalley condition, and 
stable or ‘unstable’: we could also allow the variant where the morphism f is 
assumed to be bounded, and we consider its pullbacks along arbitrary geometric 
morphisms rather than bounded ones. However, it seems not to be worth making 
this distinction in practice.) 

We note that A4.6.8 has an immediate reinterpretation in our new 
terminology: 


Lemma 2.4.17 Hyperconnected morphisms are both SLWBC and SRWBC. 


Proof By 2.4.11(ii), any pullback of a pullback of a hyperconnected morphism 
yields a square in which two opposite sides are hyperconnected. So by A4.6.8 the 
square satisfies both weak Beck—Chevalley conditions. oO 


We shall have more to say about Beck-Chevalley morphisms at several points 
in Chapter C3. 


Suggestions for further reading: Joyal & Tierney [560], Kock & 
Schmidt [648]. 


C2.5 Fibrations of sites 


We saw in 2.3.10 that any geometric morphism between Grothendieck toposes 
may be induced by a morphism of sites. However, this result is not best possible: 
there is a very special type of morphism of sites, called a fibration of sites, which 
suffices to obtain all such geometric morphisms, and such fibrations may be 
used to give characterizations of particular classes of geometric morphisms. The 
construction of a fibration of sites from a given geometric morphism involves con- 
sideration of internal sites, which is why we postponed it until after Section C2.4; 
however, in this section we shall not be concerned with change of base (except 
in the proof of 2.5.14 below), and so we shall assume for notational convenience 
that our base topos is Set. (Nevertheless, all our arguments are constructive, 
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and so they apply to morphisms between bounded S-toposes, for any S.) We 
shall also reinstate our standing assumption from Sections C2.2 and C2.3 that 
all coverages are Grothendieck coverages unless stated otherwise; however, we 
shall not assume as in Section C2.3 that the underlying categories of our sites 
are cartesian. (For the reason why not, see 2.5.9 below.) 

In fact a fibration of sites, like a geometric morphism of toposes, is not asingle 
functor but an adjoint pair of functors. The key to understanding the notion is 
the following result, which could equally well have been proved in Section C2.3: 


Lemma 2.5.1 Let (C,J) and (D,K) be small sites, and let F:C —D and 
L:D—-C be functors with (L 4 F). Then F is a morphism of sites iff L is 
cover-reflecting. 


Proof As F has aleft adjoint, it is automatically flat in the sense of 2.3.7, since 
the comma category (V | F) has an initial object for each V; so we have to prove 
that F preserves covers iff L reflects them. First suppose F preserves covers; let 
V € ob D and let R € J(LV). Then the morphisms F(f), f € R, generate a 
K-covering sieve on FLV; pulling this back along the unit ay: V — FLV of 
the adjunction, we obtain a K-covering sieve S such that, for every g € S, the 
composite ayg factors through some F(f), f € R. But this is equivalent to 
saying that L(g) factors through f, and so implies that L(g) € R. Thus we have 
verified that L is cover-reflecting. 

Conversely, suppose L reflects covers. Let U € ob C, and let R € J(U). 
Pulling back along the counit By: LFU — U, we obtain a J-covering sieve 7, (R) 
on LFU; applying the definition, we deduce that the sieve 


S = {g:V > FU | BuL(g) € R} 


isin K(FU). But Gy L(g) is simply the transpose g of g across the adjunction 
(L 4 F); and g itself factors as F'(g)ay. Thus S is contained in the sieve gener- 
ated by the images under F of members of R, and in particular the latter must 
be K-covering. oO 


If the conditions of 2.5.1 are satisfied, then it is clear that the functor 
L*: [C°P, Set] — [DP , Set] is left adjoint to F*, and hence that the left adjoint F; 
of F*: Sh(D, K) > Sh(C, J) may be obtained as the composite aL*7 of 2.3.18(ii). 
In other words, we have 


Scholium 2.5.2 If the conditions of 2.5.1 are satisfied, then the geometric 
morphism Sh(D, K) — Sh(C, J) induced by F as in 2.3.4 coincides with that 
induced by L as in 2.3.18. oO 


Now suppose we have a geometric morphism f: F — E between Grothendieck 
toposes, and suppose we have chosen a site (C, J) for E (where C is small, but 
not necessarily cartesian). By B3.1.10(ii), f is bounded, and so we can represent 
F as She(D, K) for some internal site (D, K) in E. In Section C1.6 we saw how 
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to ‘externalize’ the site corresponding to an internal locale in a Grothendieck 
topos; here our immediate aim is to do something similar for an arbitrary site. 

To begin with, let us consider the case where the coverages J and K are 
trivial; i.e. E is [C°P, Set] and F is [ID°P, (C°P , Set]]. In this case we may regard D 
as a functor C°P — Cat, and use the Grothendieck construction (cf. B1.3.5) to 
convert it into a split fibration C » D — C. (The use of the ‘semidirect product’ 
sign x is inspired by the case of groups: if C is a group G in Set, and D is a 
group H in [G, Set] (that is, a group on which G acts by automorphisms), then 
this is exactly the semidirect product G x H as usually defined.) Thus objects 
of C x D are pairs (U,V) with U € ob C and V € ob D(U), and morphisms 
(U',V’) — (U,V) are pairs (a,b) where a: U’ > U in C and b: V’ > D(a)(V) 
in D(U’). 


Lemma 2.5.3 With the above notation, [D°P, [C°P, Set]] ~ [(C x D)°?, Set]. 


Proof If F is a diagram of shape D? in [C°?, Set], represented by a discrete 
fibration y: F — D, we define 


F(U,V) = {z € Fo(U) | yo(x) = V}, 


with F(a, b)(x) = F(b)(Fo(a)(x)), where F(b) denotes the appropriate cartesian 
morphism of F(U’) lying over b. Conversely, given G: (C x D)°P — Set, we define 


Go: CP — Set by 
I] ewy), 


VEDo(U) 


with Go(a)(x) = G(a,1)(xz) € G(U’, Do(b)(V)). It is easy to see that Go comes 
equipped with a morphism go: Go —> Do sending each element of G(U, V) to V, 
and that the latter has a right D-action, so that we obtain a discrete fibration 
G — D in [C°P, Set], or equivalently a diagram G of shape D in [C°P, Set]. 
Further, the constructions F + F and G + G are inverse to each other up to 
isomorphism. 0 


Note that the particular case of 2.5.3 in which D is a discrete internal category 
in [C°P, Set] (i.e. takes discrete categories as values) is just A1.1.7. 

Next, we ‘put the coverages back’. We claim that, given coverages J and K 
on C and D respectively, there exists a coverage J x K on C x D such that, under 
the above correspondence, Fisa (J x K)-sheaf iff F takes values in the category 
of J-sheaves and is itself a K-sheaf. Explicitly, we say that a sieve R on an object 
(U,V) is (J x K)-covering iff there exists an element S of K(U) with b(S) = V 
(in the notation developed in Section C2.4) such that if we regard the element 
c(S) of PD,(U) as a subsheaf of C (—, U) x Dj, i.e. as a set of pairs (f,g) with 
f:U’'— U inC and g: V’ > D(f)(V) in D(U’), then for each such pair there is 
a J-covering sieve of morphisms’ f’: U” — U’ for which the pairs (f f’, D(f’)(g)) 
all belong to R. 
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It is tedious but straightforward to verify that this is indeed a Grothendieck 
coverage on C x D, and that F is a (J x K)-sheaf iff F is a K-sheaf with values 
in Sh(C, J). Thus we have proved most of 


Proposition 2.5.4 Let E = Sh(C, J) be a Grothendieck topos, and (D, K) an 
internal site in E. Then She(D, K) ~ Sh(C xD, J x K). Moreover, the projection 
functor P:C xD —>C is cover-reflecting, and induces the geometric morphism 
She (D, K) > E as in 2.3.18. 


Proof [It remains to establish the second assertion of the proposition. But if 
R is any J-covering sieve on U, and V any object of D(U), then the sieve of all 
morphisms (f, g): (U’, V^) — (U,V) for which f: U’ = U € Ris (JxK)-covering 
(take S to be the top element of K(U) in the definition); so P is cover-reflecting, 
and by 2.3.18 we know that the right Kan extension functor lim p maps sheaves 
to sheaves. Now, to compute lim p(G)(U), we have to form the limit of the 
diagram of shape (P|U)°? sending ((U’,V),a: U’ > U) to G(U,V); but this is 
precisely the limit in S which we need to compute to obtain f,(G)(V), where 
f« is the direct image of the given morphism, i.e. the restriction of the functor 
limp to sheaves. So we have identified f with the geometric morphism induced 
by P. O 


Corollary 2.5.5 Any geometric morphism between Grothendieck toposes may 
be induced by a cover-reflecting functor between sites. O 


So far, we have not made any assumptions about the categories C and D. 
But we may clearly assume that D has a terminal object, i.e. that each D(U) 
has a terminal object ty and that the transition functors D(U) — D(U’) induced 
by morphisms U’ — U of C preserve terminal objects. Then we have a functor 
T:C +C x D sending U to (U, ty); and it is clear that T is a full and faithful 
right adjoint to the projection P. This prompts the following definition: 


Definition 2.5.6 Let (C, J) and (D, K) be small sites. By a fibration of sites 
(P,T): (D, K) — (C, J), we mean a pair of functors P: D >C and T: C => D 
such that P is left adjoint to T, T is cover-preserving (equivalently, by 2.5.1, P 
is cover-reflecting), T is full and faithful (equivalently, the counit PT — 1 is an 
isomorphism), and P is a fibration in the sense of B1.3.3. 


A fibration of sites is called a continuous fibration by some authors (cf. [828]), 
but we have chosen to avoid that name for reasons already explained in Section 
C2.3. As with geometric morphisms (cf. A4.1.1), there is an arbitrary choice of 
the direction in which we consider a fibration of sites to be pointing: we have 
chosen to make it point in the same direction as the geometric morphism which it 
induces — though it should be noted that this is the direction of the left adjoint P, 
whereas a geometric morphism f points in the direction of the right adjoint f,. 
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We may now strengthen 2.5.5, as follows. 


Corollary 2.5.7 Any geometric morphism f: F — E between Grothendieck 
toposes may be induced (as in 2.5.2) by a fibration of sites (P,T): (D, K) > (C, J). 
Moreover, the categories C and D may be chosen to be cartesian, and P to be a 
cartesian functor. 


Proof First we choose (C, J) to be a site of definition for €; then we choose 
(D, K ) similarly for F as a bounded €-topos, but additionally requiring D to 
contain the terminal object of F. Then we define D = C x D as in 2.5.3, and 
equip it with the coverage derived from those on C and D as in 2.5.4. The 
arguments above show that the projection D — C and its right adjoint have 
the required properties. For the final assertion, we may choose both C and D 
to be cartesian (i.e. closed under finite limits in their respective toposes): for D, 
this means that the corresponding functor C°P — Cat takes cartesian categories 
and functors as values. So it follows from B1.4.1 that the Grothendieck category 
C x D has finite limits, and P preserves them. O 


Note that we have actually proved rather more than we claimed in the state- 
ment of 2.5.7: the ‘base site’ (C, J) of the fibration may be chosen to be any site 
of definition for € over Set, although the choice of (D, K) will in general depend 
on f as well as F. 

Next, we look at some particular examples of geometric morphisms from the 
point of view of the results established in this section. 


Examples 2.5.8 (a) Let f:[D°P, Set] — [C°P, Set] be the geometric mor- 
phism induced as in A4.1.4 by a functor F: D — C. We may factor F as 


pas GI(F) ae, 


where GI(F’), as in A2.1.12, denotes the comma category (C | F), P is the pro- 
jection (U, V,a) + U and R is the functor V + (FV,V,1rv) (the right adjoint 
of the other projection). P is a split fibration, corresponding to the functor 
CoP — Cat which sends U to (U | F), and a morphism a: U’ — U to the func- 
tor induced by composition with a; R is full and faithful, and so if we use it 
to equip GI(F}) with a coverage K as in 2.2.4(d), we obtain an equivalence 
[D°P, Set] ~ Sh(G1(F), K). It is clear that P is cover-reflecting, since there are 
no nontrivial covers on C; for an arbitrary F, P will not have a right adjoint, 
but if we assume that D has a terminal object and F preserves it, then we have 
a right adjoint T sending U to (U,1,U — 1). So in this case we may take the 
fibration of sites (P,T): (GI(F}, K) — (C, D) (where D is the ‘discrete’ cover- 
age on C, whose only covers are the maximal sieves) to represent the geometric 
morphism f. 

(b) Let f: Y — X be a locale morphism. We saw in 1.6.1 how to ‘internalize’ 
O(Y) as a frame in Sh(X); if we then externalize it again (not by the method of 
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1.6.2 but) by the method of 2.5.4, we obtain the posite P whose objects are pairs 
(U,V) € O(X) x O(Y) with V < f*(U), with the coverage K in which a sieve 
R covers (U,V) iff the set of those V’ for which there exists U’ € O(X) with 
(U',V") € R covers V in O(Y). In particular, we note that the single morphism 
(U,V) — (X,V) generates a covering sieve for any V < f*(U), and hence any 
sheaf A on P satisfies A(U, V) = A(X,V), i.e. it is determined by its values on 
objects of the form (X, V}. But the latter must satisfy the sheaf axiom for covers 
of the form ((X,V;) > (X,V) |i € I) where V = U{V; | i € I}; so we deduce 
that Sh(P, K) is equivalent to Sh(Y). The projection P: P — O(X) is cover- 
reflecting, since if U = U{U; | i € I} then ((U;, V A f*(U;)) > (U,V) | i € D) 
generates a K-covering sieve; also P is a fibration, corresponding to the functor 
O(X)°P — Poset sending U to |(f*(U)), and it has a right adjoint given by 
U + (U, f*(U)). So we have represented the geometric morphism Sh(Y) — 
Sh(X) induced by f by a fibration of sites. 

(c) More generally, let (C, J) be a small cartesian site, and let L be an internal 
frame in Sh(C, J), considered as a functor CP — CLat satisfying the conditions 
of 1.6.10. Then we may externalize L by the method of 2.5.4, to obtain a category 
C x L whose objects are pairs (U,V) with U € ob C and V € L(U), with 
morphisms (U’, V’) — (U,V) taken to be morphisms a: U’ > U of C for which 
V’ < L(a)(V) in L(U’) (or equivalently ©,(V’) < V in L(U)). The detailed 
description of the coverage in this case is left to the reader. 

(d) A particular case which we have seen before: let G be a group, and let 
X be an internal locale in [G, Set}. Since the forgetful functor [G, Set] — Set 
is logical (and hence preserves completeness of posets, by B2.3.9), an internal 
locale in [G, Set] is simply a locale in Set on which G acts by automorphisms. If 
we externalize the site in [G, Set] given by O(X) with its G-action and canonical 
coverage, we obtain the site (Og(X),7') for the topos of G-equivariant sheaves 
on X, which we described in A2.1.11(c). The fibration Og(X) — G is the faithful 
functor arising from the description of Og(X) as a category structured over G, 
which we gave in A2.1.11(c); its right adjoint sends the unique object of G to 
the top element X of O(X). 

(e) A generalization of (d) which will be of importance in Section C5.4: let G 
be a topological group, and consider an internal frame A in the topos Cont(G) 
of continuous G-sets. Since the geometric morphism [G, Set] — Cont(G) whose 
inverse image is the forgetful functor is hyperconnected, we know from 2.4.15 
that Frm(Cont(G)) is a coreflective subcategory of Frm([G, Set]); hence A is 
the ‘continuous part’ of a frame A in [G, Set], that is a frame in Set on which 
G acts (not necessarily continuously) by automorphisms. (Explicitly, A may be 
constructed as the MacNeille completion of A, that is the poset of pairs (L, U) 
of subsets of A such that L is the set of lower bounds of U, and U is the set 
of upper bounds of L; we recall that the MacNeille completion of any Heyting 
algebra is a Heyting algebra (520, III 3.11], and hence a frame, and any G-action 
on a poset extends in an obvious way to its MacNeille completion. However, we 
do not need this identification for our present purposes.) 
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We recall that in 2.2.4(c) we constructed a site of definition (C,J) for 
Cont(G): the objects of C are the open subgroups of G, with morphisms H — K 
labelled by cosets gK of K such that H C gKg™, and every inhabited sieve is 
J-covering. Now, given an internal frame A in Cont(G), let Ay denote the set of 
elements of A which are invariant under the action of H; note that Ag C Ag if 
H C K, and that the union of the Ay is the whole of A. The information that A 
is the continuous part of a complete Heyting algebra in (G, Set] tells us that the 
inclusions Ax — Ay are Heyting algebra homomorphisms, and that they have 
left and right adjoints (the left adjoint sends an element U of Ay to the join of 
its images under all the elements of K). Applying the construction of 2.5.4, we 
obtain a site (D, J’) where the objects of D are pairs (H,U) with H an open 
subgroup of G and U € Aq, and whose morphisms (H,U) — (K, V) are labelled 
by left cosets gK of K such that H C gKg™! and U < gV in A. We note that 
any morphism gK of D has a factorization 


H 
(H,U) S, (g-1Hg,g~1U) ate (K,V) 


where the first factor is an isomorphism; so a sieve on (K, V) is determined by 
the morphisms in it which are labelled by K itself. Thus we may define a sieve 
R on (K,V) to be J’-covering if V is the join of all those U for which there 
exists H such that U € Ay and K: (H,U) — (K,V) € R. In particular, we note 
that the single morphism K: (H, V) — (K, V) generates a covering sieve for any 
H C K and any V € Ax; so the projection P: D — C is cover-reflecting. Its 
right adjoint T is of course given by H ++ (H,T), where T is the top element 
of A. 


Remark 2.5.9 If C has and P preserves pullbacks, then the fact that P is a 
fibration follows from the other conditions of 2.5.6: we may obtain a prone lifting 
of an arbitrary morphism V — PU by applying T to it and then pulling back 
along the unit U — TPU. However, we do not wish to assume the existence 
or preservation of pullbacks as part of our definition of a fibration of sites: in 
part this is because we wish to be able to use the concept over base toposes 
which do not have a natural number object, so that we cannot in general choose 
our sites to be closed under finite limits (and thus we have to sacrifice the final 
assertion of 2.5.7). But we also wish to use 2.5.7 as a means of ‘externalizing’ 
characterizations of classes of geometric morphisms in terms of sites, and the 
latter may not be ‘stable’ under the operation of adjoining finite limits to the 
underlying category of a site. 


? 


To explain what we mean by the preceding sentence, let P be a property of 
geometric morphisms, and suppose we have a property P’ of sites (with terminal 
objects) such that a bounded geometric morphism p: € — S has property P iff 
€ is definable by an internal site in S having property P’. Then we may use 
the ideas above to ‘externalize’ P’ to a property P” of fibrations of sites, such 
that a morphism of bounded S-toposes has property P iff it may be induced by 
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a fibration of sites having property P”. We give one example of this technique 
here; others will occur at various points in Chapter C3. 


Proposition 2.5.10 A morphism of BZop/Set is surjective iff it can be 
induced by a fibration of sites (P,T) such that P preserves covers of objects 
in the image of T. 


Proof By 2.4.8, f: F — E is surjective iff we can choose a site (D, K) for F 
over € such that D has a terminal object, and every covering of the terminal 
object is inhabited. If £ is itself given as Sh(C, J), then this condition readily 
translates into the one in the statement of the proposition for the corresponding 
fibration of sites (C xD, J% K) — (C, J). Conversely, if f is induced by a fibration 
of sites (P,T): (D, K) — (C, J) satisfying the condition, then 2.3.4 and 2.3.18 tell 
us that the composite f, f* sends a sheaf A on C to the functor a(AoP) oT, where 
a is the associated K-sheaf functor. But from the fact that P preserves covers 
of objects in the image of T, we see that if two elements of AP(TU) = A(U) 
become equal when restricted to some covering sieve R on TU, then they become 
equal on the cover of PTU % U generated by the Pb, b € R, and hence they 
must already be equal; so, from the construction of the associated sheaf functor 
in the proof of 2.2.6, we see that the unit map (Ao P) — a( A o P) is monic at 
each object of the form TV. Hence the unit of (f* 4 fx) is monic, ie. f is a 
surjection. O 


Another application of fibrations of sites occurs in the description of (cofil- 
tered) inverse limits in BTop/Set (or more generally in BTop/S, where S is 
any topos with a natural number object). For the moment, we shall restrict 
ourselves to limits of inverse sequences 


fs f fi 
= 


| a EEN EEE 


of Grothendieck toposes and geometric morphisms; dealing with more gen- 
eral cofiltered limits requires descent theory, and so we shall postpone it 
to Section C5.1. By the remark immediately after the proof of 2.5.7, we 
may successively choose sites (Cn, Jn) for each En, and fibrations of sites 
(Pa, Tn): (Cn, In) > (Cn—1, Jn—1) inducing the morphisms fn as in 2.5.7. 

We define a category Cæ as follows: its objects are sequences (U,, | n > 0) of 
objects of C, for each n, equipped with isomorphisms an: P,(Un) > Un_, for 
each n such that, for some m, the transpose Un — T,(Un—1) of an is also an 
isomorphism whenever n > m. A morphism (U, | n > 0) > (Vn | n > 0) of 
Coo is simply a sequance of morphisms an: Un —> Vn commuting in the obvious 
sense with the an; note that we shall tend to suppress the names of the a,, when 
referring to an object of Cæ, as is our usual custom with coherence isomorphisms. 
(Cœ could equivalently be described as the pseudo-colimit in Cat of the filtered 
diagram formed by the C, and T;,; and indeed 2.5.11 below is really a special 
case of 2.3.14, except that we are here working with not-necessarily-cartesian 
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sites. However, the left adjoints P, allow us to embed this pseudo-colimit as a 
subcategory of the pseudo-limit of the Pa, and this description is more useful for 
calculations.) 

For m < n, we write P™ : Can —> Cm (resp. T7": Cm — Cn) for the composite 
of the P; (resp. T,) with m +1 < j < n (we take this to be the iden- 
tity functor if m =n, of course). We have functors P% : Cə —> Cn defined by 
P} (Um | m > 0) = Un, and T% : Cn > Co defined by TZ (U) = (Vm | m > 0), 
where Vm = P} (U) if m < n, and Vm = T} (U) if m > n, with the obvious 
coherence isomorphisms. It is easy to see that TX is right adjoint to P$, and 
the counit of the adjunction is an isomorphism. Also, P% is a fibration: given an 
object (Um | m > 0) of Cœ and a morphism a: V > U,, = P} (Um | m > 0) 
in Ca, we obtain a prone lifting of a by taking its mth component to be 
P? (a): P?(V) > P}(Un) = Um if m < n, and the appropriate prone lifting 
ofatoCmifm>n. 

We equip Cə with the smallest coverage J. for which the functors 
T% are all cover-preserving, i.e. the join (in the lattice of Grothendieck 
coverages on Cæ) of the coverages (T%)e(Jn) as defined in 2.3.12. By 
2.5.1, this coverage also makes each P% cover-reflecting. So we have fibra- 
tions of sites (P%,T%): (Coo, Joo) > (Cn, Jn), and hence geometric morphisms 
fZ: Exo = SH(Coo, Joo) > En, for each n; it is clear that these form a pseudo-cone 
over the diagram formed by the fn. 


Theorem 2.5.11 With the above notation, Eœ is the limit of the diagram 
formed by the En and fn in BYop/Set. 


Proof Suppose given an arbitrary (pseudo-)cone (gn: F >En | n > 9), 
where F is a Grothendieck topos. By 2.3.9, each gn corresponds to a flat 
cover-preserving functor G,:C, — F (where F is equipped with its canonical 
coverage); and we have coherence isomorphisms Gn 0 Ta = Gn-1 for all n > 1. 
Given an object (Un | n > 0) of Cæ, we define G.(U, | n > 0) to be the limit 
in F of the diagram formed by the objects G,,(U,,), with transition maps 


Gn(@n) e 
Gn (Un) => GnTa(Un-1) => Gn-1(Un-1); 
note that, for a given object (Un | n > 0), these transition maps are isomorphisms 
for all sufficiently large n, so we could equivalently define G.(U, | n > 0) to 
be simply Gm(Um) for some sufficiently large m. Hence also we have canonical 
isomorphisms G% 0 T% = Gn for all n, which are compatible with the coherence 
isomorphisms between the Gn. It follows easily that Gə is cover-preserving; for 
each Gæ © T% is cover-preserving, and hence we have (T%)e(Jn) < (Gæ)? (C) 
as coverages on Cæ, where C is the canonical coverage on F. It is also easy to 
see that G. is a flat functor, and hence that it induces a geometric morphism 
Joo: F — Ex. Moreover, the functor Gæ (and hence the geometric morphism 
Joo) is determined up to canonical isomorphism by the requirement that we have 
Go ° Th = Gn for all n, since any object (Un | n > 0) of Cao is isomorphic to 
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T% (Um) for sufficiently large m. And the construction of gs. from (gn | n > 0) 
is clearly functorial on 2-cells of BYop/Set, so we have an equivalence 


Bop /Set (F, Ex) ~ lim, BTop/Set (F, En), 
as required. 0O 


We note that, if we wished to avoid the use of the large site (F, C') in the 
proof of 2.5.11, it would be possible to replace it by a small site (D, K), where 
D is any full subcategory of F containing a generating set and also all objects 
of the form g*(l,(U)), U € ob Cn (where ln: Cn —> Sh(Cy, Jn) is the canonical 
functor), and K is the induced coverage. Thus the proof of 2.5.11 may be carried 
out in BTop/S, for any base topos S with a natural number object. 

We note the following important application of 2.5.11: 


Corollary 2.5.12 The localic reflection functor BYop/Set — LTop/Set 
preserves limits of inverse sequences. 


Proof We saw in 2.3.20 that we may construct the localic reflection at the 
level of sites, by forming the preorder reflection of the underlying category and 
equipping it with an appropriate coverage. If we are given an inverse sequence 
of fibrations of sites (Pa, Tn): (Cn, Jn) > (Cn-1, Jn-1) as in the proof of 2.5.11, 
and form the preorder reflections P„ of the categories Cn, then the P, and Tah 
clearly induce functors between the latter, and it is easy to verify that these are 
again fibrations of sites. Moreover, the construction of the limit site (Cx, Joo) 
also commutes with the preorder reflection; that is, the preorder Pa constructed 
from the P, is the preorder reflection of Cœ, and the coverage which it inherits 
as a quotient of Cə coincides with that which it acquires as the limit of the Pn. 
So the result is immediate from 2.5.11. O 


Remark 2.5.13 Combining 2.5.12 with 2.4.13, we see that the localic reflec- 
tion functor preserves countable products (and, once we have extended 2.5.12 
to more general cofiltered limits in 5.1.12 below, we may similarly deduce that 
it preserves arbitrary S-indexed products, where S is our base topos). However, 
it does not preserve arbitrary limits: the following counterexample is due to 
I. Moerdijk [828]. 

Let C be the finite category represented by the diagram 


E er 9 i 


b 


and let D be the functor C°P — Poset defined by taking D(U) = 1, 
D(V) = 2, D(a)(0) = 0 and D(b)(0) = 1. (Here we are using n to denote 
the n-element totally ordered set {0,1,....2—1}.) Let E be defined similarly, 
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but with the rôles of a and b interchanged. By B3.2.14 we have a pullback 
diagram 
[(D x E)°P, [C°P, Set]] —> [E°P, [C°?, Set]] 


[DP (c°?, Set]] ————> [C?, Set] 


and by 2.5.3 we may rewrite this in the form 


[(C 4 (D x E))°?, Set] — > [(C x E)°?, Set] 


[(C x D)®, Set] —————> [C°?P, Set] 


But it is easy to compute these finite semidirect products explicitly, and to 
verify that if we take their preorder reflections we get a diagram which is not 
a pullback in Poset, and hence cannot give rise to a pullback of localic Set- 
toposes. 


Using 2.5.12, we may extend the result of 1.1.12 from locales to toposes, at 
least in the countable case. 


Corollary 2.5.14 Let 


fs f fi 
> 


E2 ———> E): ——> & 


be an inverse sequence of Grothendieck toposes, such that the transition mor- 
phisms fn are all surjections. Then the legs f$ : Eso —> En of the limit cone in 
BTop/Set are also surjections. 


Proof Fix a particular value of n, and consider the sequence in BYop/E€, 
formed by the Em for m > n. If we replace each such Em by its reflection 
(Lm, say) in LLop/E,, then since hyperconnected morphisms are surjective, 
it is clear that the transition morphisms Lm+1 — Lm are still surjective. So by 
1.1.12 (applied to internal locales in €,,) the inverse limit Læ of the Lm maps 
surjectively to each Lm, and in particular to Ln = En; but, by 2.5.12 applied to 
bounded €,,-toposes, the comparison Eœ — Læ is hyperconnected, and hence 
surjective. 0 


In principle, it should be possible to prove 2.5.14 directly from the char- 
acterization of surjections in 2.5.10. The problem with trying to do so is that 
our definition of the limit coverage J,. gives us an explicit description only of 
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a generating set of covering sieves, not of 
that P$, preserves those covers of objects 
the generating family, but we cannot conch 
objects. 


Suggestions for further reading: Artin 
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all of them; it is not hard to verify 
in the image of TZ, which belong to 
ide that it preserves all covers of such 


et al. [36], Moerdijk [828]. 


C3 


CLASSES OF GEOMETRIC MORPHISMS 


C3.1 Open maps 


Our aim in this chapter is to revisit some of the ‘topologically inspired’ classes 
of geometric morphisms which we introduced in Section C1.5, in the con- 
text of morphisms between localic toposes, and to investigate their properties 
in the more general context of morphisms between Grothendieck toposes. As 
in Section C2.4, we shall henceforth interpret ‘Grothendieck topos’ loosely as 
meaning any topos defined and bounded over a base topos S having a nat- 
ural number object; but we shall often treat S notationally as if it were the 
classical category Set of sets, relying on the reader to translate our argu- 
ments as required into the language of S-indexed categories as developed in 
Part B. 

We begin with the study of open maps. Recall that in 1.5.4 we defined a 
geometric morphism f: F — E to be open if its inverse image f* is a Heyting 
functor. However, this definition is equivalent to another elementary one, which 
is more convenient for use in some contexts; so we shall begin this section by 
establishing the equivalence. 

As we saw in Section A2.1, any cartesian functor F: € — F between toposes 
comes equipped with a family of comparison morphisms ¢4: F(PA) — P(FA) 
for each object A of E (in fact they form a natural transformation between 
functors EP — F); explicitly, a is the name of the relation tabulated by 
F(€4) — F(PA x A) & F(PA) x FA. We call F a logical functor if 4 is an 
isomorphism for any A; we shall call it sub-logical if we merely know that ¢, is 
monic for each A. Similarly, we shall call F a sub-cartesian-closed functor if the 
comparison map 04.5: F(B4) > FB¥4 (defined in Section A1.5) is monic for 
all objects A and B of €. 


Lemma 3.1.1 A cartesian functor F: E — F between toposes is sub-logical iff 
it is sub-cartesian-closed and ġı: Fe — Qs is monic. 


Proof If F is sub-logical, then clearly ¢, is monic; and from the description 
of the exponential B4 as a subobject of P(A x B) given in A2.4.11, it is easy to 
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see that the diagram 


F(P(A x B)) ZB PUPA x FB) 


commutes. Hence 64, is monic if 64x 8 is. Conversely, it is not hard to see that 
oa may be decomposed as the composite 


ban gra 
F(Q4) —S FaPA ——s oF A, 


so it is monic if #4 9 and ġ, are. O 


Lemma 3.1.2 Let f:C —D be a functor between small categories. Then the 
functor f*:[D, Set] — [C, Set] induced by composition with f is sub-logical iff 
f satisfies the following condition: given any morphism b: fU — V in D whose 
domain is in the image of f , there exista: U > U', r: fU'— V andi: V = fU' 
such that ri = ly and ib = fa. 


Proof Suppose first that this condition holds: let A be an object of [D, Set] and 
U an object of C. Then, by the construction of A1.5.5 and A1.6.6, f*(PA)(U) = 
PA(fU) is the set of subfunctors of A x D(fU,—), and (ġa)u maps such a 
subfunctor R to the subfunctor S of f*A x C (U, —) given by 


S(U") = {(z,a) | (z, fa) € R(fU")} . 


But if b: fU — V in D and y € A(V), then (defining a, r and 7 as in the 
condition) it is easily seen that (y,b) € R(V) iff (Ai(y),a) € S(U'); so R is 
uniquely recoverable from S, i.e. (¢a4)u is injective. 

Conversely, suppose f* is sub-logical. Given b: fU — V, let A be the functor 
D (V,—), and consider the following two subfunctors of A x D (fU, —) as elements 
of f*(PA)(U): 


( 


R,(V') = {(e: V > V',d: fU > V’) | cb = d}, 
R2(V') ={ 


) = {(e,d) € Ri(V’) | c is not split monic} . 


Since (1y,b) € Ri(V), we have R, # R2, and so their images under (ġa)u must 
be distinct; but this is equivalent to saying that we can find i: V — fU’ and 
a: U = U' such that (i, fa) € Ri(fU') and ¢ is split monic, i.e. that the given 
condition holds. O 
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Although the second half of the above argument is clearly non-constructive, 
the first half is constructive, and so can be applied to a functor f:C — D 
between internal categories in any topos S, to obtain a sufficient condition for 
the corresponding inverse image functor [D, S} — [C, S] to be sub-logical. 


Remark 3.1.3 Reverting to the case S = Set, a similar argument shows that 
f*: [D, Set] — [C, Set] is sub-cartesian-closed iff f satisfies the condition that, 
given b: fU > V, we can finda: U = U',r: fU' = V andi: V — fU' satisfying 
ri = ly and r(fa) = b. We omit the details, which may be found in [513]; 
but we note that this result enables us to give an example to show that sub- 
cartesian-closedness alone is not sufficient to imply that a functor is sub-logical. 
Let D be the monoid of all endomorphisms of the set N of natural numbers, 
C the submonoid of injective endomorphisms, and f the inclusion map. Then 
f satisfies the condition just given, since any b: N — N can be factored as the 
composite of an injection a and a split surjection r: for example, we can take 
a(n) = 2?) (2n+1). But the stronger condition of 3.1.2 fails for any b which is 
not injective. 


We shall have more to say about the difference between the conditions of 
3.1.2 and of 3.1.3 in the next section (see 3.2.4 and 3.2.14). 

Sub-logical and sub-cartesian-closed functors have straightforward stability 
properties under composition: 


Lemma 3.1.4 Let F:€ — F andG: F — G be two cartesian functors between 
toposes. 


(i) If both F and G are sub-logical (resp. sub-cartesian-closed), then so is the 
composite GF. 
(ii) If GF is sub-logical (resp. sub-cartesian-closed) and G is faithful, then F 
is sub-logical (resp. sub-cartesian-closed). 
(iii) Jf GF is sub-logical (resp. sub-cartesian-closed) and F is logical (resp. 
cartesian closed) and essentially surjective on objects, then G is sub-logical 
(resp. sub-cartesian-closed). 


Proof Let A be an object of E. Then the comparison map 
65°: GF(PA) > P(GF A) is easily seen to be the composite 


G(o4) FA 
GF(PA) —“> G(P(FA)) £S P(GFA), 


from which the ‘sub-logical’ cases of the three statements follow immediately. The 
‘sub-cartesian-closed’ cases follow from the similar factorization of the exponen- 


tial comparison map 694,: GF(B4) > GF BCFA, o 


Next, we note a useful extension of Proposition A4.5.1. 
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Lemma 3.1.5 Let j be a local operator on a topos E, and let L: E > sh,(€) 
denote the associated sheaf functor. The following conditions are equivalent: 


(i) j ts open. 
(ii) L is a logical functor. 


(iii) L is a cartesian closed functor. 
(iv) L is a sub-logical functor. 
(v) L is a sub-cartesian-closed functor. 


(vi) Ifm: A — A is j-dense, so is m? : A'B — AP for any B. 


Proof The equivalence of (i), (ii), (iii) and (vi) was shown in A4.5.1. The impli- 
cations (ii) > (iv) and (iii) > (v) are trivial, and (iv) = (v) was shown in 3.1.1 
above. So it remains to prove that (v) implies (vi). But since the functor (—)? 
preserves pullbacks, it suffices to verify (vi) for the generic j-dense monomor- 
phism d: 1 — J. And L(J) = 1, so (v) implies that we have a monomorphism 
65,7: L(JP) = LJP = 148 & 1 for any B. So L(d®) must be an isomorphism; 
i.e. d? is j-dense. 0 


Now we embark on establishing the connection between the foregoing ideas 
and our previous definition of open map. 


Proposition 3.1.6 LetF:E — F bea cartesian functor between toposes which 
commutes with universal quantification. Then F 1s sub-logical. 


Proof Let A € ob €. Consider the diagram 


A 
RED pea BOO) ve 


F(A x OA) 
0x8 0 
FOFA x FOFA Baia FOFA 1 
ofA x of ‘a 
SFA vV(FA) 


in which the right-hand cell commutes by A2.3.11, and the lower left cell com- 
mutes by a similar argument using A1.4.13. Since 04 x Q4 & (Q x )4, and 
similarly for FQF^4, we can regard the upper left square as an instance of the 
naturality of 6. So the whole diagram commutes. But the bottom row classifies 
the diagonal subobject of Q74 x QF, and the top row is similarly the image 
under F of the classifying map of the diagonal subobject of 04 x 04; so the 
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commutativity of the outside of the diagram is tantamount to saying that 
PA 
F(A) hee 9 lee 


F(A) A 
PAX OA 
— > 


F(Q4 x OA) OFA x OF4 


is a pullback, i.e. that 4 is monic. O 


To obtain a converse to 3.1.6, we need to assume that F has a right adjoint, 
i.e. that it is the inverse image of a geometric morphism. Then we have 


Theorem 3.1.7 For a geometric morphism f:F —€E, the following are 
equivalent: 

(i) f is open, i.e. f* is a Heyting functor. 

(ii) f* commutes with universal quantification. 
(iii) f* is a sub-logical functor. 

) For any object A of E, the inverse image of the induced geometric 
morphism f/A: F/f*A— E/A (cf. A4.1.3) is sub-logical. 
(v) For any object A of E, the inverse image of f/A is sub-cartesian-closed. 


(vi) For any object A of E and any subobject B of f*A, the image of the 
composite 


(iv 


_, F/A 
F/B —> F/f"A——> E/A 


is an open subtopos of E/A. 

(vii) The transpose $1: Qe > fNQF of 61 has an internal left adjoint. 

(viii) For every A, the transpose p4: PA — f,(P(f*A)) of ġa has an internal 
left adjoint.: 


Proof (i) © (ii) since inverse image functors are always coherent, and (ii) 
> (iii) is 3.1.6 above. For (iii) > (iv), we recall from the proof of A2.3.2 that 
the power object of an arbitrary object h: C — A of E/A is constructed as a 
subobject of A*(PC), and it is straightforward to verify that the diagram 


(f/A)*(Ph) — > P(r) 


<A * 
(F/A) A* (PC) Ua (f* A)*(P(F*C)) 
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commutes. So ġar is monic if gc is. (iv) > (v) is immediate from 3.1.1; and 
(v) = (vi) follows from 3.1.4 and 3.1.5, for if (f/A) has sub-cartesian-closed 
inverse image then 3.1.4 implies that the same holds for the image of the displayed 
composite. 

(vi) > (viii): Given a morphism r: B — f,(P(f*A)) in E, corresponding to a 
subobject R — f* Bx f*Ain F, let A(R) be the subobject of Bx A corresponding 
to the image of the composite geometric morphism F/R — F/f*B x f*A > 
E/B x A, and let Ar: B — PA be the name of this subobject. It is easy to see 
that the construction R +> A(R) is stable under pullback, so that Ar is indeed the 
composite of r with a morphism \(= 41): f.(P(f*A)) —> PA. Moreover, given 
a subobject S — B x A, named by a morphism s: B — PA, it is clear that we 
have A(R) < S in Sub(B x A) iff R < f*S in Sub(f*B x f*A), ie. that Ar < s 
iff r < $8, yielding the required internal adjunction (4 ga). 

(viii) > (vii) is trivial, so it remains to prove (vii) = (ii). In fact we first show 
that (vii) implies a strengthened version of (viii): given a left adjoint A = A, for 
$i, we define 44: f.(P(f*A)) — PA to be the composite 


\4 


mae) TA 
{nf"A (S ) > (f.)4 > Q4 


pea) L (4,2) 


where 6’ is the exponential comparison map for f, and a is the unit of (f* 4 fẹ). 
Then it is straightforward to verify that \4 is an internal left adjoint to ¢,, 
and further that A+ A4 is ‘natural’ in the sense that, for any h: A —> B, the 
diagram 


AB 
PAP eB) = PB 
f.(P(f*A)) Ph 


ÀA 
JAPA) ÁRA 


commutes. Taking internal right adjoints of the morphisms in this diagram, 
we get 


PA ^> (PISA) 


Yh fe(W(F*h)) 


pB B> §.(P(f*B)) 


But by A2.3.11 this diagram commutes iff f* preserves universal quantification. 
o 
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We digress for a moment to remark that the converse of 3.1.6 holds for direct 
image functors as well: 


Proposition 3.1.8 For a geometric morphism f: F — E, the following are 
equivalent: 


(i) fx commutes with universal quantification. 
(ii) fa is sub-logical. 
(iii) f is an inclusion. 


Proof (ii) > (i): Suppose f is an inclusion, so that we can identify F with 
the full subcategory of sheaves for some local operator 7 in €, and f, with the 
inclusion. Given a morphism g: A > Bin sh; (E) and a subobject A’ — A which 
is a sheaf, we have to show that V,(A’) — B (as computed in £) is a sheaf. But 
the subsheaves of a 7-sheaf are exactly its 7-closed subobjects, by A4.3.8; and 
since g*: Sub(B) — Sub(A) commutes with j-closure, it is immediate that its 
right adjoint V, preserves closed subobjects. 

(i) => (ii) again follows from 3.1.6; note that we could also deduce (ii) directly 
from (iii) using 3.1.1, since the direct images of inclusions are cartesian closed 
functors by A4.2.9, and the comparison map ġı in this case is the canonical 
monomorphism Q, — Q. 

(ii) = (iii): Suppose f, is sub-logical. Then it is sub-cartesian-closed by 3.1.1, 
so, for any two objects A, B of F, any pair of morphisms 1 = f,(B4) having 
equal composites with 04,8: f.(B4) — f,B/4 must be equal. But this says 
that fs must be faithful; hence by A4.6.2(a) f is localic. And since ¢; is monic, 
the internal locale in € which corresponds to f is a sublocale of the terminal 
locale; so f is an inclusion by 1.5.1(i). E] 


Given the result of A4.1.12 that any cartesian functor between toposes can be 
factored as the composite of a direct image functor and an inverse image functor, 
one might hope to prove the converse of 3.1.6 for arbitrary cartesian functors by 
combining the results of 3.1.7 and 3.1.8. However, the hypothesis that a functor 
is sub-logical is not ‘stable’ under this factorization. 

Now we revert to the study of geometric morphisms whose inverse images are 
Heyting functors. We remark that any geometric morphism whose codomain is a 
Boolean topos is open: this follows either from the observation after A1.4.10 that 
a coherent functor defined on a Boolean coherent category is a Heyting functor, 
or from the result of A4.5.22 that every subtopos of a Boolean topos is open. A 
more substantial consequence of 3.1.7 is the following: 


Corollary 3.1.9 

(i) Any hyperconnected geometric morphism is open. 

(ii) A geometric morphism is open iff the localıc part of its hyperconnected-localic 
factorization is open. 


Proof (i) By A4.6.6, a morphism f: F — € is hyperconnected iff the com- 
parison map ¢): fQ — Q is an isomorphism, and an examination of the proof 
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of A4.6.6 will show that its inverse is then necessarily 4). So this is immediate 
from condition (vii) of 3.1.7. 

(ii) follows immediately from (i) and 3.1.4, since the inverse image of a hyper- 
connected morphism is faithful. 


In connection with 3.1.9(ii), we note also that a geometric morphism is open 
iff both parts of its surjection-inclusion factorization are open: this follows easily 
from 3.1.4 and 3.1.5. 

It follows from 3.1.9 that the study of open maps of toposes reduces, to a 
large extent, to the study of open localic maps — equivalently, to the study of 
open maps of locales, where by ‘locales’ we understand internal locales in an 
arbitrary topos. We therefore turn now to the study of such maps, following the 
approach pioneered by A. Joyal and M. Tierney [560]. 

We recall the relationship between frames and complete join-semilattices, 
which we studied in 1.1.10. In particular, we recall that if f: A> Bisa 
frame homomorphism, then B can be regarded as a commutative monoid in the 
category of A-modules in CjSLat. The connection with open maps is given by 


Lemma 3.1.10 Let f: X —Y be a map of locales. Then f is open iff 
f*: O(Y) — O(X) has a left adjoint fr: O(X) — O(Y) which is a O(Y)-linear 
morphism of complete join-semilattices. 

Proof If f* has a left adjoint fi, the latter is automatically a morphism 
of complete join-semilattices. The condition that it should be O(Y)-linear is 
precisely the Frobenius reciprocity condition of 1.5.3(ii). 


As an example of the advantages of this way of viewing open maps, we prove 


Proposition 3.1.11 Let 


pa ay 

h l 
f 

X ——> Z 


be a pullback square in Loc with f open. Then 
(i) k is open; 


(ii) the square 
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commutes (we say that the pullback square satisfies the Beck-Chevalley 
condition); 


(iii) if f is also surjective, then k is surjective. 


Proof (i) By 1.1.9, the above pullback in Loc corresponds to the pushout 


O(Z) E O(X) 
h* 
ov) 5> ox) Boz) OY) 


in Frm, where h*(U) = U @ Y (recall that we identify Y with the top element 
of O(Y)) and k*(V) = X & V. By 3.1.10, we know that f* has an O(Z)-linear 
left adjoint fi. We define kı: O(X) @ez) O(Y) — O(Y) by 


ki(U @ V) = g* f(U) NV; 


this is well-defined since (U,V) + g* fi(U)NV is clearly O(Z)-bilinear. Moreover, 
kı is left adjoint to k* since we have kk*(V) = g* fi(X)NV < V and 


k*ki(U @V) =X (g fi(U) AV) = (XO Ft fi(U)) @V >U@V 


where we used the O(Z)-bilinearity condition at the second step. Finally, kı is 
O(Y)-linear, since 


k((U @V)Nk*(W)) =kh((U @V)N(X @W)) 
=h(U @(VNW)) 
= g f(U)AVAW =kh(U @V) AW 


for any U € O(X) and V,W € O(Y). So we have shown that k is open. 

(ii) The identity kih* = g* fı is clear from the definition of ky. 

(iii) If f is a surjection, then fi(X) = Z; so it follows easily that kik*(V) = 
go fi(X) OV =V for all V € O(Y), and hence k is a surjection. o 


Corollary 3.1.12 
(i) Open surjections are regular epimorphisms in Loc. 


(ii) If an open map is a monomorphism in Loc, then it is an inclusion (i.e. a 
regular monomorphism). 
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Proof (i) Given an open surjection f: X — Y, form the pullback 


ae | 


P 
f 
X ——— Y 


we must show that f is the coequalizer of h and k in Loc, i.e. that 


* h* 
O(Y) = O(X) ——S O(P) 
Kt 


is an equalizer diagram in Frm. But by 3.1.11 k is an open surjection, 
and the Beck-Chevalley condition holds; so the maps fi: O(X) — O(Y) and 
ky: O(P) — O(X) give this diagram the structure of a split equalizer in Set (in 
fact in CjSLat), and the forgetful functor Frm — Set creates equalizers. 

(ii) If f is open and a monomorphism, then by (i) and the remark after 3.1.9 
the surjective part of its surjection—inclusion factorization is both monic and 
regular epic, and hence an isomorphism. 0 


The proof of 3.1.12(i) should be compared with that of A2.2.7. We shall prove 
a much stronger result — the fact that open surjections are descent morphisms 
in Loc - in 5.1.4. 


Remark 3.1.13 If we specialize the Beck-Chevalley condition of 3.1.11 to the 
case when g is the inclusion of an open sublocale of Z, we recover the Frobenius 
reciprocity condition of 1.5.3(ii). Thus the assertion ‘for every pullback k of f, 
k* has a left adjoint kı, and the Beck-Chevalley condition holds for all such 
pullbacks’ is yet another equivalent way of defining what it means for f to be an 
open map. 


We digress briefly to note a couple of results relating openness and fibrewise 
denseness (cf. 1.2.14), which will be needed in Section C5.3. 


Lemma 3.1.14 
(i) Suppose given a pullback square 
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in Loc/X for some X, where i is fibrewise dense over X and g is an open 
map. Then 7 is fibrewise dense over X. 


(ii) Suppose given a commutative diagram 


Fo 
P f 
X 


where i is fibrewise dense over X. Then f is open iff g is open. Moreover, if 
these conditions are satisfied, then the pullback of i along an arbitrary locale 
map a: X' — X is fibrewise dense over X'. 


Proof (i) The Beck-Chevalley condition of 3.1.11 yields the equality 
gis = jh*: O(Y') — O(Z), by taking right adjoints. To say that 7 is fibrewise 
dense over X means that i,i*f* = f* (where f is the structure map Y — X); 
so we immediately deduce j,7*g* f* = jah*i* f* = g*i,i* f* = g* f*. 

(ii) First suppose g is open. Since i is fibrewise dense, we have f* = ixg*; 
so f* has a left adjoint g:27*. To verify the Frobenius reciprocity condition, let 
U € O(X), V € O(Y). Then we have 


FFU) OV) = gi*(f*(U) AV) 
= gı(g9* (U) N © (V)) 
=UNgi*(V) 
= U N f(v), 
as required. 
In the case when f is assumed open, we define g: = fii,. Then for U € O(X) 
and W € O(Z) we have 
g(W) <USfii.(W) <U 
Si (W) < f*(U) = i4g* VU) 
<W < g*(U) 
using the injectivity of i, at the final step. The verification of the Frobenius 
reciprocity condition is similar to the previous case. 
For the second assertion of (ii), suppose given a: X — X’. We may identify 
O(Y xx X’) with the tensor product O(Y) Qox) O(X’) of O(X)-modules in 
CjSLat, by 1.1.9; in terms of this description, we have to show that if U € O(X') 


and W € O(Y xx X’) are such that (i*@1)(W) < Z@U, then W < YQU. But 
W may be written as a join of tensors V QU’, so it suffices to prove the result for 
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a single such tensor, i.e. that i*(V) @U’ < Z@U implies V @U' < Y QU. Now 
the projection Z x x X’ — X’ is open, and by the proof of 3.1.11 the inequality 
*(V) @U' < Z @U is equivalent to a*gi*(V) NU’ < U in O(X’'). Since 7 is 
fibrewise dense, we have gi* = fi; so this inequality is in turn equivalent to 
a* fi(V) AU’ < U, and hence to V @U'’ < Y @U in O(Y xx X’). oO 


Next, we note an important characterization of local homeomorphisms 
in Loc. 


Lemma 3.1.15 A morphism f: X — Y in Loc is a local homeomorphism iff 
both f and the diagonal map X — X xy X are open. In particular, a locale X 
is discrete iff both X — 1 and the diagonal X — X x X are open maps. 


Proof The second assertion is the case Y = 1 of the first, by the remark after 
1.6.3. (On the other hand, the first could be deduced by applying the second to 
internal locales in the topos Sh(Y).) It therefore suffices to prove the first. 

If f is a local homeomorphism, then it is clearly an open map, since any 
open U > X can be covered by opens V for which fiV is open in Y (and 
homeomorphic to V}, and fı preserves unions. On the other hand, the pullback 
X xy X is also a pullback in the category LH, by 1.3.2(iv); so the diagonal 
X — X xy X is also a local homeomorphism, and hence open. 

Conversely, suppose f and the diagonal are both open. Let us call an open 
U € O(X) small if U @U < A in O(X xy X) S O(X) 8o) O(X), where 
A is the element of this frame corresponding to the diagonal. We know that 
O(X) Qoy) O(X) is generated as a complete join-semilattice by the elements 
U & V, so we have 


A=\/{U@V|U,VEO(X), U@V <A}; 


but U QV < A implies U @V = V @U = (UNV) @(UNYV,), since the diagonal 
is the equalizer of the identity on X xy X and the twist map. So 


A=\/{U@U|UEO(X), UU <A}. 
Applying (771), we obtain 
X = (m)(A) = \V/{(m)(U 8 U) | U small} 
< {U € O(X) | U small} 


since U @U < U & X = r{(U) for any U, small or not. Now for any U € O(X) 
the composite U — X — Y is open; but the assertion that U is small says 
precisely that this map is a monomorphism in Loc, and hence it is an open 
inclusion by 3.1.12(ii). So we have shown that X can be covered by opens U 
for which U — X — Y is an open inclusion; but this was how we defined local 
homeomorphisms in Section C1.3. o 
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For an internal locale X in a Boolean topos, the unique continuous map 
X — 1 is always open, by the remark before 3.1.9. In a non-Boolean topos this 
is not so; however, the Frobenius reciprocity condition is automatic in this case. 
The reason is that the O(1)-linearity of the left adjoint X1: O(X) — O(1) of X* 
is simply part of what it means to say that X, is a morphism of internal posets in 
the topos S in which X lives (equivalently, that its externalization in the sense of 
B2.3.3 is an S-indexed functor). Alternatively, we could derive it from 1.2.17(ii), 
since the terminal locale 1 is regular: note that the existence of a left adjoint 
for f*: O(Y) — O(X) says that for each open U — X there is a smallest open 
V + Y through which U >= X — Y factors, but if every sublocale of Y is an 
intersection of open sublocales then the latter must coincide with the image of 
U under f. 

It is natural to say that X is an open locale if X — 1 is an open map, even 
though this requires us to take care in distinguishing between open sublocales of 
a locale X and sublocales of X which are open locales ‘in their own right’. (The 
former implies the latter if X itself is an open locale, but in general there is no 
implication in either direction.) 

We note in passing that a locale X is spatial iff it is a surjective image in 
Loc(S) of a discrete locale. Thus the openness of discrete locales, combined with 
3.1.4(ii), yields a result which we observed for localic toposes over Set in 1.6.6(i): 


Corollary 3.1.16 Every spatial locale is open. 0 


Open locales form a useful generalization of the class of spatial locales, which 
tend to be scarcer in non-Boolean toposes than one might expect. For example, 
in Section D4.7 we shall encounter the locale of ‘formal reals’, whose space of 
points is the Dedekind real number object, and we shall see in D4.7.13 that it is 
not always spatial; but it is easily seen (from the definition of the formal reals, 
and from Lemma 3.1.18 below) that it is always open. 

If X is a locale, we say an element U of O(X) is positive if every open cover 
of U is inhabited, i.e. if the sentence 


(YS: PO(X)((V S =U) = (3V : O(X))(V € $)) 


is valid in the internal language of our topos S. (Classically, of course, every 
element other than the bottom element @ is positive, but constructively this is 
a stronger condition than ‘being nonzero’.) If X is an open locale, it is easy to 
see that the left adjoint à: O(X) — O(1) = Q of the unique frame homomor- 
phism *: Q — O(X) sends U to the truth-value of the assertion ‘U is positive’. 
Moreover, we have the following result: 


Lemma 3.1.17 For a locale X, the following are equivalent: 


(i) X is an open locale. 


(ii) Every element of O(X) can be covered by positive elements. 
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(iii) Every covering of an element of O(X) can be refined to a covering by 
positive elements. 


(Conditions (ii) and (iii) are of course to be interpreted as sentences in the 
internal language of the topos in which X lives.) 


Proof (i) > (ii): If X is open, then for every U: O(X) we have U < A*\(U); 
but for any p: Q we have 


A*(p) = VIV: O(X) | (V = X) Ap}, 


since p = V{q: Q | (q = T) Ap} and »* preserves the top element and joins. 
Thus we obtain U < V{V: O(X) | (V = X)AA(U)}, whence U = V{V: O(X) | 
(V = U) A A(U)} by the infinite distributive law 1.1.1. But since A (U) is the 
assertion that U is positive, the latter is a covering of U by positive elements. 

(ii) = (iii) since, if (ii) holds, we can refine any covering S of U to a covering 
which is the union of coverings of each of the members of S by positive elements. 
And (iii) = (ii) follows by applying (iii) to the singleton cover U = V{U}. 

(ii) => (i): Let us, for the moment, define \,(U) to be the truth-value of the 
assertion ‘U is positive’; then we have to show that A; is left adjoint to A*. The 
definition of \*(p) as a join easily implies that we have 


(Vp: Q)AA*(p) > p), 


which yields the counit of the adjunction. For the unit, we note that if U has 
any covering at all by positive elements then it must be covered by the set 
{V: O(X) | (V <U)AX(V)}, and since à is order-preserving this is contained 
in the set {V: O(X) | (V <U)AXA(U)}. And this in turn is a refinement of the 
set {V: O(X) | (V = X) A A(U)}, whose join is defined to be A*A (U); so the 
latter must be greater than or equal to U. O 


Lemma 3.1.17, sometimes known as the ‘Positive Covering Lemma’, forms 
a useful constructive substitute for the classical idea that, in considering open 
coverings of a set in a topological space, we can restrict our attention to coverings 
by nonempty open sets. Using it, we may establish an important characterization 
of open locales in terms of sites: 


Lemma 3.1.18 
(i) Let (A,T) be a site whose underlying category is a poset, such that every 
T-covering family is inhabited. Then the locale corresponding to the frame 
T-Idl(A) of T-ideals of A (cf. 1.1.16(e)) is open. 
(ii) A locale X is open iff O(X) is isomorphic to T-Idl(A) for a site (A,T) as 
Proof (i) We claim first that, for each S C A, the T-ideal (S) generated by 
S (i.e. the intersection of all T-ideals containing S) has the same support as S 
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itself (where the support of a set is defined to be the truth-value of the assertion 
that it is inhabited). To see this, note that for any p: Q the set {a: A | p} isa 
T-ideal whose support is contained in p (indeed, equal to p if A is inhabited); but 
if we take p to be the support of S then it contains S, and hence must contain 
(S). It now follows easily that any inhabited T-ideal is positive as an element 
of the frame of T-ideals, since the join of a family of T-ideals is the T-ideal 
generated by their union, and the latter can only be inhabited if the family 
itself is inhabited. But any T-ideal has a covering by inhabited ones (specifically, 
I = \V {(a) | a € I}); so by 3.1.17 the locale corresponding to this frame is open. 
(ii) One direction is part (i). Conversely, suppose X is open. Let A be the 
set of positive elements of O(X), and for each U € A let T(U) denote the set 
of subsets of A whose join (in O(X)) is U. Clearly, every member of T(U) is 
inhabited. But A is a basis for X, by 1.1.17(ii), so we have a bijection between 
T-ideals in A and arbitrary elements of O(X), given by I |> VI and Ve {U: 
A|U < V}. (This is really an instance of the Comparison Lemma; cf. 2.2.4(b).) 
Q 


We remark that the proof of 3.1.18(i) in fact shows that if every T-covering 
family is inhabited, then the nucleus on the frame LA of lower subsets of A 
which corresponds to the fixset T-Idl(A) is strongly dense (i.e. fibrewise dense 
relative to Q). Thus the result could alternatively be deduced using 3.1.14(ii). 
Note also that, in contrast to 2.4.8, 3.1.18(i) does not require the assumption 
that T satisfies the ‘local character’ axiom (L). The same is true of its analogue 
for arbitrary bounded geometric morphisms, although we shall in fact prove the 
latter under that assumption. 


Proposition 3.1.19 
(i) Let (C, J) be an internal site in a topos S, such that every J-covering family 
is inhabited. Then the geometric morphism Shs(C, J) — S is open. 


(ii) A bounded geometric morphism f:E — S is open iff there exists a site of 
definition for E over S in which every cover is inhabited. 


Proof (i) We note in passing that, by 2.3.20 and 3.1.9(ii), we may reduce 
to the case when the underlying category C is a preorder (so that Shg(C, J) is 
localic over S). However, rather than relying on 3.1.18, we shall give a direct 
proof that the ‘constant sheaf’ functor A: S > Sh(C, J) is sub-logical. 

In fact, using 3.1.1, we shall verify separately that the comparison maps for Q 
and for exponentials are monic, rather than working directly with the comparison 
maps for power objects. First we consider 2. We note that the condition that all 
sieves are inhabited ensures that each constant functor C°P — S is separated; 
hence, to obtain its associated sheaf, we need only apply the ‘+ construction’ of 
2.2.6 once, rather than twice. Thus, for an object U of C, an element of AQ(U) 
may be represented by a pair (R,a), where R is a J-covering sieve on U and 
a assigns to each f € R a truth-value a(f) € 2, subject to the condition that 
a(f) = a(fg) for every g composable with f (i.e. a is ‘constant on connected 
components of R’). Moreover, two such pairs (R, a) and (S, 3) represent the same 
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element of AQ(U) iff there is a covering sieve T C ROS such that the restrictions 
of a and 8 to T agree. On the other hand, the subobject classifier of Sh(C, J) 
is the sheaf which sends U to the set Sv 7(U) of all J-closed sieves on U; and 
the comparison map (¢1)y: AQ(U) — Sv z(U) sends the element represented by 
(R, a) to the sieve S defined by saying that the truth-value of (f: V > U € S) 
is the truth-value of the assertion that the sieve 


{g:W > V | fg E€ Rand a(fg)=T} 


is J-covering. However, we note that if f happens to belong to R then the truth- 
value of this assertion is precisely a(f) (again, this uses the assumption that 
every J-covering sieve is inhabited, plus the fact that œ is constant on connected 
components of R); hence if (R,a) and (S, 8) are mapped to the same J-closed 
sieve then a and 3 must agree on RMS, and so they define the same element of 
AQ(U). Thus ¢; is monic. 

The argument for exponentials is similar. Let A and B be two objects of S; 
then, as before, elements of A(B4)(U) are represented by pairs (R,a) where R 
is a J-covering sieve on U and a is now a function R — B4 which is constant on 
connected components. On the other hand, elements of AB44(U) are natural 
transformations C (—,U) x A — AB (where we have written A for the constant 
functor with value A); and the comparison map 6y sends the element represented 
by (R,a) to the transformation whose value at a pair (f: V — U,a) is repre- 
sented by (f*(R),(g > a(fg)(a))). We note that if f € R then ly € f*(R); 
in particular f*(R) is connected, and so the function which forms the second 
component of the latter representative must be (globally) constant. Hence if 0y 
maps (the elements represented by) (R,a) and (S, 3) to the same natural trans- 
formation, then for any f E€ RNS and any a € A we see that a(f)(a) = G(f)(a) 
(this uses the fact that every cover of the domain of f is inhabited), and hence 
(R,a) and (S, 8) represent the same element of A(B“)(U). So @ is monic, as 
required. 

(ii) Once again, one direction is part (i). For the converse implication, suppose 
f: E — Sis open, and let B be a bound for E over S. Then from B3.1.6 we know 
that the S-indexed family of all subobjects of B (indexed by f,(PB)) may be 
taken as a separating family for E gua S-indexed category. However, for every 
object A of E, the internal locale f,(PA) is an open locale in S; this means 
that (in the internal logic of S) every subobject of B has a covering by positive 
subobjects (where an object A of £ is positive iff E/A — E — S is surjective), 
and hence that we may alternatively take the family of all positive subobjects of 
B (indexed by the object J in the pullback 


I >——> f.(PB) , 


Àr 


1 >> Qs 
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where »; is as usual the left adjoint of the unique frame homomorphism) as a 
separating family. And if we use the latter to construct a site of definition for E 
over S, then it is clear that it will have the property that every cover is inhabited. 
i o 


Of course, if S is Boolean, then every bounded S-topos may be defined by a 
site with all covers inhabited, as we saw in 2.2.4(e): given an arbitrary site, we 
simply eliminate those objects which are covered by the empty sieve, and impose 
the induced coverage on the full subcategory on the objects which remain. But 
in general this process will not work, unless it happens that the subobject of Co 
consisting of objects with empty covers is complemented, and we also know that 
every cover is either empty or inhabited. 

Before leaving this topic, we note a further refinement of 3.1.19. 


Scholium 3.1.20 A bounded geometric morphism f: E — S is an open surjec- 
tion iff there is a site of definition (C, J) for E over S such that C has a terminal 
object and every J-covering sieve is inhabited. 


Proof One direction follows immediately by combining 3.1.19 with 2.4.8. 
Conversely, if f is surjective as well as open, then the site of definition for E 
constructed in the proof of 3.1.19(ii) will contain the terminal object of £, pro- 
vided the latter occurs as a subobject of our bound B (and we can ensure the 
latter by replacing B by B H 1, if necessary). O 


Corollary 3.1.21 A geometric morphism f: F — E between bounded S-toposes 
is an open surjection iff it can be defined by a fibration of sites 
(P,T): (D, K) —> (C, J) in S for which P preserves covers. 


Proof One direction is immediate from 3.1.20: if we take an internal site for F 
over E satisfying the conditions of 3.1.20 and ‘externalize’ it as in 2.5.7, we obtain 
a fibration of sites satisfying the condition given here. Conversely, suppose f is 
induced by a fibration of sites as above; then by 2.5.10 we already know that f 
is surjective. We shall show that f* is sub-logical, using 3.1.1. 

We recall that f* is the composite 


Sap ip 
Shs(C, J) maine (CP, S] is [D°?, S] As Shs(D, K) 

where 1, is the inclusion and j* the associated sheaf functor. Now the first 
factor is cartesian closed by A4.2.9, and the second is sub-cartesian-closed since 
P, being a fibration, satisfies the dual of the condition of 3.1.2. Also, since P 
preserves covers, it is easy to see that Ao P is K-separated for any J-sheaf A on 
C (cf. the remark after 2.3.3), from which it follows easily that the comparison 
map j*((Bo P)'4°P)) > j*(B o P) P) is monic for any A and B. So f* is 
sub-cartesian-closed. 

It does not seem possible to prove by the same method that the comparison 
map f*(Qe) — Qs is monic, but we can easily do this by direct calculation. 
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Qe(P(U,V)) is the set of J-closed sieves on U, whereas 2(U,V) is the set of 
K-closed sieves on (U,V). But if R is any J-closed sieve on U, then the sieve 
{(a, b): (U', V’) — (U,V) | a € R} is easily seen to be K-closed, using the fact 
that P preserves covers; so the canonical map from the first set of sieves to 
the second is injective, and applying the associated K-sheaf functor yields the 
result. Q 


We remark that 3.1.21 provides a further reason why we did not include the 
existence and preservation of finite limits in our definition of a fibration of sites 
(cf. 2.5.9): if we had done so, then the condition in 3.1.21 would imply that 
P was a morphism of sites, and so it would induce not just a single geometric 
morphism but an adjoint pair of morphisms as in 2.3.23. 


Corollary 3.1.22 Let 


fs fo fi 
Ss 


E2 ———> E ———> Eo 


be an inverse sequence of bounded S-toposes (where S has a natural number 
object), such that the transition maps fn are all open surjections. Then the legs 
fZ: Exo —> En of the limiting cone in BTop/S are open surjections. 


Proof We use the technique of 2.5.11 to express E% as the topos of sheaves 
on a ‘limit site’ (Coo, Joo) constructed from a sequence of fibrations of sites, each 
satisfying the condition of 3.1.21. As we noted after 2.5.14, we do not have an 
explicit description of all the sieves in the coverage Jæ, but the generating sieves 
which arise from covers in J, for some n form a coverage (that is, they satisfy 
the stability axiom (C’)), and they are all preserved by P% , from which it follows 
that P% preserves all covers in Jæ. So the result follows from 3.1.21. CO 


Next, we consider the stability of open maps and open surjections under 
pullback in Yop. As with hyperconnected morphisms (cf. 2.4.11(ii)), the stability 
theorem has two entirely different proofs, depending on which side of the pullback 
Square we assume to be bounded. 

One of the two cases follows easily from 3.1.19 and 3.1.20 — or rather, it would 
follow from these results if we had not assumed in the proof of 3.1.19 that our 
coverage had local character (cf. 2.4.9). Since we did so, we shall instead use the 
hyperconnected-localic factorization to reduce to the localic case, for which we 
can use 3.1.18 (whose proof did not assume local character). 


Theorem 3.1.23 Let 
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be a pullback square in Top, where f is open and bounded. Then k is open. If f 
is also surjective, then so is k. 


Proof Since we know that the hyperconnected-—localic factorization is stable 
under pullback (2.4.12), it suffices by 3.1.9 to prove the result when f is not 
merely bounded but localic. But then we may express F in the form Sh¢e(B,T), 
where B is a poset and T is a coverage in which every cover is inhabited; as 
we observed in 2.4.9, the latter condition is inherited by the pullback coverage 
g*T, and so k is open by 3.1.18. For the surjective case, we need the extension 
of 3.1.18 (similar to 3.1.20) where the poset B is assumed to have a top element; 
but this is straightforward. Q 


To prove the stability of arbitrary open maps and surjections under bounded 
pullback, we may of course deal separately with the cases when the bounded 
morphism is an inclusion and when it has the form [C,€] — E. In both cases, 
the key ingredient is 3.1.7, characterizing open maps as those whose inverse 
image functors are sub-logical. 


Lemma 3.1.24 Let f: F — E be a geometric morphism, and C an internal 
category in E. If f is open (resp. surjective), then so is the induced morphism 
IC FCF] = [CE]: 


Proof  [C, f]* is simply f* applied to discrete opfibrations over C. It is clear 
that this is faithful if f* is, so the ‘surjective’ case is immediate. The ‘open’ case 
is not much harder: if F is a diagram of shape C in €, corresponding to a discrete 
opfibration F — C, then its power object in [C, E€] may be computed in £ as a 
subobject of PF, x Co, equipped with a suitable left C-action, and it is not hard 
to show that the diagram 


Per —"" —_» P(g) 


OF, x 1l 
f (PF; x Co) ————> P(f*Fi) x f*Co 


commutes, where ¢ is the comparison map for [C, f]*. Thus if ¢ is monic, so is ¢. 
E 


To deal with pullbacks of inclusions, we need a result which is of interest in 
its own right. 


Lemma 3.1.25 Let f: F —E be an open geometric morphism; suppose that 
j is a local operator on E€, and let k be the pullback local operator on F 
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(cf. A4.5.14(e)). Then there are pullback squares 


PO) == 0%. and fJ) — K 


Qı Qı 


f°(Q) ——> 2 Pe 


in F (where the symbols in the top row have their usual meanings). 


Proof We recall the construction of k given in A4.5.14(e): it is the smallest 
local operator such that the composite f*(J) — f*(Q) — Q factors through 
K — Q.. In particular, we have 


(Qk = Q) = Va. ((7I(f* I) > O) > 1x Q) 
and 
(K = Q) = Va, ((73 (4) = OB) I x Q) 


where O — Q x Q, as in A4.5.12, is the equalizer of > and 72. But since f* is 
a Heyting functor, it commutes with implication and universal quantification; 
thus it is easy to see that we have a pullback square 


P (Op) Oe 


Poca) 0 


and hence that the squares in the statement of the lemma are also pullbacks. Q 


We remark that the conclusion of 3.1.25 actually holds under the weaker 
assumption that only the comparison map ¢): f*(Qe) > Qs is monic. For a 
proof under this assumption, see [513]. 


Corollary 3.1.26 Let f and j be as in 3.1.25, and let f’: sh,(F) — sh, (£) 
be the pullback of f along sh,(E) — E. Then 

(i) f’ is open; 

(ii) if f is surjective, so is f’. 


Proof (i) First we note that the comparison map ¢} for the geomet- 
ric morphism f’ is obtained by applying the associated k-sheaf functor 
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to the morphism f*(Q;) —> Qk which forms the top edge of the left- 
hand square in the statement of 3.1.25, since the lemma implies that the 
square 


PO) 
f*(T5) Tk 


f*(Q3) ——\> Nk 


is a pullback. So we may immediately deduce that j is monic, since ¢1 
is. To show the same thing for an arbitrary comparison map ¢/,, we 
recall that sh,(F) is an exponential ideal in F (A4.3.1) and that BC & 
BŁ? for any sheaf B, where L is the associated sheaf functor. Using 
the ideas in the proof of 3.1.1, we may construct ¢/, by factoring the 
composite 


FULA 2 A —S (A & (IA 


through the associated k-sheaf of its domain; but this composite is monic, 
since both @ and f*(Q,) — Qk are, and the associated k-sheaf func- 
tor preserves monomorphisms. So we have verified that f’* is a sub-logical 
functor. 

(ii) Let m: A’ > A bea monomorphism in €E, with classifying map x: A > Q. 
Since the right-hand square in the statement of 3.1.25 is a pullback, we see 
that f*m is k-dense iff f*x factors through f*J — f*Q (equivalently, iff the 
pullback of f* J — f*Q along f*x is an isomorphism). But if f* is conservative, 
this happens iff x factors through J — Q., ie. iff m is j-dense. Since the k- 
dense monomorphisms are exactly those inverted by the associated sheaf functor 
F — sh;(F), it now follows by A1.2.4 that f” is conservative. o 


Putting together 3.1.24 and 3.1.26, we have established the second version of 
the pullback-stability theorem: 


Theorem 3.1.27 Let 


NN 


H G 
| ! 
f 
F—— >€ 


be a pullback square in Lop, where f is open and g is bounded. Then k is open. 
If f is also surjective, then so is k. E 
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Finally (!) in this section, we may establish the connection between openness 
and the Beck—Chevalley conditions of 2.4.16: 


Theorem 3.1.28 A geometric morphism is stably left weak Beck-Chevalley (in 
the sense of 2.4.16) iff it is open. 


Proof First suppose f: 7 — E is a SLWBC morphism. Then, for any 
morphism g: B > A in E, the weak Beck—Chevalley condition holds for the 
square 


FJB LS Fra ; 


f/B f/A 


£/B —2-> E/A 


i.e. the canonical natural transformation (f/A)*II, — II;-g(f/B)* is monic. 
Combining this with the natural isomorphism (f/B)*g* = (f*g)*(f/A)*, and 
recalling that the composite II,g* is the exponential functor (—)9: €/A — &/A, 
we deduce that (f/A)* is sub-cartesian-closed, for any object A of E. But we 
showed that this is equivalent to openness of f in 3.1.7(v). 

Conversely, suppose f: F — E is open. Since openness is stable under pull- 
back, it suffices to show that f is a LWBC morphism. But if we consider the 
pullback of f along an arbitrary bounded morphism g: G — €E, then on forming 
the hyperconnected-localic factorizations of both f and g we may ‘factor’ the 
pullback square into four quarters, as in the proof of 2.4.13; and in three of them 
the weak Beck—Chevalley condition holds by A4.6.8. Thus we are reduced to the 
case when both f and g are localic (and f is still open, by 3.1.9). If we now 
replace them by the corresponding internal] locales in E, then 3.1.11(ii) tells us 
that the (strong) Beck—Chevalley condition holds for subterminal objects of the 
toposes involved (actually for the left adjoints of the functors we are now con- 
sidering, but the result we want follows from uniqueness of adjoints). Since our 
hypotheses are ‘stable under slicing’ by an object of €, it follows that we have 
verified condition (iii) of A4.1.17, so by the latter result the weak Beck—Chevalley 
condition holds for arbitrary objects of the toposes under consideration. O 


Suggestions for further reading: Johnstone [513, 523], Joyal & Tierney [560]. 


C3.2 Proper maps 


Having studied open maps of locales and toposes, the reader might reasonably 
expect us next to turn our attention to closed maps. We begin with closed maps 
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of locales, which were deliberately omitted from Section C1.5: 
Lemma 3.2.1 Fora map f: X —Y of locales, the following are equivalent: 


(i) For every closed sublocale C of X, the image of the composite C — X — Y 
is a Closed sublocale of Y. 

(ii) For all U € O(X) and V € O(Y), we have f,(UU f*(V)) = fa(U) UV, 
where f,: O(X) — O(Y) denotes the right adjoint of f*. 


Proof Since closed sublocales are complementary to open sublocales 
(1.1.16(b)), it is clear that, for any closed sublocale CU of X, Cf,(U) is the 
smallest closed sublocale of Y through which the composite CU — X — Y fac- 
tors — i.e. it is the closure of the image of this composite. Condition (i) is thus 
equivalent to saying that, for every such U, the induced locale map CU — Cf,(U) 
is epic. But, if we identify O(CU) with the principal filter }(U) in O(X), the 
frame homomorphism corresponding to this map sends an open V > f,(U) in 
O(Y) to UUf*(V), and its right adjoint is simply fẹ restricted to elements above 
U. So condition (ii), for a fixed U € O(X), is easily seen to be equivalent to the 
assertion that the unit of this adjunction is an isomorphism. Q 


Note that condition (ii) of 3.2.1 is precisely the assertion that the adjunction 
(f. 1 f*) between O(X)°? and O(Y)°? satisfies the Frobenius reciprocity condi- 
tion of A1.5.8. So, if O(X) and O(Y) happen to be coframes as well as frames, 
it is equivalent to saying that f* preserves ‘co-exponentials’. 


Remark 3.2.2 If O(Y) is Boolean, then every locale map X — Y is closed: 
this follows from 3.2.1(i) and the fact that every sublocale of Y is closed 
(cf. 1.1.20(ii)), but it may also be deduced from 3.2.1(ii), since if V has a 
complement W in O(Y) then the identity of 3.2.1(ii) is clearly equivalent 
to f.(f*(W) =U) =(W=> f.(U)), and this follows easily from the adjunction 
(f* 4 f..) and the fact that f* preserves finite meets. 


Lemma 3.2.3 Closed maps of locales are stable under pullback along local 
homeomorphisms. 


Proof Suppose given a pullback 
f' 


Xe Sy" 


X ——> Y 


where f is closed and q is a local homeomorphism. We must show that the 
condition of 3.2.1(ii) holds for opens U’ € O(X’), V’ € O(Y'). Consider first 
the case when q is an open inclusion; then we may regard Y’ as an element of 
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O(Y), and V’ and U’ may be identified respectively with an element of O(Y) 
satisfying V’ < Y’, and an element of O(X) satisfying U’ < X’ = f*(Y’). Under 
this identification, we have f{(U’) = f,(U’) NY’, and hence 


UUW! = (RU) AY) UV! 
=(f,(U')UV')NY' since V’ < Y’ 
= f,(U'Uf*(V’))NY’ since f is closed 
= f,(U'U f*(V')). 


Now consider the general case: then we have a covering of Y’ by open sublo- 
cales Y; such that each composite Y; — Y’ — Y is an open inclusion. Using the 
argument above, we may deduce that (fi (UU VYA Y; = fi(U' U f"(V')) OY, 
for each 2; so the result follows by the infinite distributive law in O(Y). E 


We thus define an arbitrary geometric morphism f: F — E to be closed if, 
for every object A of € and every closed subtopos C of F/f*(A), the image of 
the composite C + F/f*(A) — €/A is a closed subtopos of €/A. By 3.2.3, this 
agrees with our previous definition for maps between localic toposes. 

The next (non-constructive!) lemma should be compared with 3.1.2. 


Lemma 3.2.4 Let f:C —PD be a functor between small categories. Then the 
induced geometric morphism |C, Set] — [D, Set] is closed iff f satisfies the fol- 
lowing condition: given any b: V > fU in D, there exist morphisms a: U' + U 
inC andi: V — fU', r: fU' 4 V in D such that ri = ly and (faji = b. 


Proof We note first that the given condition is stable under pullback along 
discrete opfibrations: given a discrete opfibration p: D’ — D, corresponding to 
a functor P: D — Set, lifting the diagram formed by the morphisms a, b, r 
and 7 to a similar diagram in D’ is equivalent to choosing an element of P(V). 
Hence, if the condition is satisfied, it suffices to show that the image under the 
geometric morphism of any closed subtopos of [C, Set] is closed in [D, Set]. But 
this is easy; for, as we saw in A4.5.5, the closed subtoposes of [C, Set] are those 
of the form [S, Set], where S is a sieve in C. And by A4.2.12(b), the image of the 
composite geometric morphism [S, Set] — [C, Set] — [D, Set] may be identified 
with [7, Set], where T is the full subcategory of D on all objects which are 
retracts of ones of the form fU, U € ob S. But the given condition clearly 
implies that 7 is a sieve if S is; so the image is a closed subtopos of [D, Set]. 
Conversely, suppose [C,Set] — [D,Set] is closed. Given a morphism 
b: V — fU in D, we first slice over the object D(V,—): that is, we pull back 
to the morphism [(V | f), Set] — [V\D,Set] induced by f. Now consider the 
closed subtopos of [(V | f), Set] corresponding to the principal sieve S generated 
by b (i.e. the sieve of all objects b': V — fU’ which admit a morphism to b). 
This is clearly nonzero, so its image [T, Set] (where 7, as before, is the full sub- 
category of V\D on those objects which are retracts of some b’: V > fU’ in S) 


630 CLASSES OF GEOMETRIC MORPHISMS C3.2 


is nonzero, and if it is closed (i.e. if 7 is a sieve) then 7 must contain the initial 
object ly: V — V of V\D. But this says precisely that the given condition is 
satisfied for the morphism b. O 


Note that the condition of 3.2.4 is dual not to the condition of 3.1.2 for open- 
ness of a geometric morphism [C, Set] — [D, Set], but to the weaker condition of 
3.1.3 for f* to be sub-cartesian-closed. The explanation is that the condition of 
3.1.3, unlike that of 3.2.4, is not stable under pullback along discrete opfibrations 
(equivalently, the condition of 3.2.4 is not stable under pullback along discrete 
fibrations), although it is easy to see directly that it does imply that every open 
subtopos of [C, Set] maps to an open subtopos of [D, Set]. And if we ‘stabilize’ 
it under pullback along discrete opfibrations, we obtain a condition equivalent 
to that of 3.1.2, by 3.1.7(v). 

The preceding paragraph indicates one respect in which the class of closed 
maps is not as well-behaved as that of open maps: closed maps are not in general 
stable under pullback. Indeed, this failing already manifests itself in the category 
of locales (and even in the category of spaces). In Sp, the remedy for it is well 
known: we need to restrict our attention to the closed maps which have compact 
fibres, otherwise known as proper maps. We shall therefore devote the bulk of this 
section to setting up the theory of proper maps for locales and toposes, which 
is largely due to J. Vermeulen [1203] in the localic case, and to I. Moerdijk and 
J. Vermeulen [858] in the topos-theoretic one. 

There is a remarkable ‘duality’ between this theory and the Joyal—Tierney 
theory of open maps, described in the last section. Where the Joyal—Tierney 
theory exploits the relationship between the categories Frm and CjSLat, 
Vermeulen’s theory makes use of the relationship between Frm and the cat- 
egory PFrm of preframes, which we also studied in 1.1.10. By analogy with 
3.1.10, we make the following definition: 


Definition 3.2.5 A map of locales f: X — Y is said to be proper if 
f*: O(Y) — O(X) has a right adjoint f,: O(X) — O(Y) in PFrm which is 
O(Y )-linear. 


Of course, the right adjoint fẹ always exists as a map of ordered sets; so the 
force of Definition 3.2.5 is that it is a map of preframes (i.e. preserves directed 
joins) and is O(Y)-linear (i.e. satisfies condition (ii) of 3.2.1). In particular, 
proper maps are closed. 


Proposition 3.2.6 Proper maps, and proper surjections, are stable under pull- 
back in Loc; proper surjections are regular epimorphisms in Loc, and proper 
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monomorphisms are inclusions. Moreover, if 


pissy 

h l 
f 

X ———> Z 


is a pullback square in Loc with f (and therefore k) proper, then the Beck- 
Chevalley square 


O(X) Bie O(Z) 
h* g* 


ky 
O(P) ——> O(Y) 
commutes. 


Proof This is proved exactly like 3.1.11 and 3.1.12, replacing the tensor prod- 
uct O(X) @ez)O(Y) of O(Z)-modules in CjSLat with the corresponding tensor 
product O(X) Ooz) O(Y) of O(Z)-modules in PFrm, and replacing binary 
intersections by binary unions (and left adjoints by right adjoints) throughout 
the argument. 


Remark 3.2.7 T. Plewe [979] studied a common generalization of open and 
proper surjections of locales, which he called triquotient maps (this term was first 
used for maps of spaces by E. Michael [807]). A locale map f: X — Y is called 
a triquotient map if there exists a function fy: O(X) — O(Y) which preserves 
directed joins and satisfies both the Frobenius conditions 


fe(UNf*V)) =fe(U)OV and fe(UUF(V)) =fe(U)UV 


for U € O(X), V e O(Y). (Note that these two equations force f to be an 
epimorphism, since the first implies fy f*(V) < V and the second implies the 
reverse inequality.) Clearly, if f is an open surjection then fi satisfies these 
conditions (the second Frobenius condition follows from surjectivity and the fact 
that fı preserves binary joins), and if f is a proper surjection then f, does so. 
(Another class of examples is provided by split epimorphisms in Loc: if f is a 
split epimorphism, then we can take fy = s* for any one-sided inverse s for f.) 
In general, for a triquotient map f there may be many different choices for the 
‘triquotient assignment’ fy; but Plewe has shown that if we have a pullback 
Square as in 3.2.6 and f is triquotient, then k is also triquotient, and we may 
choose its triquotient assignment so that the ‘Beck—Chevalley’ square commutes. 
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If X is a locale in a Boolean topos, then the unique locale map y: X — 1 
is proper iff y, preserves directed joins (i.e. is a morphism of preframes), by 
Remark 3.2.2; but this is exactly how we defined compactness for locales in 1.5.5. 
Constructively, the Frobenius reciprocity condition in the definition of propriety 
(unlike that for openness) is a nontrivial restriction even on locale maps with 
codomain 1; so we include it in the definition of compactness — that is, we define 
X to be compact if X — 1 is proper. 

The justification for studying proper maps of locales is the following result, 
in which the equivalence of (i) and (ii) parallels a well-known result for spaces. 


Theorem 3.2.8 Let f: X —Y be a map of locales. The following are 
equivalent: 


(i) f ts proper. 
(ii) f is stably closed, i.e. its pullback along any morphism with codomain Y 
is closed. 


(iii) The wnternal locale in Sh(Y) corresponding to f under the equivalence of 
1.6.3 is compact. 


Proof (i) = (ii) is immediate from 3.2.6 and the fact, already noted, that 
proper maps are closed. 

(ii) => (iii): Condition (ii) is ‘stable’ under passage across the equivalence 
of 1.6.3, since the notion of closed sublocale is stable in this sense. Thus it is 
sufficient to prove the implication (provided we do so constructively) in the case 
Y = 1: that is, given a locale X such that the projection Z x X — Z is closed 
for every Z, we have to show that X is compact. The particular case Z = 1 
takes care of the Frobenius reciprocity condition, so we have only to verify that 
the top element of O(X) is inaccessible by directed joins. Let D be a directed 
subset of O(X) with join X; we construct a ‘test locale’ Z as follows. As at the 
end of Section C1.2, we write d: Xa — X for the dissolution of X; then d*(U) is 
closed as well as open in Xq for each U € O(X) (in particular, for each U € D), 
and we write e(U) for its complement. Let F be the filter in O(X4) generated 
by {e(U) | U € D}, and let O(Z) be the subframe 


{(p,V) |p>(V € F)} 


of Q x O(Xa). (Thus O(Z) is obtained by applying the frame-theoretic version 
of the glueing construction (A2.1.12) to the classifying map x: O(Xa) > Q of 
F: note that x preserves finite meets precisely because F is a filter.) 

Now consider the element 


W=\/{(T,e(U)) ®@U|U ED} 
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of O(Z x X) = O(Z) @ O(X). We have 
W v ((1,Xa)® X)=Wv\/{(1, Xa) 8U |U ED} 
= VET, e(U)) V(1,Xa))@U |U €D} 
=\V/{(T, Xa) @U|UED} =Z@X, 


where we have twice used the fact that D covers X. Since the projection 
mı: Z Xx X — Z is closed, it follows from 3.2.1(ii) that we have 7.(W) V 
(L, Xa) = Zin O(Z). So ™1.(W) must have the form (T,V) for some V € F, and 
hence 7.(W) > (T,e(U)) for some U € D, or equivalently (T,e(U)) 9 X < W. 
Applying the frame homomorphism O(Z) $ O(X) — O(Xqa) which sends 
(p,V) @U to V Ad*(U), we obtain 


e(U) < \/{e(U') Nd*(U') |U' ED} =90. 


But this implies that d*(U) is the whole of X4, and hence (since d is a surjection) 
that U is the whole of X. 

(iii) > (i): Suppose f is such that Z(f) is a compact internal locale in Sh(Y). 
Since, by 1.6.2, the inverse of Z is simply the global sections functor applied to 
internal frames in Sh(Y), it is easy to see that f inherits the Frobenius reciprocity 
condition of 3.2.1(ii) from the unique morphism Z(f) — 1 in Loc(Sh(Y)). So 
we have only to prove that f, preserves directed joins. 

Let D be a directed subset of O(X); it will be convenient to suppose that 
D is also downwards closed, and hence an ideal. Recalling that O(Z(f)) is the 
sheaf on Y whose value at V € O(Y) is O(f*V), we define an ideal D of this 
internal lattice in Sh(Y) by 


DV) = {U € O(f*V) | {V'< V | UN f*V' € D} covers V}. 


It is clear that this a sheaf; moreover, computation of the truth-value of the 
assertion ‘D covers T(f)’ in Sh(Y) yields the element Vo = f«(V D) of O(Y). So 
the compactness of Z(f) implies that f*(Vo) € D(Vo), i.e. that Vo can be covered 
by opens V’ whose inverse images under f are in D. But for each such V’ we 
have V’ < f, f*(V") < Vo; hence Vg is also the join of {f,(U) | U € D}, ie. fu 
preserves the join of D. 0 


Remark 3.2.9 An inclusion in Loc is proper iff it is closed in the sense of 
3.2.1, iff it is the inclusion of a closed sublocale as defined in 1.2.6(b). The 
latter equivalence is immediate from 3.2.1(i); for the former, we note that if 
f: CU — X is a closed inclusion, then f, maps O(CU) onto the principal filter 
t(U) © O(X), and so preserves all nonempty joins. (Alternatively, we could use 
1.2.10 and 3.2.8(ii).) Thus the ‘proper’ analogue of the class of discrete locales 
(3.1.15) is the class of compact Hausdorff locales (cf. 1.2.17). 
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We should also remark that, just as there is a (largely obsolete) tradition in 
general topology whereby ‘compact’ means ‘compact Hausdorff’, and the term 
‘quasicompact’ is used for spaces which satisfy the open-covering definition of 
compactness but are not necessarily Hausdorff, so there is a tradition — which 
has been followed by some writers on locales and/or toposes — whereby ‘proper 
map’ means one which is proper in our sense and also has closed (i.e. proper) 
diagonal; authors who adhere to this tradition use the word ‘perfect’ to describe 
the maps which we call proper. However, ‘perfect’ is also used by some authors 
to denote a stronger condition than properness, so it seems safest to avoid the 
word altogether. Thus we shall say that a locale morphism f: Y — X is proper 
and separated if it corresponds to a compact Hausdorff internal locale in Sh(X), 
i.e. if it is proper and the diagonal Y — Y xx Y is closed. 


Compactness has its expected properties in Hausdorff locales: 


Lemma 3.2.10 
(i) Any continuous map from a compact locale to a Hausdorff locale is proper. 
In particular, any compact sublocale of a Hausdorff locale is closed. 


(ii) A compact locale X is Hausdorff iff it is regular; that is, each U € O(X) 
can be expressed as a union of opens whose closures are contained in U. 


Proof (i) Suppose f: Y — X is a locale map such that Y is compact and 
X is Hausdorff. Then (1, f): Y — Y x X is a closed inclusion, because it is 
the pullback of the diagonal A: X — X x X along f x 1. But the projection 
n2: Y x X — X is proper (because it is a pullback of Y — 1) and hence f = 
m2(1, f) is proper. The second assertion is the special case of the first when f is 
an inclusion, by 3.2.9. 

(ii) One direction was proved (without the assumption of compactness) in 
1.2.17. For the converse, suppose X is compact and Hausdorff, and let U be open 
in X. The complement CU is compact (as is any closed sublocale of a compact 
locale), and so the projection 72: CU x U — U is proper by 3.2.6; equivalently, 
by 3.2.8, it corresponds to a compact internal locale Z(72) in Sh(U). But CU x U 
is disjoint from the diagonal sublocale of X x X, so it is contained in the open 
complement of the latter, i.e. the union of all open rectangles Px Q = XxX 
such that PN Q = 0. Now O(Z(72)) is the internal frame (V + O(CU x V)). 
The sub-presheaf F of this given by 


V= {Z xV | Z= PALU for some P € O(X) with PAV =0} 


is not a sheaf; but we may form its closure F in the sheaf O(Z(r2)) already 
mentioned, that is the sheaf whose sections over V € O(U) are those open 
sublocales W of CU x V such that V has an open covering by sublocales V; for 
which WN (CU x V;) is of the form (P; NCU) x V; for some P; € O(X) disjoint from 
V;. It is easily seen that F is directed in the ordering inherited from O(Z(72)) 
(indeed, it is closed under finite joins in the latter), and that its join is the top 
element CU x U. So by compactness of Z (r2) we must have CU x U € F(U); but 
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this says that U can be covered by opens V for which CU < =V - equivalently, 
for which the closure of V is contained in U. So X is regular. B 


One further result on proper maps of locales which we shall need is the 
following: 


Proposition 3.2.11 Suppose given a diagram of locales (X; | i € ob T) indexed 
by a cofiltered category T, such that the transition maps fa: X, — X; induced by 
morphisms a: j — i in T are all proper. Then the legs g,: Xx = limzX; > X, 
of the limit cone in Loc are all proper. 


Proof First suppose that the transition maps are surjective as well as proper. 
By 1.1.8 and 1.1.10, we know that the filtered colimit O(Xæ) in Frm is also 
a colimit in PFrm; now, for each j, the right adjoints fas: O(X;) > O(X,) 
form a cone in PFrm under the T/j-indexed diagram formed by the O(X;) with 
morphisms a:i —> j, since (fag)sf3 = farfosfh = fas by surjectivity. Since 
T is cofiltered, the forgetul functor T/j — T is initial, and so this diagram 
has the same colimit as the original one; so we get a preframe homomorphism 
O(Xæ) — O(X;) for each j, which is easily verified to be the right adjoint g,. of 
g}, and to be O(X, )-linear (since all the (fa)« are O(X, )-linear). So gj is proper 
(and surjective - though we already knew the latter fact, from 1.1.12). 

Now consider the general case. We may replace each X; by the closed sublo- 
cale C; which is the intersection, over all a: j — i, of the image of fa: Xj > Xi; 
equivalently, 


Ci = UU ta (0) |a: j > a). 


Clearly, each gi: Xœ — X; must factor through Ci — X;, so this change does 
not affect the limit. Also, since the union appearing in the definition of the open 
complement of C; is directed, it is preserved by the right adjoints of the transition 
maps; hence, since they are closed maps, their restrictions to maps Cj — C; are 
surjective as well as proper. So, by the particular case already considered, each 
Xoo — C; is proper; composing with the closed inclusion C; — X; yields the 
result. 0 


In fact a stronger result than 3.2.11 is true: if the vertices X; of our cofiltered 
diagram are compact locales, then the limit Xs is compact, even if the transition 
maps fail to be proper. However, the proof of this stronger result requires a more 
complicated argument, and we shall not give it here - we refer the interested 
reader to [435]. 


In seeking to generalize the notion of proper map from locales to toposes, we 
have to decide whether our definition of ‘proper map of toposes’ should include 
all hyperconnected maps. We have seen in the last section that the natural defini- 
tion of ‘open map’ for toposes does include all hyperconnected maps, and hence 
that a map is open iff the localic part of its hyperconnected—localic factorization 
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is open. Thus, if we wish to preserve the ‘duality’ between openness and proper- 
ness which we have observed at the localic level, we should expect the same 
thing to be true for proper maps: indeed, we may reasonably define a geometric 
morphism f: F — E to be proper if the localic part of its hyperconnected—localic 
factorization is proper, i.e. if f,(Qz¢) is a compact internal frame in £. 

What does it mean to say that an internal frame (in particular, one of the 
form f,(Q¢)) is compact in a general topos €? For the case E = Set, we saw 
how to answer this in topos-theoretic terms in 1.5.5: it means that the direct 
image functor Sh(X) — Set (where X is the locale corresponding to the frame 
in question) preserves directed colimits of subterminal objects. Another way of 
saying the same thing is that, if Z is any directed poset, then the canonical 
natural transformation in the diagram 


IT, Sh(X)] —> [Z, Set] 
limz | lim z 


Sh(X) ———> Set 


is an isomorphism when evaluated at a subterminal object of [Z,Sh(X)]. But 
this is easily recognizable (modulo ‘stabilization under slicing’) as a weak Beck- 
Chevalley condition, given that the limz are inverse image functors. 

Our formal definition is thus as follows: 


Definition 3.2.12 (a) We say that a geometric morphism f: F — E is proper, 
or that F is compact as an €-topos, if, for any object A of € and any directed 
internal poset I in €/A, the Beck—-Chevalley natural transformation for the square 


R f/A 
FIAS SS eA 


CO] CO] 


(I, F/f* A] Ee [I,£/A] 


(where oo; denotes the geometric morphism whose inverse image is limy; 
cf. B2.6.9) is an isomorphism when evaluated at subterminal objects of 
(I, F/f* Al. 

(b) We say that a bounded geometric morphism f: F — € is separated, oF 
that F is Hausdorff as an E-topos, if the diagonal A f: F — F xe F is proper. 


In view of B2.6.13, we could replace directed posets by arbitrary filtered 
categories in this definition; but it seems more natural to use posets here since 
we are dealing with subterminal objects. We could also eliminate the variable A 
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from the definition by working with weakly filtered categories (or weakly directed 
posets), since we saw in B2.6.6 that an internal category C is weakly filtered iff 
it is filtered as an internal category in €/7oC. Note also that the square in the 
definition is a pullback, since its bottom edge is the pullback of f/A along the 
canonical morphism [I,€/A] — €/A; and since f/A is itself a pullback of f, 
the following result is immediate from A4.1.17. 


Lemma 3.2.13 Any stable right weak Beck-Chevalley morphism (as defined 
in 2.4.16) is proper. ad 


In particular, any hyperconnected morphism is proper, by 2.4.17. Once we 
have proved the stability of proper maps under pullback, we shall obtain a 
converse to 3.2.13. 


Lemma 3.2.14 Let f:C—D be a functor between small categories. The 
induced geometric morphism |C, Set] — [D, Set] is proper iff 


(i) For each object V of D, the comma category (V | f) has a finite solution- 
set (that ıs, there exists a finite family of morphisms b,: V — fU; 
(1 < j < n) such that every b: V — fU factors as (fa)b, for some j 
and some a: U, > U inC); and 

(ii) For any b: V — fU, there exist a: U' 3 U, i: V — fU' andr: fU' => V 
such that ri = ly and br = fa. 


Note that condition (ii) above is stronger than the condition of 3.2.4: it is 
dual to the condition of 3.1.2, rather than that of 3.1.3. This is not surprising, 
given the fact that proper maps are stable under pullback (cf. the remarks after 
3.2.4). We note also that the two conditions could be combined into one, saying 
that for any V there exists a finite set of pairs (ij: V — fU,,r,: fU; — V) with 
rji; = ly, such that for any b: V — fU there exists a:U, —U for some j 
such that br; = fa. However, it is usually more convenient to consider the two 
conditions separately. 


Proof First suppose the conditions are satisfied. We note that both of them 
are stable under pullback along discrete opfibrations (if p: D’ — D is a discrete 
opfibration, f’: C’ > D’ the pullback of f along p and V’ an object of D’, then 
the comma category (V’| f’) is isomorphic to (pV’ | f)). So it suffices to prove 
that for any directed internal poset Q in [D, Set] and any f*Q-indexed family of 
subterminal objects of [C,Set], the functor f, preserves the join of this family. 
Now a directed internal poset in [D, Set] is just a functor Q: D — Poset 
whose values are directed posets; equivalently, we may regard it (via the 
Grothendieck construction) as a split opfibration q: Q — D whose fibres are 
directed posets. Similarly, we shall regard P = f*(Q):C — Poset as a split opfi- 
bration p: P — C. Now a subterminal object R in [P, [C, Set]] ~ [P, Set] is just 
a cosieve R in P, and its join in [C, Set] is the cosieve in C which is the image of 
the composite fibration R — P — C. Applying f. to this, we obtain the cosieve 
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of those V € ob D such that, for every b: V — fU, U lies in the image of R; 
equivalently, there exists an index 7 in P(U) = Q(fU) which belongs to the sub- 
set R(U). However, if we first apply the direct image functor [P, Set] — [Q, Set] 
and then compute the join in [D, Set], we obtain the cosieve of those V for 
which there exists an index y in Q(V) such that, for every b: V — fU, we have 
Qb(y) € R(U). It is clear that the first cosieve contains the second; we have to 
show the reverse inclusion. 

To do this, suppose V belongs to the first cosieve. We first choose a finite 
solution set (b,: V > U; | 1 < j < n) as in condition (i), and then for each j 
we choose aj, i, and r; as in condition (ii). By assumption, we have elements 
m, € P(U;) = Q(fU;) (where U; is the domain of aj) which belong to R(U}); 
let yy = Qr,(7;) € Q(V). Since Q(V) is directed, we may choose a single 
index y in it which is an upper bound for the y,; then for each 7 we have 
Qb (Y) > Qb;(7;) = Pa;(7,) € P(U,), and hence Qb;(y) € R(U;) since R is a 
cosieve in P. But since the b; form a solution set for (V | f), this is sufficient. 

To show the necessity of condition (i), let V be an object of D. Pulling back 
along the discrete opfibration corresponding to D(V,—): D — Set, we obtain 
the functor f: (V | f) => V\D induced by f; so the latter preserves directed joins 
of subterminal objects. Now let P be the (external) poset of finitely-generated 
cosieves in (V | f), ordered by inclusion; we may regard it as a constant directed 
internal poset in both [V\D, Set] and [(V | f), Set], and in the latter we have a 
P-indexed family of subterminal objects, namely the elements of P themselves. 
If we compute the join of this family in [C, Set], we obtain the terminal object, 
since each object of C belongs to a finitely-generated cosieve; and f, preserves 
the terminal object. So the initial object ly of V\D belongs to the join of the 
cosieves fS, S € P, and hence belongs to some particular f,S. But this says 
precisely that S is the whole of (V | f); ie. (V | f) is finitely-generated as a 
cosieve in itself. 

We shall not give a detailed proof of the necessity of (ii), since it follows 
from two general results about proper maps to be proved later. We shall see 
in 3.2.17(ii) that proper maps are closed; hence if f:C — D induces a proper 
map between functor categories then it must satisfy the weak version of (ii) 
considered in 3.2.4. But we shall also see that proper maps are stable under 
arbitrary pullback (3.2.21); and we observed after 3.2.4 that if we ‘stabilize’ the 
weak condition under pullback (specifically, along discrete fibrations), we obtain 
the stronger condition (ii) above. (d 


Although the last part of the above proof is non-constructive, the proof of 
sufficiency of the conditions is not; so they can be used to give a criterion for 
a functor between internal categories in an arbitrary topos to induce a proper 
map. (In this context, the notion of finiteness appropriate to condition (i) is, 
as usual, the Kuratowskian one studied in Section D5.4. In particular, it is not 
hard to see that a local homeomorphism €/A — € is proper iff A is a K-finite 
object of E.) 
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Example 3.2.15 Nontrivial examples of functors satisfying the conditions of 
3.2.14 are not all that common, but the following one will be of use in 5.2.9 
below. Let D be the category of finite sets and surjections between them, and 
let C be the poset whose elements are finite composable strings (b1, b2,..., bn) 
of non-invertible morphisms of D (that is, strings such that dom b, = cod b)41 
for all j), ordered by setting (b1,b2,...,bn) < (b{,55,...,0),) iff the latter is 
an initial segment of the former, i.e. m < n and b, = bj for all j < m. (It 
is convenient to include strings of length 0, that is objects of D, as maximal 
elements of C, with (b1,b2,..-,0n) < V iff V = cod bı.) We have a functor 
f:C — D which sends (bi, b2,...,6n) to dom bn, and an inequality U’ < U to 
the composite of the morphisms belonging to the string U’ but not to U; we 
claim that this functor satisfies the conditions of 3.2.14. 

The first condition is trivial, because the comma category (V J f) is in fact 
finite for each object V of D (this is where we use the non-invertibility condi- 
tion in the definition of C). For the second, we must distinguish two cases: if 
b: V — fU is invertible, then we set a = ly, i = b and r = b`}, and if not 
we let U’ < U be the string obtained by adjoining b to the end of U, and set 
a7 = ly. 


We now embark on the study of proper morphisms in general. 


Lemma 3.2.16 Suppose given a commutative triangle 


PEIE = 
h | 
E 


(i) If f and g are proper, then h is proper. 


of geometric morphisms. 


(ii) If h ıs proper and g is a surjection, then f is proper. 
(iii) If h is proper and f is an inclusion, then g is proper. 


Proof (i) If is a directed internal poset in €/A, then f*I is a directed internal 
poset in F/f* A. So we may patch together the weak Beck—Chevalley squares for 
f and g, to obtain that for h. 

(ii) As before, suppose I is a directed internal poset in E/A. The morphism 
[h*I,G/h* A] — [f*1,F/f* A] induced by g is surjective, so any subterminal 
object V of its codomain satisfies V = g.g*V (cf. A4.2.6(v)). But this is exactly 
what we need to deduce the weak Beck-Chevalley condition for f from that 
for h. 

(iii) Here we cannot use the same style of argument, since we do not know that 
every directed internal poset in F is the image under f* of one in E. However, 
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if we write X and Y for the internal locales in E corresponding to the frames 
h,(Qg) and f,(Qs), then we have a pullback diagram in Loc(€), since Y — 1 is 


X 
1 
monic and X — 1 factors through it. So X — Y is a proper map of internal 


locales, by 3.2.6; but this says that g,(Qg) is a compact internal locale in F ~ 
Sh¢e(Y), by 3.2.8. o 


1 
— 


<< e E 


SS 


We remark that, for a closed inclusion f, the direct image functor f, is regular 
(i.e. preserves epimorphisms) by A4.5.4, and hence preserves directed internal 
posets. So the elementary argument which we rejected in the proof of (iii) above 
would in fact suffice to prove the following corollary. 


Corollary 3.2.17 


(i) A geometric morphism is proper iff both halves of its surjection-inclusion 
factorization are proper, iff it can be factorized as a proper surjection 
followed by a closed inclusion. 


(ii) Proper geometric morphisms are closed. 


Proof (i) The first assertion is immediate from 3.2.16; the second follows from 
it and the fact that an inclusion is proper iff it is closed (cf. 3.2.9). 

(ii) It is immediate from (i) (and the propriety of closed inclusions) that if 
f: F — E is proper then the image under f of any closed subtopos of F is a 
closed subtopos of €. But the definition of propriety is stable under slicing, so 
this is sufficient. g 


In seeking a characterization of propriety in terms of sites, we shall make 
the standing assumption that the underlying category of our site is coherent (in 
fact, it might as well be a pretopos, although we shall not assume this), and 
that its coverage contains the coherent coverage P of 2.1.12(d). It turns out 
that, in addition to the sieves in the coherent coverage, we need only consider 
those covers which are generated by directed families of monomorphisms: we 
shall say that a sieve R on an object U of a coherent category C is a dm-sieve 
if, whenever it contains a morphism a: V — U, it also contains the monic part 
im a of a, and in addition the monomorphisms in R form a directed subset of 
Subc(U). (Equivalently we could say that, for any finite subset R’ of R, there 
is a monomorphism m: U’ — U in R such that every morphism in R’ factors 
through m.) And we call a coverage T a dm-coverage if it consists of dm-sieves. 
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We shall also require a ‘compatibility condition’ between the dm-coverage 
and the coherent coverage: we shall say that T is P-compatible if it satisfies the 
following two conditions: 


(i) Given R € T(U) and a cover a: U ->V in C, the sieve on V generated by 
the images of the composites ab, b € R, is T-covering. 


(ii) Given monomorphisms U — W, V — W with W =U U V, and a sieve 
R € T(U), the sieve on W generated by the subobjects (im aU V), a € R, 
is T-covering. 


Lemma 3.2.18 LetC be a (small) coherent category. 


(i) If J is a Grothendieck coverage on C containing the coherent coverage P, 
then the set Ja of all dm-sieves in J is a P-compatible Grothendieck cover- 
age, and J is the join of P and Ja in the poset of Grothendieck coverages 
on C. 


(ii) Conversely, if D is a P-compatible Grothendieck dm-coverage on C, then 
the join P V D(U) consists of all sieves R on U for which there exists 
S € D(U) such that a*(R) € P(V) for eacha: V >U in S. In particular, 
D consists precisely of the dm-sieves in PV D. 


Proof (i) It is clear that Jy inherits the closure properties of a Grothendieck 
coverage from J. For the P-compatibility of Ja, we use the local character of J; 
for each of the sieves asserted by the compatibility conditions to lie in Jy, we 
see that there is a P-covering sieve (that generated by the single morphism a in 
(i), and that generated by U — W and V — W in (ii)) such that the pullback 
of the given sieve along each morphism in the P-covering sieve is J-covering. 
Finally, to show that J is the join of P and Jy, let R be an arbitrary J-covering 
sieve on U. Let S be the sieve on U generated by all monomorphisms U’ — U 
which are the unions of the images of finite subsets of R. Then S is a dm-sieve 
(and in Jy, since it contains R), and the pullback of R along any member of S 
is P-covering. Hence any Grothendieck coverage containing both P and Ja must 
contain J; but J contains P by assumption and Ja by definition, so it must be 
their join. 

(ii) Clearly, the join P V D must contain all the sieves as described; to show 
that it contains only those sieves, we must show that they form a Grothendieck 
coverage — specifically, that they satisfy the local character axiom (L). For this, 
it suffices (since P and D separately satisfy (L)) to show that we can ‘commute 
a D-cover past a P-cover’: that is, if R is a sieve on U, and we can find S € P(U) 
such that a*(R) € D(V) for each a: V — U in S, then we can also find S’ € D(U) 
such that b*R € P(W) for each b: W — U in S’. So suppose S contains a finite 
family (a;: Vi > U | 1 <i < n), such that the union of the images of the a; is 
the whole of U. We define S’ to be the sieve generated by those monomorphisms 
U’ — U for which there exist (V! — V;) € až R for each i such that U” is the 
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union of the images of the V’ — V; => U (1 < i < n); Clearly, S’ is a dm- 
sieve; and the P-compatibility conditions (plus the local character of D, which 
ensures as we saw after 2.1.8 that D(U) is closed under finite intersections) imply 
that S’ € D(U). But the pullback of R along each U’ — U e S(U) contains a 
P-covering family (Vf = U’ | 1 <i < n), as required. 

For the final assertion, suppose R is a dm-sieve on U and belongs to P V D. 
By the result just proved, there exists S € D(U) such that the pullback of R 
along each morphism a: V — U in S contains a finite P-covering family; but 
a* R is a dm-sieve since R is, and hence it must contain ly. So R D S, and hence 
Re DU). ag 


Theorem 3.2.19 For a Grothendieck topos E, the following are equivalent: 


(i) E is compact, i.e. the geometric morphism E — Set is proper. 


(ii) There exists a small site of definition (C,T) for E such that C is a coherent 
category and T is the union of the coherent coverage P and a P-compatible 
dm-coverage D such that the terminal object of C is D-irreducible. 


(iii) There exists a small site of definition (C, J) for E such that C is a coherent 
category and J is the join, in the poset of Grothendieck coverages on C, of 
the coherent coverage P and a P-compatible dm-coverage D such that the 
terminal object of C is D-irreducible. 


(iv) There exists a site (C, J) as in (iii), but with the added assumption that C 
18 a pretopos. 


Proof (i) (iv): Take C to be a small full subcategory of € containing a 
generating set, and closed under finite limits and colimits; then C is a pretopos 
by A2.4.5. Let J be the Grothendieck coverage on C induced by the inclusion 
C — E; then J contains the coherent coverage since the inclusion is a coherent 
functor, and so it has a join decomposition of the required sort by 3.2.18(i). 
Finally, the compactness of € implies that the terminal object has no nontrivial 
J-covering dm-sieves. 

(iv) > (iii) and (iii) > (ii) are trivial. 

(ii) > (iii): Given T = PU D as in (ii), let D be the Grothendieck coverage 
generated by D as in 2.1.9, and let J be the join of P and D in the poset 
of Grothendieck coverages on C. By 2.4.10, 1e is still D-irreducible; also, the 
J-sheaves on C are exactly the functors which are sheaves for both P and D 
(equivalently, for both P and D), so they coincide with the T-sheaves. 

(iii) => (i): Clearly, any subterminal object of Sh(C,.J) is a union of subob- 
jects of the form [(U), where U is subterminal in C (and |: C + Sh(C, J) is the 
canonical functor). So any covering of 1 by subterminal objects of Sh(C, J) is 
refined by a J-covering of 1 by subterminal objects of C; but 3.2.18(ii), plus the 
D-irreducibility of 1 in C. implies that any covering of the latter sort has a finite 
subcover. So € is compact. oO 
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The arguments of 3.2.18 and 3.2.19 are all constructive, and so may be applied 
to internal sites in any topos with a natural number object. As in one of the proofs 
of 2.4.11, the natural number object is needed for the proof of 3.2.19(i) > (iv), in 
order to ‘close off’ the site under finite limits and colimits. However, in the case 
when € is localic over the base topos, we can dispense with the natural number 
object provided we sacrifice condition (iv) of 3.2.19; in this case we simply take C 
to be the (internal) poset of all subterminal objects of E, which is a distributive 
lattice and hence a coherent category. 

Before proceeding to the proof of the pullback-stability theorem, we note a 
result which extends 3.2.19 in the same way that 3.1.20 extended 3.1.19: 


Scholium 3.2.20 Let p: € —S be a bounded proper map, and let (C, J) be an 
internal site for E over S as in 3.2.19(ii), (iii) or (iv). Then p is surjective iff C 
is non-degenerate (i.e. the morphism 0 — 1 in C is non-invertible). 


Proof At first sight, it is rather surprising that the ‘positive’ condition of 
surjectivity should be (constructively) equivalent to a negative condition (non- 
degeneracy) on the site. However, the key point is that any finite object of S (in 
the Kuratowskian sense) is either inhabited or empty (cf. D5.4.5(ii)); so the same 
is true for P-covering sieves, and so the assertion that 0 — 1 is not invertible 
is equivalent to saying that every P-covering sieve on 1 is inhabited. Since the 
same is clearly true for D-covering sieves, the result follows from 2.4.8. g 


Another explanation of why the ‘negative’ condition of 3.2.20 suffices for 
surjectivity is the fact that proper maps are closed (3.2.17(ii)): we should expect 
the negative condition to imply that p has dense image, but we already know its 
image is closed. 

In theory, it should be possible to translate our site condition for propriety 
into a condition on a fibration of sites for it to induce a proper map of toposes, 
and hence to prove a stability theorem for proper maps under cofiltered limits, 
similar to 3.1.22 for open surjections. However, we shall not do this here; having 
already proved the stability of proper localic maps under cofiltered limits in 
3.2.11, we shall be able to deduce the corresponding result for proper maps 
of toposes via the stability of the hyperconnected-localic factorization in 5.1.13 
below. Instead, we now proceed with the proof of the pullback-stability theorem. 


ne 
i 

f 
F ——> 


Theorem 3.2.21 Let 


Mea aS 
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be a pullback square in Top where f is proper, and either g is bounded, or f 
is bounded and € has a natural number object. Then k is proper. If f is also 
surjective, then so is k. 


Proof As in the ‘open’ case, this theorem has two entirely different proofs 
depending on which of f or g we assume to be bounded. 

(i) For the case where f is bounded, we use our site characterization (3.2.19) 
for bounded proper maps: if E has a natural number object, we represent F as 
She(C,T) for an internal site satisfying the hypotheses of 3.1.14(ii), so that T 
is the union of the coherent coverage P and a P-compatible dm-coverage D. It 
is clear from the construction of g#T that it is then the union of g#P and g* D; 
but g# P is simply the coherent coverage on g*C by 2.4.7. It is straightforward to 
verify that g#D inherits the properties of being a dm-coverage and P-compatible 
from D; also that the condition that the terminal object of C is D-irreducible 
‘lifts’ to the corresponding condition for g*C. So (g*C, 9*T) is an internal site 
for H over G satisfying the hypotheses of 3.2.19(ii). 

To deal with the ‘surjective’ case, we have simply to observe that the condi- 
tion of 3.2.20 is preserved under the passage from C to g*C, since it is equivalent 
to saying that there is a pullback square 


0 ————_> C 


(do, dı) 


(07,1) 
1 ——> Co x Co 


(ii) To show that arbitrary proper maps/surjections are stable under bounded 
pullback, it suffices as before to consider separately the cases when g has the form 
[C,E] — E for an internal category C, and when g is an inclusion. In fact, in 
place of the second of these, we shall consider the more general case when g is 
localic. 

First suppose that G is of the form [C, €] for an internal category C in £. 
Since every slice of G is again of this form, it suffices to show that the square 


c.f] 
[f*C, F] ————=> [C, £] 


(FI [f°C, F] ——> [I [C, E] 


satisfies the Beck-Chevalley condition for subterminal objects, whenever I is a 
directed internal poset in [C, £]. But since the forgetful functor [C, E] > E/Co 


C3.2 PROPER MAPS 645 


is regular, it maps I to a directed internal poset (J, say) in €/Co. Now consider 
the diagram 


Pe OF ee BIE] 


Pe ae CE] 


[f*I, F/ f*Co] ———— | ———=> [J, €/Co] 


Lp Oye a CG 


where the horizontal maps are all induced by f,, the diagonal maps are colimit 
functors and the vertical maps are forgetful functors. It is easily verified that each 
of the vertical faces commutes up to isomorphism, and the forgetful functors 
reflect isomorphisms; so, from the fact that the bottom face commutes up to 
isomorphism at subterminal objects, we deduce that the top face does so too. 
The second assertion of the theorem is immediate, since surjectivity is always 
preserved under pullback to a topos of the form [C, E], as we saw in 3.1.24. 

In the case where g: G — € is localic, we may form the hyperconnected— 
localic factorization of f and pull back each factor separately. The pullback of 
the hyperconnected part is hyperconnected by 2.4.11, and hence proper and 
surjective; so we are left with the case when f as well as g is localic. But if 
we translate the problem across the equivalence between localic €-toposes and 
internal locales in €, then the result we require is contained in 3.2.6. q 


Corollary 3.2.22 A geometric morphism is stably right weak Beck-Chevalley 
(in the sense of 2.4.16) iff it is proper. 


Proof One direction was established in 3.2.13. For the converse, since proper 
maps are stable under pullback it suffices to show that they are RWBC. But this 
may be proved in exactly the same way as 3.1.28: given a pullback of a proper 
morphism along a bounded morphism, we may use the hyperconnected-localic 
factorization of each side to factor the pullback square into four quarters, three 
of which have hyperconnected maps on a pair of opposite sides and so satisfy 
the weak Beck—Chevalley condition by A4.6.8. So we are reduced to the localic 
case; but in this case the result we require is contained in 3.2.6. CJ 
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As an application of the Beck-Chevalley characterizations of open and proper 
maps, we prove the following result, which will be of importance in Section C5.1. 


H pass s 

f 

F ———_> 
be a pullback square in Bop such that f is surjective and either open or proper, 
and h is either open or proper. Then g satisfies the same condition as h. 


Proposition 3.2.23 Let 


mea aT S 


Proof The proposition has four separate cases. The two when f and h are 
both open, or both proper, are easy; for in these cases the diagonal of the square 
is open (resp. proper) by 3.1.4(i) (resp. 3.2.16(i)), but k is surjective by 3.1.27 
(resp. 3.2.21), and so g is open (resp. proper) by 3.1.4(ii) (resp. 3.2.16(ii)). 

In the two cases when f and h satisfy opposite conditions, we exploit the 
Beck-Chevalley characterizations of open and proper maps (3.1.28 and 3.2.22). 
The two cases are very similar; we shall prove the one where h is open and f 
is proper, by showing that g is a SUWBC morphism. Suppose given a bounded 
morphism p: €' — E; we may form the cube 


Fe SRE 
r 
8 
h! Herc eG 
k 


S9 
~ 
RS 
Ae) 
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in which all faces are pullbacks. Now the diagram 


g“ fxq = go" Ds fy T rag’ fi 


k,h* qs >—> kys,h™ =r, kh” 


of Beck—Chevalley transformations commutes, and the indicated edges are monic 
because f and f’ are proper and h is open. So g*p, f/(B) — r.g” fi(B) is monic 
for any B € ob F'; but f’ is surjective, since it is a pullback of f, and so for any 
A € ob €’ the unit map A — f! f'*(A) is monic. Hence g*p,(A) — r,g’(A) is 
monic for any A. Thus we have shown that g is a LWBC morphism, but the same 
argument may be applied to any pullback of g, so it is a SEWBC morphism. 

In the other case, we similarly prove that f/p*g. — f'*gir* is monic, and 
then use the fact that f’* reflects monomorphisms. qg 


We have not had very much to say in this section about separated maps 
of toposes, as defined in 3.2.12(b). However, we may now note one interesting 
example: 


Example 3.2.24 For a (discrete) group G, the topos [G, Set] is Hausdorff 
(that is, [G, Set] — Set is separated) iff G is finite. For we have an open sur- 
jection s: Set — [G, Set] whose inverse image is the forgetful functor; and if 
we pull back this morphism against itself, we obtain Set/G (as may easily be 
seen by identifying s with the morphism |G, Set]/G — [G, Set], where G acts 
on itself by left translation). In other words, the pullback of the diagonal of 
[G, Set] along (s,s): Set — [G, Set] x [G, Set} is Set/G — Set, which is mani- 
festly proper iff G is finite. But since (s,s) is an open surjection, 3.2.23 implies 
that Set/G — Set is proper iff the diagonal of [G, Set] is proper. 


For the record, we also note the basic stability properties of separated 
morphisms: 


Lemma 3.2.25 

(i) Composites of separated morphisms are separated. 

(ii) Pullbacks of separated morphisms are separated. 

(iii) If fg is separated and g is a proper surjection, then f is separated. 
(iv) If fg is separated and f is localic, then g is separated. 


Proof (i) Suppose f: F — E and g: G — F are separated. Then the diagonal 
of fg can be factored as the composite 


A 
G2 8G ep GH SG GG. 
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where the second factor is the pullback of Ay along g x g; so the result follows 
from the stability of proper maps under composition and pullback. 

(ii) is immediate from the stability of proper maps under pullback. 

(iii) The composite (Ays)g is equal to (g x g)Az,, which is proper by the 
hypotheses, so the result follows from 3.2.16(ii). 

(iv) If f is localic then Aș is an inclusion by B3.3.8(ii); hence so is the second 
factor of the composite displayed in the proof of (i), and the result follows from 
2.2.18(iii). o 


Note in particular that a morphism is separated iff both halves of its 
hyperconnected-localic factorization are separated, iff the surjective part of its 
surjection—inclusion factorization is separated. (Inclusions are always separated, 
by B3.3.8(iii).) 

Finally in this section, we return to the problem with which we began it: 


Lemma 3.2.26 A geometric morphism f is proper iff it is stably closed (i.e. 
all pullbacks of f along bounded morphisms are closed). 


Proof One direction is immediate from 3.2.21 and 3.2.17(ii). For the converse, 
we note that f is closed iff the localic part of its hyperconnected-—localic factor- 
ization is closed, since a hyperconnected morphism g: F — G induces a bijection 
between closed subtoposes of F and of G. And the hyperconnected-localic fac- 
torization is stable under pullback, by 2.4.12; so if f is stably closed, its localic 
part corresponds to a stably closed map of locales, and is therefore proper by 
3.2.8. Hence f itself is proper. q 


Suggestions for further reading: Moerdijk & Vermeulen [858], Plewe [979], 
Vermeulen [1203]. 


C3.3 Locally connected morphisms 


It will be recalled that in A4.1.5 we defined a geometric morphism f: F — E 
to be essential if its inverse image functor f* has a left adjoint fi as well as its 
right adjoint f,. However, the class of essential geometric morphisms is not as 
well-behaved as one might like. One reason (which we noted in B1.2.4) is that 
although, when we make F and € into €-indexed categories, the adjunction 
(f* 4 f.) is always indexed over E, the adjunction (fi J f*) need not be, 
even though (f/A)* has a left adjoint for every object A of €. Accordingly, 
it is advantageous to study the more restricted class of morphisms f: F —> E 
for which f* has an €-indexed left adjoint: for reasons adumbrated after 1.5.9 
above, we call such morphisms locally connected. 


C3.3 LOCALLY CONNECTED MORPHISMS 649 


Our first result in this section is the analogue over an arbitrary base of 1.5.9: 


Proposition 3.3.1 For a geometric morphism f: F — E, the following are 
equivalent: 


(i) f is locally connected. 


(ii) ‘f* commutes with I1-functors’; that is, for every morphism h: A — B in 
E, the canonical (Beck-Chevalley) natural transformation in the square 


Ip, 
pA sep 


(f/A)* l (f/B)" 


yeh 
F/f'A——> F /f*B 


is an isomorphism. 


(iii) For each object A of E, the functor (f/A)*: EJA — F/f*A is cartesian 
closed. 


Proof (i)e (ii): Since f* always preserves finite limits, condition (ii) is just 
the assertion that f* is continuous as an €-indexed functor E — F (cf. B1.4.13). 
Since both these €-indexed categories are complete and locally small, and E 
is well-powered and has a coseparator by B3.1.13, the result follows from the 
Indexed Adjoint Functor Theorem B2.4.6. 

(ii) > (iii): Given an object h: B — A of €/A, consider the diagram 


h* I, 
€/A ——_> €/B ——"- s E/A 
(f/A)* (f/B)* (f/A)* 
(f*h)* If-p, 


F/F A ———> F¥ /f* B ——> FI f'A 


of which the left-hand cell commutes since f* preserves pullbacks, and the right- 
hand cell commutes by (ii). But the top and bottom composites are the functors 
(—)” and (—)4°” respectively. 

(iii) => (ii) similarly follows from the construction of I]-functors from expo- 
nentials and pullbacks, given in the proof of A1.5.2(i). g 


If we already know that f is essential, then the implication (iii) > (i) of 3.3.1 
may also be deduced from A1.5.8, as we observed in B1.2.4. 
Corollary 3.3.2 


(i) Any stable left Beck-Chevalley morphism (in the sense of 2.4.16) is locally 
connected. 
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(ii) Locally connected morphisms are open. 
(iii) A composite of locally connected morphisms is locally connected. 


(iv) Given a commutative diagram 


E 


where h is locally connected and g is either connected or an open surjection, 
then f is locally connected. 


Proof (i) is immediate from condition (ii) of 3.3.1. 

(ii) If f* commutes with the functors II}, then it commutes with the functors 
Vn which are their restrictions to subterminal objects; so it is a Heyting functor. 
(Alternatively, we may use condition (iii) of 3.3.1 and (v) of 3.1.7.) 

(iii) follows easily from (iii) of 3.3.1, since a composite of cartesian closed 
functors is cartesian closed. 

(iv) In the case where g is connected, we have f* © g,9g* f* = gsh*, so it has 
an €-indexed left adjoint hig*. 

In the case where g is an open surjection, we use 3.3.1 (iii); since the 
hypotheses are stable under slicing by an object of E£, it is sufficient to 
show that f* is cartesian closed. Consider the exponential comparison map 
0a B: f*(B4) > f*BF4, where A and B are objects of €. We have a 
commutative diagram 


*(@ 
p (B4) LABI gf Br4) 


h* Bra 


where the diagonal arrow is an isomorphism since h* is a cartesian closed functor, 
and the vertical arrow is monic by 3.1.7(iv) since g is open. It follows easily that 
g*(@4,8) is an isomorphism; but g* reflects isomorphisms since g is surjective. 


O 


Once we have proved that locally connected morphisms are stable under 
pullback, we shall be able to establish the converse of 3.3.2(i). 

We recall from 1.5.7 that f is said to be connected if f* is full and faithful. 
For locally connected morphisms, there is a particularly simple criterion for 
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connectedness: 


Lemma 3.3.3 Let f: F — E be a locally connected geometric morphism. Then 
f is connected iff the left adjoint fı of f* preserves 1. 


Proof If f is connected, then the counit of the adjunction (f, 4 f*) is an 
isomorphism, so fi(1) = f,f*(1) = 1. The converse is immediate from A1.5.9(i), 
since f* is cartesian closed. o 


We call a geometric morphism f: F — E a local homeomorphism if there is an 
object A of € and an equivalence F ~ €/A identifying f* and f, with A* and I4 
respectively (cf. A2.3.8). Clearly, a local homeomorphism is locally connected, 
since A* is a logical functor (A2.3.2) and so commutes with II-functors. 


Lemma 3.3.4 Connected morphisms are orthogonal to local homeomorphisms 
in Top; that is, given a commutative square 


GPG 


F ————— E 


where h is connected and g is a local homeomorphism, there exists (uniquely up 
to unique 2-isomorphism) a morphism l: F — G making both triangles commute. 


Proof Suppose G ~ E/A as an E-topos. Then by B3.2.8(b) the geometric 
morphism k corresponds to a morphism 1 — h* f*(A) in H; but h* is full and 
faithful, so this derives from a unique morphism 1 — f*(A) in F, which in turn 
corresponds to the required geometric morphism I. O 


Proposition 3.3.5 
(i) Any locally connected morphism can be factored, uniquely up to equiv- 
alence, as a connected locally connected morphism followed by a local 
homeomorphism. 


(ii) A geometric morphism f: F — E is locally connected iff, for each object 
B of F, the composite F/B — F — E may be factored as a connected 
morphism followed by a local homeomorphism. 


Proof (i) Given a locally connected morphism f, we define A = fi(1); then 
we certainly have a factorization fi = L4gi, where g:(B) = fi(B — 1). And g 
has a right adjoint g* given by the composite 
(f/A)* ni 
SA SFiS se 
where m:1— f*fi(1) is the unit map (cf. A4.1.13). But this composite is the 
inverse image of a locally connected geometric morphism, since each of its factors 
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is; and the resulting morphism F — €/A is connected by 3.3.3. This establishes 
the existence of the factorization; the uniqueness follows from 3.3.4. 

(ii) If f is locally connected, then the existence of the factorization follows 
from (i) and 3.3.2(iii).. Conversely, suppose the factorization exists for any B € 
ob F; let us write f,B for the object of E yielding the factorization F/B — 
E/f,\B — E of F/B — E. We claim first that fı defines a functor F — €, left 
adjoint to f*. For, if A is any object of E, then morphisms B — f*A in F 
correspond to (isomorphism classes of) geometric morphisms F/B — F/f*A 
over F, by B3.2.8(b); but these in turn correspond to geometric morphisms 
E€/f,B — E/A over E, by 3.3.4, and hence to morphisms f,B — A in £. 

Now the hypothesis that the factorizations exist is clearly ‘stable under slic- 
ing’; so, to prove that f is locally connected, it suffices to show that f* is 
cartesian closed, or equivalently (by A1.5.8) that the Frobenius reciprocity con- 
dition fi(B x f*A) = fıB x A holds. But it is easy to see that the induced 
morphism F/(B x f*A) — E/(fıB x A) is connected, since it is obtained by 
slicing F/B — E/fıB over the object (mı: fiB x A — fB) of its codomain. So 
Frobenius reciprocity follows from the uniqueness of the factorization. O 


We say an object A of (the domain of) a locally connected S-topos p: E — S is 
connected if pA = 1; equivalently, if the composite geometric morphism €/A — 
E€ — S is connected as well as locally connected. From the fact that p is an 
S-indexed left adjoint, we may deduce what is really a reformulation of 3.3.5(ii): 


Lemma 3.3.6 Ifp: E€ — S isa locally connected topos over S, then every object 
of E is expressible as an S-indexed coproduct of connected objects. 


Proof Given A € ob E, let I = pA, and let n4: A — p*I be the unit of 
the adjunction. Clearly, A is the J-indexed coproduct ©7(n4); we claim that 
(p)! maps na to the terminal object of S/I, so that it may be regarded as an 
I-indexed family of connected objects. For this, we note that if x: J — I is any 
object of S/I, then morphisms n4 — f*z in €/f*I correspond to morphisms 
fA — J in S which are sections of z, i.e. to morphisms 1; — z in S/I. 0 


Thus we may think of pA, for a general object A of €, as ‘the object of 
connected components of A’. Saying that A is connected is equivalent to saying 
that it has no nontrivial ‘€-indexed decompositions’; that is, any morphism 
A — p*I factors through p*z for some x: 1 — J in S. Hence the decomposition 
of 3.3.6 is unique. 

From either version of 3.3.2(iv), we know that the localic part of the 
hyperconnected-localic factorization of a locally connected morphism f is locally 
connected; it is even easier to see that it inherits connectedness from f. How- 
ever, there seems to be no reason in general why the hyperconnected part of a 
locally connected morphism should be locally connected; we do not have a coun- 
terexample, but we digress briefly to mention an example of a hyperconnected 
morphism which is not locally connected. 
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Example 3.3.7 Consider the morphism y: Sh(C) — Set, where C is the site 
derived from a small full subcategory of Sp as in A2.1.11(d). It is easy to see 
that Sh(C) has just two subterminal objects, and hence y is hyperconnected 
by A4.6.6(v). However, if y were essential, then for each object X of C the 
functor A + y*A(X) would be representable (by y/(X), where l: C > Sh(C) 
is the Yoneda embedding); and it is easily verified that y* A(X) is the set of 
locally constant functions X — A, so that this functor fails to preserve infinite 
products if the space of components of X is not discrete. So, provided C contains 
a space whose space of components is not discrete, Sh(X) is not locally connected 
over Set. 


In the converse direction, a connected and locally connected morphism need 
not be hyperconnected (consider sheaves on a connected and locally connected 
space, such as R); but we shall see in 3.5.4(i) that, if we replace local con- 
nectedness by the stronger condition of being atomic, then the implication 
does hold. 

In A4.2.7(b) and 3.1.2, we gave necessary and sufficient conditions on a 
functor f:C — D between small categories for the corresponding geometric 
morphism [C,Set] — [D,Set] to be surjective (respectively open). It would 
clearly be convenient to have similar conditions for connectedness and local 
connectedness. In fact it seems hard to give conditions which are both neces- 
sary and sufficient; but there are sufficient conditions which are often useful. 
In order to state them, we introduce an auxiliary definition: given f:C — D, 
we define Exts to be the category whose objects are quadruples (U,V,r,7) 
where U € ob C, V € ob D, r: fU — V andi: V — fU satisfy ri = ly, and 
whose morphisms (U,V,r,i) — (U', V',r',i') are pairs (a: U + U',b:V > V') 
such that r'(fa) = br and ib = (faji. We shall write g: Exts — D for 
the obvious functor sending (U,V,r,i) to V, and Ext,s(V) for the fibre of 
this functor over a particular object V (we call this the category of f- 
extracts of V, the word ‘extract’ being used in this context as the opposite 
of ‘retract’). 

We note that the conditions of A4.2.7(6) and 3.1.2 can both be formulated 
in terms of the notion of f-extract: the geometric morphism [C, Set] — [D, Set] 
induced by f is surjective iff Exts(V) is nonempty for each V € ob D, and open 
iff g has the ‘left lifting property’ that, given b: V — V’ in D and an object 
(U, V,r,i) of Exts mapped by g to the domain of b, we can find a morphism 
of Exts whose domain is (U,V,r,i) and which is mapped by g to b. (Actu- 
ally, the condition for openness was stated in 3.1.2 as a special case of this, 
in which the domain of b was assumed to be of the form fU and its lifting 
to Exts was of the form (U, fU,lsu,lyu). However, the general case follows 
from this one: given b: V — V’ and (U,V,r,i) as above, a lifting of the com- 
posite br in the sense of 3.1.2 is automatically a lifting of b in the sense just 
given.) 
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Lemma 3.3.8 Let f:C — D be a functor between small categories. 


(i) If g: Exts — D is surjective on morphisms and, for each V € ob D, 
the category Exts(V) is connected, then the induced geometric morphism 
IC, Set] — [D, Set] is connected. 


(ii) Suppose that, for each b: V — V’ in D and each lifting of V to an object 
(U,V,r,t) of Exts, the category of liftings of b (whose objects are liftings 
of b to a morphism of Ext, with domain (U, V,r,i), and whose morphisms 
are morphisms of Exts(V’) forming commutative triangles) is connected. 
Then |C, Set] — [D, Set] is locally connected. 


Proof (i) Let S,T:D 3 Set be two functors and ¢: f*S — f*T be a natural 
transformation. We have to show that there is a (unique) natural transformation 
0: S —T with f*0 = ¢; but the uniqueness follows from A4.2.7(b), since the 
connectedness of Ext s(V) implies that it is nonempty. To show existence, let V 
be any object of D and choose an object (U, V,r, i) of Ext, lying over it; then 
we define @y to be the composite 


sv 5 sgu) 22> rou) > rv: 


the connectedness of Exts(V) (plus naturality of ¢) is exactly what we need 
to prove that this definition is independent of the choice of U, r and i. To 
verify that @ is natural, we need the other hypothesis: given b: V — V’, let 
(a,b): (U,V,r,7) = (U',V’,r’,7’) be a morphism of Ext; lying over it, and 
consider the diagram 


sv 4s sgu) 2 tify) -T> Tv 


Sb S(fa) T (fa) Tb 


“7 Qr 1 
svi 2s sgun > run T rv’ 


where the central square commutes by naturality of ¢, and the other two by 
functoriality of S and T. 

(ii) We note first that the hypothesis is stable under slicing: if we have a 
pullback square 


1 
EEN 
d e 
f 
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where e (and hence also d) is a discrete opfibration, then f’ inherits the condi- 
tion from f. For in this case it is easy to see that a lifting of an object (U, V,r,7) 
of Exty to one of Extp is uniquely specified by giving an element z of T(V), 
where T: D — Set is the functor corresponding to e, since the element of T(fU) 
corresponding to the lifting of U to an object of C’ must be Ti(z); and simi- 
larly for morphisms of Ext. (In other words, Extp — Ext, is also a discrete 
opfibration, and is simply the pullback of e along g.) Now, given a morphism 
b: (V, £) > (V',Tb(z)) of D’ and a lifting of its domain to Exts, the category 
of liftings of b to Ext p is easily seen to be isomorphic to the category of liftings 
of its image in D to Ext,. 

Hence, by 3.3.1 (iii), it suffices to show that the hypothesis of (ii) implies 
that f*:[D,Set] — [C,Set] is a cartesian closed functor. We recall the con- 
struction of exponentials in [D, Set] from A1.5.5: given functors S,T: D 3 Set 
and V € ob D, TS(V) is the set of natural transformations D(V,—) x S > T. 
In terms of this description, it is easy to see that the comparison map 
Osr: f*(T%) — f*TFS sends a natural transformation a: D (fU, —) x S >T 
to the transformation 8: C(U,—) x f*S — f*T given by By (a: U — U’, z) = 
afu’(fa,z). Thus we have to show that, given 8:C(U,—) x f*S > f*T, 
we can uniquely reconstruct a from it. But this is easy: given b: fU > V 
in D and z € S(V), we define ay(b,r) = (Tr)Gy(a,(Si)r), where 
(a,b): (U, fU, 1zu, lsu) — (U', V,r,i) is any morphism of Ext, lying over b. The 
connectedness of the category of liftings of b ensures that this is independent of 
the choice of lifting; moreover, it is natural, since if we are given b: V > V’, 
then we may lift it to a morphism (a’,b’): (U’,V,r,i) = (U",V',r’,i’) of Ext, 
and we have 


(Tb')ay(b, 2) = (Tb')(Tr) By: (a, (Si)r) 
= (Tr’)(T(fa’)) Bu: (a, (Si)z) 
= (Tr')By»(a'a, $(fa")(Si)z) 
(Tr’) Bur (a'a, (Si’)(Sb')r) 
ay: (b'b, (Sb’)z) . 


Finally, if b happens to be of the form fa, then we may take (a,b) as our lifting 
of it, and thus deduce that (65 r)u(a@) = 2. o 


We note that the arguments in the proof of 3.3.8 are constructive, and so 
can be applied to functors between internal categories in any topos S. The same 
applies to the following example (which we shall meet again in 5.2.7), provided 
S has a natural number object. 


Example 3.3.9 Let D be the category of finite cardinals (that is, sets of the 
form [p] = {0,1,...,p — 1} for some p € N; cf. A2.5.14) and arbitrary maps 
between them, and let C be the poset of finite partial equivalence relations 
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(that is, symmetric and transitive relations) on N, ordered by inclusion. We 
have a functor f: C — D which sends a relation R to the cardinality of the set 
of R-equivalence classes (for definiteness, we suppose that the classes are listed 
in increasing order of their greatest members); clearly, if R C S, then each 
R-equivalence class is contained in a unique S-equivalence class, which explains 
how f is defined on morphisms. We claim that f satisfies the hypotheses of 
3.3.8(i) and (ii), so that the geometric morphism [C,Set] — [D, Set] which it 
induces is connected and locally connected. 

To prove this, first suppose b: [p] — [q] is a morphism of D. Let [p] denote 
the relation consisting of all pairs (j, j} with j < p; clearly, fp] = [p]. We define 
a larger relation R to consist of all pairs (j, 7) with j < p+4q, all pairs (j, p+0(j)) 
and (p+ b(j),7) with j < p, and all pairs (j,k) where j,k < p and b(j) = b(k). 
It is then clear that f(R) = [q] and that f maps the inclusion [p] C R to b; so 
we have shown that f is surjective on morphisms, from which it easily follows 
that g: Ext s — D is also surjective. 

To verify the other condition of 3.3.8(i), let (R, [p], r, i) be an arbitrary object 
of Ext([p]). We shall show that it can be connected in this category to the 
object ([p], [p], lip]; lip). To do this, first choose a natural number n which is 
greater than any number involved in the relation R, and let R’ be the relation 
obtained from R by adding all pairs (j,j) with n < 7 < n + p and also all pairs 
(n+ j,k) and (k,n + J) where j < p and k is in the i(j)th equivalence class of 
R. Since 7 is injective, it is easy to see that R’ is (symmetric and) transitive, 
and that the inclusion R C R’ lifts to a morphism (R, [p],r,2) — (R’, [p], r,i") 
of Ext,([p]), where r’ and 2’ are related to r and 7 by a suitable permutation of 
the set f(R) = f(R’). Now let S be the relation consisting of all pairs (j, j} with 
n <j < n+p; then the inclusion S C R’ is mapped by f to 7’: [p] — f(R’), 
and hence defines a morphism (S, [p], lip), 1p) > (R', [p], r", i) in Ext s([p]). But 
we may connect the domain of this morphism to ([p], [p], lip); 1,,)) by similarly 
enlarging S to a relation S’ which contains all pairs (j, n + j) and (n+ j,j) with 
j < p, and then shrinking down again to [p]. 

The verification of the condition of 3.3.8(ii) is similar, and we shall omit the 
details. 


Next, we consider how local connectedness can be characterized in terms of 
sites. We saw in Section C3.1 that openness corresponds to the condition ‘all 
covers inhabited’; in a similar way, local connectedness corresponds to ‘all covers 
connected’. Given a sieve R on an object U of a category C, we say R is connected 
if it is connected as a full subcategory of C/U, i.e. if it is inhabited and any two 
morphisms fı, f2 E€ R can be linked by a zigzag of the form 


U — > he Vu o hey 
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with all the f; in R. And we say a site (C,T) (where T is assumed to be sifted, 
but not necessarily a Grothendieck coverage) is locally connected if every cover in 
T is connected. As usual, it is clear that this definition makes sense for internal 
sites in an arbitrary topos S. 


Theorem 3.3.10 Let p: E — S be a bounded geometric morphism. Then p is 
locally connected iff there exists a locally connected internal site (C,T) in S such 
that E ~ Shs(C,T) in Lop/S. Moreover, p is connected and locally connected iff 
there exists a locally connected site of definition for E whose underlying category 
C has a terminal object. 


Proof We shall, as usual, argue as if in the case S = Set; but we note that all 
the arguments used are constructive, and so can be internalized in an arbitrary 
base topos S. 

First suppose € = Shs(C,7) where (C,T) is a locally connected site. We 
claim first that every constant functor C°? — S is a T-sheaf, i.e. that the inverse 
image functor C*: S — [C°?, S] factors through the inclusion E€ — [C°?,S] - 
the resulting factorization being the functor p*, of course. (Strictly speaking, we 
ought to write C°P* rather than C*; but we hope the reader will forgive this abuse 
of notation.) So consider the constant functor C* A whose value is an object A of 
S; to say that C*A is a T-sheaf is to say that , for each T-covering sieve R on an 
object U of C, the unit map A — limRrR*A is an isomorphism, where R is the 
full subcategory of C/U corresponding to R. But for connected R this follows 
from B2.5.6. 

Now C* has a left adjoint lime, which is S-indexed by (a special case of) 
B2.3.21; so we obtain the desired S-indexed left adjoint for p* simply by restrict- 
ing this functor to the full subcategory Shs(C,T). And if C has a terminal 
object 1, then the unit map A — p,p*(A) may be identified with the unit map 
A — limeC*A = C*A(1), which is an isomorphism; so p is connected. 

Conversely, suppose p is locally connected. Then, as we saw in 3.3.6, every 
object of € is expressible as an S-indexed coproduct of connected objects; so, 
given a bound B for € over S, the family of all connected subobjects of B is a 
separating family for € as an S-indexed category, and we can use it to form an 
internal full subcategory C of € which, when equipped with the coverage induced 
from the canonical coverage on €, will yield a site of definition for € over S. We 
must show that this site is locally connected. Clearly, every cover of an object of 
C must be inhabited, since connected objects are positive. And if R is any cover of 
an object U of C, we observe that if fı: V; — U and fo: V2 — U are morphisms 
in R such that the pullback V; xy Vz (in £) is inhabited, then they must lie in 
the same connected component of R, for some connected subobject of B must 
map to the pullback. Hence, if we write {S; | i € I} for the set of connected 
components of R, and U,, for each i € J, for the subobject (in €) of U which is 
the union of the images of all the morphisms in S;, then the U; form a pairwise- 
disjoint covering of U by subobjects; that is, we have a well-defined morphism 
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U — p*I in € which sends U; to the element i. So by the remark after 3.3.6 one 
of the U; must be the whole of U, and the corresponding S; is the whole of R. 
And, once again, if p is connected as well as locally connected, then the site just 
described will contain the terminal object of €, provided it occurs as a subobject 
of B, and we can easily ensure the latter. o 


Examples 3.3.11 (a) If a category C satisfies the right Ore condition 
(cf. A2.1.11(h)), then any inhabited sieve on an object of C is connected. Hence 
we may conclude from 3.3.10 that, for such a C, any dense subtopos of [C°?, Set] is 
locally connected. (As stated, this example is dependent on classical logic in Set, 
since denseness of a subtopos of [C°?, Set} is equivalent to saying that covers are 
nonempty rather than inhabited; but we can constructivize it by re-interpreting 
the word ‘dense’ in the strong ‘fibrewise’ sense introduced in 1.1.22.) 

(b) It is clear that a sieve generated by a single morphism is always connected; 
hence the site obtained by equipping a regular category with its regular coverage 
(2.1.12(b)) satisfies the conditions of 3.3.10. In view of D3.1.4, this says that the 
classifying topos of any regular theory is connected and locally connected. 

(c) We revert to an example that we have considered already (for the case 
S = Set) in A2.1.11(g) (and see also D4.1.9). In the present context, let us 
take an arbitrary object J of S (which is here assumed to have a natural number 
object); let P be the poset of all partial maps f: N — I with finite domain (which 
may be interpreted as meaning either that the domain of f is a finite cardinal, 
considered as a subobject of N, or that it is isomorphic to a finite cardinal; it 
does not make any essential difference), ordered by the relation ‘is an extension 
of’. We impose a coverage on P by saying that, for each f : P and each i: J, 
the set 


{g: P | (g extends f) and (i € im g)} 


covers f. In A2.1.11(g) we verified that all such covers are connected (in the 
case S = Set, but the arguments used there are constructive); and P clearly has 
a top element, namely the empty partial map. So from 3.3.10 we may deduce 
that Shs(P,T) is a connected and locally connected S-topos. We note also that 
this topos is the classifying topos (over S) for the theory of partial surjections 
N — I; that is, the intersection of the propositional theories which we considered 
in 1.2.8 and 1.2.9. 


As usual, we may convert (the connected case of) 3.3.10 into a characteri- 
zation of connected locally connected morphisms in terms of fibrations of sites. 
In order to state the condition, we need an ad hoc definition: given a functor 
F:D—C, a sieve S on an object V of D and a coverage J on C, we shall say 
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that R is J-locally connected if, whenever we are given a commutative square 
a2 
U —> FY, 


Qa, Fb2 


Fb, 
FY, ——> FV 


with both bı and bz in S, we can find a J-covering sieve R on U such that, for 
each U’ — U in R, we can connect bı to bz in S by a zigzag of morphisms, and 
also find a cone over the image of this zigzag under F with vertex U’, connecting 
the given morphisms U’ — U — FY, and U’ — U = FV}. 


Proposition 3.3.12 Let f: F — E be a geometric morphism between bounded 
S-toposes. Then f is connected and locally connected iff it can be induced by a 
fibration of sites (P,T): (D, K) — (C,J) in S such that P preserves covers and 
every K -covering sieve is J-locally connected. 


Proof First suppose f is connected and locally connected. Choose any site 
(C, J) for € over S; then choose a locally connected site (B, H) with a terminal 
object for F over E. If we externalize the latter to a site (D, K) in S as in 2.5.7, 
then the condition (in the internal logic of £) that every H-covering sieve is 
(inhabited and) connected translates straightforwardly into the given conditions 
on K. 

Conversely, suppose f is induced by a continuous fibration satisfying the 
conditions. We claim that if A is any J-sheaf on C, then the composite Ao P is 
a K-sheaf: to see this, suppose given a K-covering sieve S on an object V and 
a compatible family of elements (zẹ E€ A( PV’) | b: V’ = V € S). We must show 
that this family can be extended to a compatible family for the (J-covering) sieve 
on PV generated by the Pb, b € S. But this is precisely the statement that, if 
we are given any commutative square 


a2 
U ———> FV: 
ay F be 


FV, ———> FV 


with bı and bz in S, then A(a)(xrp,) = A(az)(Zp,); so the assertion that U can 
be covered by morphisms U’ — U for which we have a zigzag as above (plus the 
information that A is a J-sheaf) is precisely what we need to verify this. 

It follows that in this case the inverse image f* is simply the functor ‘compose 
with P’ applied to J-sheaves; since we also know that f, is given by composition 
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with T, and since PT = 1c, it follows that f, f* = 1e, ie. f is connected. But 
since P is a fibration, it is also easy to see that it satisfies the dual of the condition 
of 3.3.8(ii) (for the category of liftings of a given morphism has a terminal object, 
namely the prone lifting); hence composition with P commutes with I-functors 
as a functor [C°?,S] — [D°®, S]. But the inclusions Shs(C, J) — [C°?, S] and 
Shs(D, K) — [D°?, S] are cartesian closed by A4.2.9, and hence also commute 
with II-functors; hence (—) o P commutes with []-functors as a functor between 
the sheaf toposes, i.e. f is locally connected. C 


Corollary 3.3.13 Let 


f3 f fi 
SS 


aS RL ORE 


be an inverse sequence in BYop/S, where each fi is connected and locally con- 
nected. Then the legs fZ: Ex. —> En of the limit cone in BYop/S are connected 
and locally connected. 


Proof This is not as simple as it seems, since it is not immediately obvious 
that the condition of 3.3.12 is stable under composition of fibrations of sites. 
Once one has verified that it is (which is straightforward but tedious), it is then 
easy to see that the same condition is inherited by the limit fibrations of 2.5.11. 
We omit the details, which may be found in [828]. O 


As an application of 3.3.10, we prove an interesting extension of A4.5.18, 
which will be needed in the next section. 


Theorem 3.3.14 Let f: F —€ be a bounded geometric morphism, and j a 
local operator on €. Let k be the local operator on F making 


sh (F) —> sh; (£) 


F —— E 


a pullback (cf. A4.5.14(e)). Then the corresponding subobject K — Nr is simply 
the upward closure of the image of the composite 


o 
f* J >—> f'Qe ay NF. 


Proof First we prove the result under the assumption that f is locally con- 
nected as well as bounded. It is clear that K contains the upward closure of the 
image of f*J — Qz, so it suffices to prove that the classifying map of the upward 
closure (K’, say) is a local operator. Suppose F ~ She(C, T), where (C,T) is a 
locally connected site; then f*OQ¢ and f*J are simply the appropriate constant 
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diagrams on C°P, by the proof of 3.3.10. On the other hand, Qy is the internal 
diagram corresponding to the functor C°? — E which (in set-theoretic terms) 
sends an object U of C to the set of T-closed sieves on U; and the comparison 
map (¢1)y sends a truth-value p: Qe to the sieve {a: V — U | p} (note that 
such sieves are indeed T-closed, since T-covering sieves are inhabited). Hence 
a T-closed sieve R belongs to K’(U) iff there exists a T-covering sieve S and 
a J-dense truth-value p such that (p = [a € R]) holds for all a € S; but this 
implies that the truth-value [ly € R] is j-dense. It now follows easily that a 
subobject B’ => B in F belongs to the class =(K’) (that is, its classifying map 
factors through K’ — QF) iff the sentence 


(VU: Cy)(Vx: B(U))([z € B'(U)] € J0) 


is valid in E. Using this, it is trivial to verify that =(K’) satisfies the conditions 
of A4.5.11 (iii), and hence that the clasifying map of K’ is a local operator. 

Now consider the general case. Let G be the topos obtained by glueing along 
the cartesian functor f*: E — F, that is the category whose objects are triples 
(A,B,a) with a: B — f*A. Then we have a closed inclusion h: F — G given 
by h*(A,B,a) = B and h,(B) = (1,B,B — 1) (cf. A4.5.6); but it is easy 
to check that we also have a locally connected geometric morphism g: G —> € 
defined by setting g*(A) = (A, f*A, 1-4), 9 (A, B,a) = A and g,(A, B,a) to 
be the pullback of 

f.B 


f.(a) 


A 
A a ffA 
where 7 is the unit of (f* 4 fx). Clearly, the composite gh is isomorphic to 
f. Also, g is bounded if f is: if G is a bound for F over E, then the object 
H = (1,G,G — 1) may be verified to be a bound for G over £, as follows. Given 
an arbitrary object (A, B,a), choose a diagram 


B B>—"s fcxG 
where we may assume (on replacing C by C I 1, if necessary) that C is well- 
supported. Now we have a diagram 


oe (1,(ae,m) 
(A x C, B, (ae, nrim)) oe (Ax C,f*(A x C) x G,m) =g*(AxC)x H 


(71, e) 


(A, B, a) 


of the required form in G. 
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Thus we know that the conclusion of the theorem holds for g, by the first 
part of the proof. But it also holds for h, by A4.5.18(ii); and it is easy to see 
that the conclusion, regarded as a property of geometric morphisms, is ‘stable 
under composition’, since inverse image functors preserve everything involved in 
constructing the upward closure of the image of a morphism into a poset. So the 
conclusion holds for f. O 


Next, we consider stability of (local) connectedness under pullback. As previ- 
ously for openness and propriety, there are two versions of the pullback-stability 
theorem, depending on which side of the pullback square we assume to be 
bounded. One version follows readily from 3.3.10: 


Theorem 3.3.15 Let 


ale F 


H 
g 
G ——> € 


be a pullback square in Lop with f locally connected, and either f or g bounded. 
Then k is locally connected, and the Beck-Chevalley natural transformation 
f*g« > h,k* is an isomorphism. Moreover, if f is also connected, then so is k. 


Proof (i) For the case when f is bounded, we have to verify that the property 
of being a locally connected site is preserved under change of base along g, as we 
defined it in Section C2.4. This is not quite automatic, since connectedness of a 
sieve (unlike inhabitedness) is not a first-order property. However, it is definable 
in terms of a coequalizer diagram, and g* preserves coequalizers; hence, if R is a 
connected sieve on an object U of an internal category C in E, we see that g*R 
is a connected sieve on g*(U) in g*C. It follows that, if T is a coverage on C all 
of whose sieves are connected, then g*T also consists of connected sieves; so the 
pullback-stability result is established. And if C has a terminal object, then so 
does g*C; so the connectedness assertion is also immediate. 

For the Beck-Chevalley condition, we consider first the case when f has 
the form [C°?,€] — E; then H is the topos [g*C°?,G], and h, is the functor 
‘apply gẹ to discrete fibrations over g*C, and then pull back along the unit map 
C — g,g*C’. But if we apply this to a constant diagram (g*C)*B, where B is 
an object of G, we obtain the discrete fibration C x g,B — C; thus we see that 
f*g» — h,k* is an isomorphism in this case. In general, when F is a subtopos 
of [C°P,€], we have seen that f* is simply the factorization of the ‘constant 
diagram’ functor through the subcategory Sh¢(C, T), and similarly for k*; and 
h, is similarly the restriction of the direct image functor [g*C°?,G] — [C°P, E] to 
sheaves, by A4.3.12. So the Beck—Chevalley condition in the general case follows 
from the particular one already established. 
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(ii) For the second version of the theorem, we shall as usual deal separately 
with the cases when g has the form [C, E] — € for an internal category C in €, 
and when g is an inclusion. 

If g has the form [C, €] — E, we recall from B2.3.12(a) that any €-indexed 
category extends in a canonical way to a Cat(€)-indexed one: this extension 
sends the canonical indexing E of € over itself to the pseudofunctor C + [C, €], 
and the €-indexed category F to C + [f*C, F]. It is clear that €-indexed func- 
tors and adjunctions similarly extend; so, from the fact that f*: E — F has an 
€-indexed left adjoint, we deduce that it has a Cat(€)-indexed one. But if we 
now restrict the latter indexing to the full subcategory of Cat(€)/C consisting 
of discrete opfibrations, we obtain a [C, €]-indexed left adjoint for [C, f]* = k*. 
Also, the Beck—Chevalley condition in this case is equivalent to saying that the 
natural map kih* — g* fi is an isomorphism; but this is just part of the assertion 
that fi is a Cat(€)-indexed functor. 

In the case when g is an inclusion, let j be the corresponding local operator 
on €, and j’ the pullback local operator on F (cf. A4.5.14(e)). We claim that in 
this case the functor f* maps j-sheaves to 7’-sheaves, so that k* is simply the 
restriction of f* to sh;(€). For if A is a j-sheaf in E£, then by A4.4.10 the canon- 
ical morphism Y4: J*A — II4A (the transpose of the isomorphism d*J*A S A, 
where d: 1 — J is the generic j-dense monomorphism) is an isomorphism in €/ J. 
Since f* commutes with II-functors, we deduce that we have an isomorphism 


(FIY (FA) —> Up a( f" A) 


in F/f*J. Unpacking this as in A4.4.10, we deduce that f*A satisfies the sheaf 
condition for all pullbacks of f*d: 1 — f*J in F; hence the largest local operator 
on F for which f*A is a sheaf (cf. A4.5.15) has the property that f*m is dense 
for it whenever m is a j-dense monomorphism in €. But j’ is by definition the 
smallest local operator with this property; so f*A must be a sheaf for it. It now 
follows easily that k*: sh; (€) — sh; (F) has a left adjoint, namely f: followed 
by the reflector E — sh; (E); and the fact that this adjunction is sh, (€)-indexed 
follows readily from the fact that (fı 4 f*) is €-indexed. And the Beck—Chevalley 
condition in this case is just the assertion that k* coincides with the restriction 
of f* to sheaves. 

For the final assertion of the theorem, we note that the Beck—Chevalley con- 
dition transposes to yield an isomorphism kih* & g* fi; hence if fı preserves 1 
then so does kı, and the result follows from 3.3.3. O 


Corollary 3.3.16 A geometric morphism is stably left Beck-Chevalley (in the 
sense of 2.4.16) iff it is locally connected. 


Proof One direction was proved in 3.3.2(i); the converse follows from 3.3.15. 0O 


Example 3.3.17 In the absence of local connectedness, (global) connected- 
ness of geometric morphisms is not in general preserved under pullback. For a 
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counterexample, consider the pullback diagram 


T ———> X 
f 
S——> Y 


in Loc, where X is the unit interval [0, 1]with its usual topology, Y is the same 
set with the topology whose only nontrivial open sets are the intervals (t, 1] 
(0 < t <1), S is the subspace {0} U (4, 1] of Y, f is the inclusion and g is the 
identity mapping. Since S is complemented as a sublocale of Y, being the union 
of an open and a closed sublocale, it follows from 1.2.13 that the pullback T in 
Loc coincides with the pullback in Sob — that is, it is simply S retopologized 
as a subspace of X. In particular, it is disconnected, but S is connected; hence 
the identity mapping T — S cannot induce a connected geometric morphism. 
However, we claim that the geometric morphism Sh(X) — Sh(Y) induced by g 
is connected. 

To see this, note first that if F — Y is any local homeomorphism, then 
its pullback E — X along g is simply obtained by retopologizing the space F 
with the appropriate finer topology. Moreover, if we regard F and Y as posets 
under the usual specialization ordering, then the projection F — Y is a discrete 
opfibration (since each point of F has a neighbourhood mapping bijectively onto 
an open set in Y); and the open sets in F are exactly those which are open in Æ 
and upper sets in the specialization ordering. Using these facts, it is easy to see 
that if F” — Y is another local homeomorphism and h: F — F’ is a mapping 
over Y, then h is continuous as a mapping F — F” iff it is continuous as a 
mapping E — E’. So g* is full and faithful. 


Finally in this section, we fulfil a promise made in Section B4.5, by charac- 
terizing the algebras for the ‘measure topos’ monad M on SBYop/S by a linear 
(rather than pointwise) cocompleteness condition, in the sense of B1.1.16. 


Proposition 3.3.18 Let S be a topos with a natural number object. Then a 
bounded S-topos ıs an M-algebra iff it is linearly B-cocomplete, where B 1s the 
class of locally connected morphisms between bounded S-toposes. 


Proof In B4.5.13, we saw that a topos E is an M-algebra iff it is pointwise 
cocomplete with respect to the class A of S-essential geometric morphisms. By 
B1.1.18, this is equivalent to saying that E is linearly cocomplete with respect 
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to the class of morphisms which appear as the top edges of comma squares 


(flg) —— H 


G —— F 


whose bottom edges are in A. Since all such morphisms are locally con- 
nected by 2.3.17, it is immediate that ‘linearly B-cocomplete’ implies ‘pointwise 
A-cocomplete’. Conversely, if E is an M-algebra then it is certainly B-cocomplete, 
since B C A; the linearity condition follows from the Beck—Chevalley condition 
for pullbacks of locally connected morphisms (3.3.16), just as the pointwise con- 
dition for A-cocompleteness followed from that for comma squares in the proof 
of B4.5.13. O 


Suggestions for further reading: Barr & Paré [84], Moerdijk [828]. 


C3.4 Tidy morphisms 


As we noted after 1.5.5, the condition that the direct image functor Sh(X) — Set 
should preserve directed unions of subterminal objects, which is equivalent to 
compactness of the locale X, seems less ‘natural’ in topos-theoretic terms than 
the condition that this functor should preserve all filtered colimits. But the 
latter is appreciably stronger: we shall call a locale X (or more generally a Set- 
topos €) strongly compact if the direct image functor 7.: Sh(X) — Set (resp. 
Yx: E — Set) preserves filtered colimits. 


Examples 3.4.1 (a) Let X be the topological space obtained from two copies 
of the closed unit interval [0,1] by identifying the two copies of the open interval 
(0,1). It is clear that X is compact, since [0,1] is. But, for any n > 0, let 
Xn denote the space obtained from two copies of [0,1] by identifying the two 
copies of the interval (1/2"+!,1 —1/2"t'): it is clear that we have surjective 
local homeomorphisms X, —> Xm whenever n < m, and that X = Xa is the 
colimit of this diagram in LH (and hence in LH/X ~ Sh(X)). So, considering 
the X, as sheaves on X, they form a directed diagram whose colimit has a 
(unique) global section, but no X,, has a global section. Hence X is not strongly 
compact. 

(b) For any group G, it is clear that the topos [G, Set] is compact, since 
it is hyperconnected over Set. But [G,Set] is strongly compact iff G 1s finitely 
generated. For, if G is not finitely generated, we can express it as a nontriv- 
ial directed union of subgroups (G; | i € J); then the G-sets G/G; form a 
directed diagram in [G, Set] whose colimit is the terminal object, but no G/G; 
admits a morphism from 1. On the other hand, if G is finitely generated and 
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we are given a filtered diagram of G-sets A; (indexed by a filtered category 
T), and a morphism 1 — limzA; (that is, an element a of this colimit whose 
stabilizer subgroup is the whole of G), then we can lift a to an element ao 
of some A;, (not necessarily fixed by the whole of G); now the stabilizer sub- 
groups Gy of the elements ay = f (ao) € Ai, as f : io — i ranges over the co-slice 
category ig\Z, form a filtered diagram of subgroups of G whose union is the 
whole of G, so some Gy must be the whole of G. Thus we have shown that 
the canonical map limz7,(A;) — y.(limzA;) is surjective; but we already know 
it is injective, by compactness of [G, Set]. A similar argument shows that for 
a topological group G, the topos Cont(G) is strongly compact iff the top ele- 
ment G is inaccessible by directed joins in the poset of open subgroups of G. 
In particular, if G is a compact topological group, then Cont(G) is strongly 
compact. 

(c) Let C be a small cartesian category, and let T be a coverage on C 
whose covering families are all finite. Then the assertion that a given functor 
CoP — Set is a T-sheaf is equivalent to saying that certain finite diagrams 
are limits in Set; so using the fact that finite limits and filtered colimits 
commute in Set, it is easy to see that Sh(C,T) is closed in [C°?, Set] under 
filitered colimits, i.e. the inclusion functor preserves them. But the direct image 
functor [C°?, Set] — Set preserves all colimits, since it is simply given by eval- 
uation at the terminal object of C. So Sh(C,T) is strongly compact. Thus 
any coherent Grothendieck topos, in the sense of D3.3.1, ıs strongly com- 
pact. In particular, any coherent locale (as defined in 2.4.3 above) is strongly 
compact. 


For the moment we refrain from giving any further explicit examples of 
strongly compact locales; but, once we have developed the theory of locally 
compact locales in Section C4.1, we shall be able to show that any stably locally 
compact locale (in particular, any compact Hausdorff locale) is strongly com- 
pact (see 4.1.14). Indeed, it is true more generally that any compact Hausdorff 
topos is strongly compact; but we shall postpone the proof to 5.3.15, since it 
requires descent theory. (The converse is false, as may be seen by comparing 
3.4.1(b) with 3.2.24, or by considering sheaves on a coherent locale which is not 
Hausdorff.) 


Definition 3.4.2 Following I. Moerdijk and J. Vermeulen [858], we shall call 
a geometric morphism f: F — € tidy if it makes F into a strongly compact 
E-topos, i.e. if the direct image functor f, preserves filtered €-indexed colimits. 
(The term ‘proper’ has also been used for this notion by some authors; but 
since it is not a conservative extension of the well-established localic notion of 
propriety, we feel that the latter term is best used for the class of morphisms 
studied in Section C3.2.) Explicitly, this means that, given any object A of E 
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and any filtered internal category C in €/A, the square 


f/A 


Fi a E 


C, f/A 
uve, Fyra EE ceja 


satisfies the Beck-Chevalley condition. 


As in 3.2.12, we could have phrased the definition in terms of weakly filtered 
categories in E, rather than filtered categories in slices of E. It is again immediate 
from the definition that any stable right Beck-Chevalley morphism, in the sense 
of 2.4.16, is tidy; we shall prove the converse in 3.4.11 below. 

We note in passing that Example 3.4.1(b) suffices to show that hypercon- 
nected morphisms are not necessarily tidy. However, we have the following 
stability properties of tidy morphisms: 


Lemma 3.4.3 Given a commutative triangle of geometric morphisms 


~ 


h 


E 


(i) of f and g are tidy, then h is tidy; 
(ii) if h is tidy and g is connected, then f is tidy; 
(iii) of h is tidy and f is a closed inclusion, then g is tidy. 


Proof (i) and (ii) are just like the first two parts of 3.2.16, replacing the 
directed poset I by a filtered internal category C in €/A, and in (ii) using the 
fact that the morphism [h*C, G/h* A] — [f*C, F/f* A] induced by g is connected 
(so that every object of its codomain is isomorphic to one in the image of its 
direct image functor). For (iii) we use the argument hinted at in the remark 
following 3.2.16: given a filtered internal category D in F, the category C = f,D 
is filtered in € by A4.5.4 and the fact that regular functors preserve filtered 
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categories, and D = f*C since f is an inclusion. So in the diagram 


gp, g) EIS p, E eg 
one cay eee TY 


where the vertical arrows are colimit functors, the outside and the right-hand 
square commute up to isomorphism (the latter since closed inclusions are tidy; for 
this, we may either use 3.4.1(c) and the fact that closed sublocales of the terminal 
locale are coherent, or ‘internalize’ the argument of A4.5.4 to show that the direct 
image f, preserves arbitrary connected €-indexed colimits, and in particular 
filtered ones), and f, is conservative; so the Beck—Chevalley transformation in 
the left-hand square must be an isomorphism. The same argument applied in an 
arbitrary slice F/B of F (which is of course a closed subtopos of €/f,B) proves 
the result. 0 


Once we have proved the stability of tidy morphisms under pullback, we 
shall be able to delete the hypothesis that f is closed from 3.4.3(iii), by an 
argument like that in the proof of 3.2.16(iii). However, since tidy morphisms 
(are proper, and hence) have closed images, the particular case proved above 
suffices to show that a geometric morphism is tidy iff both parts of its surjection— 
inclusion factorization are tidy. It also follows from 3.4.3(ii) that the localic part 
of the hyperconnected-localic factorization of a tidy morphism is tidy. There 
seems to be no reason why the hyperconnected part should be tidy; but we shall 
see in 3.4.15 below that there is a canonical factorization of tidy morphisms 
into tidy factors which are respectively connected and localic, rather like the 
factorization of locally connected morphisms provided by 3.3.5(i). 

The following criterion for strong compactness, due to K. R. Edwards [329], 
is often useful: 


Lemma 3.4.4 A Set-topos E ıs strongly compact iff it ıs compact and satisfies 
the following condition: 


(E) Given any well-supported object A of E, and a directed family (R; | 2 € I) of 
equivalence relations on A whose union is the whole of A x A, there exists 
i € I anda TER subobject B — A such that B x B < R; as 
subobjects of Ax A 


Proof For the necessity, suppose given A and R; as in condition (E). Then 
the quotients A/R; form a directed diagram in € whose colimit is 1, so if € (1, —) 
preserves filtered colimits then some A/R; must admit a morphism from 1. Form 
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the pullback 
BoA ; 


1 >—— A/R; 


then the object B has the required properties. 

Conversely, suppose the condition is satisfied, and let (A, | c € ob C) be 
the vertices of a filtered diagram in E. Since E is compact, we know that the 
comparison map limc€ (1, Ac) — E (1, lime A.) is injective, so we have only to 
prove that it is surjective. So suppose given f: 1 — lime A.. Form the pullbacks 


B: ———> A, ; 


| —— > lime A, 


since colimits in € are stable under pullback, we have 1 = lime Be. In particular, 
1 is the union of the supports of the Be, so by compactness some Be, must be well- 
supported. Now we restrict our attention to the (filtered) co-slice category co\C; 
for each object œ: co — c of this category, let Ra be the kernel-pair of Be —> Be. 
Then the Ra form a directed family of equivalence relations whose union is 
the whole of Bee x Beo, so by assumption there exists some Be —> Be, whose 
kernel-pair contains the whole of B’ x B’, for some well-supported subobject 
B’ — B,,. But this is equivalent to saying that the composite B’ > Ba —> Be, 
factors through 1; now the composite 1 + Be, > Ae, is an element of € (1, Ac, ) 
mapping onto the given element f. Cl 


The proof of 3.4.4 is constructive, and so can be internalized to an arbitrary 
topos S to give a criterion for tidiness of morphisms E — S. 

We now turn to the problem of obtaining a characterization of bounded tidy 
morphisms in terms of sites. It appears that here, in contrast to the situation 
for proper maps (cf. the remarks after 3.2.19), there is no way of avoiding the 
assumption that we are working over a topos with a natural number object; 
for we need to be able to construct the equivalence relation generated by a 
subobject of a square (i.e. an object of the form U x U of our site) as a countable 
union. It therefore seems sensible to restrict to sites whose underlying categories 
are pretoposes, rather than merely coherent categories as in Section C3.2; this 
assumption (or at least the assumption that C is positive as a coherent category) 
has an appreciable simplifying effect on the condition we obtain. 

Our basic strategy is to rewrite the condition (E) of 3.4.4 in terms of our 
site (C, J). Of course, in addition to assuming that C is a pretopos, we shall 
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assume as in Section C3.2 that J is the join of the coherent coverage P and 
a dm-coverage D, such that the terminal object of C is D-irreducible. Given a 
subobject V — U x U in C, we shall write V) for the reflexive and symmetric 
closure of V (that is, the union of V, its opposite and the diagonal) and V ™® 
(n > 1) for the nth iterate (in Rel(C)) of V), and refer to the sieve on U x U 
generated by the V™) > U x U as the equivalence steve associated with V; it is 
of course a dm-sieve. 


Proposition 3.4.5 A Grothendieck topos E is strongly compact iff there exists 
a site of definition (C, J) for E such that C is a (small) pretopos, J = PV D is a 
coverage satisfying the hypotheses of 3.2.19(iv) and, if U is any well-supported 
object of C and R any D-covering sieve on U x U, there exists a: U' = U such 
that U’ is well-supported, and a morphism e: V > U xU in R such that the 
pullback along a x a: U' x U' >= U x U of the equivalence sieve associated with 
e 1s D-covering. 


Proof First suppose € = Sh(C,.J) where C satisfies the given conditions. 
By 3.2.19 we know € is compact, so we must show that it satisfies condition 
(E). So suppose given a well-supported object A of E£, and a directed family of 
equivalence relations (R; | i € J) as in the statement of (E). We note first that 
there is a well-supported object U of C such that (U) admits a morphism to A 
(equivalently, A(U) is inhabited); for the objects U such that A(U) is inhabited 
must cover the terminal object, and by compactness there is a finite subfamily of 
them which cover, and we can form the coproduct of this family in our site. Now 
I(U x U) = (U) x (UV) is covered in E by the subobjects which are the pullbacks 
of the R;, so we can find a D-covering sieve on U x U whose image under l 
refines this cover. Let a: U’ > U and e: V >= U x U be as in the statement of 
the proposition. By construction, (V) — (U x U) — A x A factors through 
some R; — A x A, and since the latter is an equivalence relation it follows that 
lV™)) — (U xU) — Ax A also factors through R;, for all n. But the pullbacks 
of the 1(V(™) — (U x U) along I(U' x U’) >= I(U x U) are jointly epic; so if we 
take B — A to be the image of I(U') — I(U) — A, then B x B is contained in 
R; (and B — 1 is epic, since 1(U’) — 1 is epic). So condition (F) is verified. 
Conversely, suppose E is strongly compact; let C be a small full generating 
subcategory of £ which is closed under finite limits and colimits, and J the 
coverage induced on it. By 3.2.18, we can decompose J as P V D in the required 
manner. To verify the condition in the statement of the Proposition, suppose 
given U and R as stated there. For each e: V — U x U in R, let V > (U) xI(U) 
be the equivalence relation (in €) generated by [(V) — I(U) x I(U): of course, 
it is the union in Subg(I(U) x I(U)) of the (V) — (U) x'I(U). By condition 
(E), there exists B — I(U) such that B — 1 is epic and B x B is contained in 
V for some V. As in the first part of the proof, we may now choose U’ — U in 
C such that U’ is well-supported and [(U’) — I(U) factors through B — I(U); 
so 1(U') x l(U") is also contained in V, whence U’ x U’ must be covered by the 
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directed family of monomorphisms (V‘”) N (U’ x U’) — U' x U' | n > 1). So the 
latter generates the required D-covering sieve. 0 


We shall call a monomorphism V — U x U in C, where U is well-supported, 
an effective subobject (relative to a given coverage on C) if the equivalence sieve 
associated with it covers U’ x U’ for some well-supported U’ — U. Thus the 
condition in the statement of 3.4.5 says that every dm-cover of a well-supported 
square contains an effective subobject. (Warning: this notion of effectivity is 
not simply a property of the monomorphism in question; it also depends on 
the particular decomposition of its codomain as a square U x U - if it happens 
that U x U coincides with U’ x U’ in C, the effective subobjects of the two 
squares may be entirely different. Thus the set E appearing in the next lemma 
should really be viewed as a set of pairs (U,e) where U is a well-supported object 
and e: V + U x U; however, to simplify the notation we shall treat it as if its 
members were simply the monomorphisms e.) 


Lemma 3.4.6 Let C be a pretopos; let D be a (sifted, but not necessar- 
ily Grothendieck) dm-coverage on C, and suppose we have a family E of 
distinguished monomorphisms into well-supported squares with the following 
properties: 


(i) Every (V = U x U) in E is effective (with respect to the coverage D). 


(ii) f (V —= U x U) € E and R € D(V), then there exists (V' = V) € R 
such that the composite V’ — V —U x U is still in E. 


(iii) For every well-supported object U , the identity morphism U x U = U x U 
belongs to E. 


Then the Grothendieck coverage D generated by D has the property that every 
D-covering sieve on a well-supported square contains a monomorphism in E. In 
particular, the topos Sh(C, PV D) (where P is the coherent coverage) is strongly 
compact. 


Proof Consider the set T of all dm-sieves R on objects of C with the property 
that, given a morphism V — b( R) and a monomorphism V — U x U € E, there 
exists V’ — V such that V’ — V — U x U is still in E and the composite 
V’ >= V — b(R) belongs to R. We must show that this set is a Grothendieck 
coverage; it clearly contains D, by condition (ii), so it will then follow that it 
contains D. 

The form of the definition of T automatically ensures that it satisfies the 
pullback-stability condition (C); and condition (M) is guaranteed by condition 
(iii) in the statement of the lemma. So we have to verify condition (L). Let R, S 
be dm-sieves on W such that R € T(W) and a*(S) € T(dom a) for all a € R, 
and suppose given a: V — W and e: V = U x U € E. By assumption, we can 
find V’ — V such that V’ — V — W isin R and V’ > U xU is still in E. Now 
the pullback of S along the composite a’: V’ > V — W is in T(V’), so applying 
the definition of T to it we obtain V” — V’ € a™*(S) such that V” = U x U is 
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still in E. Thus the composite V” — V — W isin S, and we have verified that 


S e T(W). 
The second assertion of the lemma now follows from condition (i) and 3.4.5, 
since if the morphisms in E£ are D-effective they are certainly D-effective. O 


Theorem 3.4.7 Suppose given a pullback square 


fie Nae 


f 


Fag 


in Top, where f is bounded and tidy, and E has a natural number object. Then 
k is tidy, and the Beck-Chevalley natural transformation g* f, —> k,h* is an 
isomorphism. 


Proof Let (C, J) be an internal site for F over E satisfying the hypotheses of 
3.4.5. Then, as in the proof of 3.2.21(i), we may obtain a site for H over G by 
equipping the pretopos g*C with the coverage which is the join of its coherent 


coverage and the Grothendieck closure g#D of g#D. Let E — Co x Cı be 
the set of pairs (U,e: V —= U x U) such that U is well-supported and e is a 
D-effective subobject; then clearly E satisfies the conditions of 3.4.6. We claim 
that g* E satisfies the same conditions with respect to the coverage g* D on g*C; 
the fact that it satisfies the second and third conditions is obvious, and the 
first condition follows from the fact that, since g* preserves the natural number 
object, it also preserves the construction of the equivalence sieve associated with 


a monomorphism into a square. Hence (g*C, g# PV g# D) satisfies the hypotheses 
of 3.4.5, and so k is tidy. 

For the Beck-Chevalley condition, consider an object of F, which we may 
think of as a discrete fibration F — C satisfying the sheaf axiom for all covers in 
J. Then h*F is simply the reflection of g*F — g*C in the subcategory of sheaves 
on g*C, and the Beck—Chevalley transformation g* f,(F) — k,h*(F) is simply 
the component of the unit map at the terminal object 1 of g*C. By 3.2.21, we 
already know this morphism is monic, so we need only show that it is epic. 

First we note that F satisfies the condition which says that, given a 
well-supported object U of C, an element z €e F(U) and a morphism 
e = (e€1,€2): V = U xU in E such that F(b))(r) = F(b2)(x), there exists a 
well-supported U’ — U and an element y € F(1) such that F(U’ — U)(x) = 
F(U’ — 1)(y) in F(U’). For, since F is a sheaf for the coherenj coverage, the fact 
that the elements F(7)(x) and F(m2)(x) of F(U x U) become equal on restric- 
tion to V implies that they become equal on restriction to V'" for each n; but 
these subobjects generate a sieve whose pullback to U’ x U’ is D-covering, and 
hence F(U’ — U)(x) has equal images under the two projections U’ x U’ 3 U”. 
But this says that the restriction of x to F(U’) defines a compatible family for the 
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P-covering sieve generated by U’ — 1; so it is the image of a unique y € F(1). 
Now the condition at the beginning of this paragraph may be expressed by a 
coherent sequent (cf. D1.1.5) in the internal language of €; hence g*F satisfies 
the same condition in G. 

Now let F — g*C denote the associated sheaf of g*F, and suppose given an 
element z € F(1). By the definition of the associated sheaf given in the proof of 
2.2.6, z is determined by a family of elements (x; € g*(F)(U;) | i € I), where 
(U; — 1 | i € I) isa covering family, such that the z; are ‘locally compatible’, i.e., 
for each pair (i, j), the elements g* F'(71)(x;) and g* F(12)(z;) agree on a cover of 
U; x Uj. Now, by compactness of g*C and the fact that it has finite coproducts, 
we may replace the family (x; | 7 € J) by a single element y € g*F(U) for some 
U — 1 in g*C; moreover, it is easy to see that the family S of all morphisms 
(€1,€2): V => U x U for which g* F'(e1)(y) = g* F(e2)(y) is a dm-sieve on U x U, 
since g*F is a sheaf for the coherent coverage. Hence this covering sieve belongs 
to g# D. So, by the proof of 3.4.6, we can find a monomorphism e: V > U x U 
in S which belongs to g*(E); but, by the argument of the previous paragraph, 
this means that we can find a well-supported U’ — U such that the restriction 
y’ of y to g*(F)(U') derives from an element z € g*(F')(1). Now the image of z 
in F(1) must equal z, since they agree on the covering U’ —> 1 and F is a sheaf; 
so we have verified that g*F(1) > F(1) is epic. O 


As usual, we have a second proof of the pullback-stability theorem for tidy 
morphisms, in which we assume boundedness for the morphism along which 
we are pulling back, and factor it as an inclusion followed by one of the form 
[C,E] — E. However, this proof differs from those in the last three sections, 
in that we have to assume boundedness for the tidy morphism as well; this 
is because we shall need to appeal to Theorem 3.3.14, which we have proved 
only for bounded morphisms. Nevertheless, the second proof seems worth giv- 
ing, since it does not require the assumption that the toposes involved have 
natural number objects, and it is therefore applicable in contexts where 3.4.7 
is not. 

In fact, the first step in the proof does not require the tidy morphism to be 
bounded. 


Lemma 3.4.8 Let f: F — € be a tidy geometric morphism, and let C be an 


internal category in E. Then the induced morphism |C, f]: [f*C, F] — [C, E] is 
tidy, and the pullback square 


fre. F] pail [C, £] 


f 


F ——-> E 
satisfies the Beck-Chevalley condition. 
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Proof The proof that [C, f] is tidy is exactly like the corresponding part of 
the second proof of 3.2.21: given a filtered internal category I in [C, E], we let J 
denote its underlying (filtered) category in €/Co, and form the cube displayed 
in the proof of 3.2.21. As before, the fact that the top face commutes up to 
isomorphism follows from the fact that the bottom face does so and the vertical 
edges reflect isomorphisms. The Beck—Chevalley condition is easily verified by 
direct calculation, without using the tidiness of f (which is not surprising, since 
[C, E£] — E is locally connected). O 


Lemma 3.4.9 Let f: F — E be a bounded tidy morphism; let j be a local 
operator on E, and let 7’ be the local operator on F for which the square 


W E 


f 


F——>€ 


is a pullback. Then the Beck-Chevalley natural transformation g* f, — k,h* is 
an isomorphism. 


Proof By A4.3.12, k, is simply the restriction of f, to sheaves; so the lemma is 
equivalent to the statement that f, commutes up to isomorphism with the asso- 
ciated sheaf functors for j and 7’. But, as we remarked at the end of Section A4.4, 
the associated j-sheaf of an object A may be constructed as At+, where At is 
the colimit of an internal diagram A of shape JP, and J = (J; 3 J) is the 
sub-poset of Q determined by the object J (this is the internalization of the 
construction of this functor which we gave in the proof of 2.2.6; we shall not give 
the details here, but the interested reader may find them in [501] or [504]). So it 
suffices to show that the diagram 


fie 


Fs 


E 

+ l 
fs 

F—_ >E 


commutes up to isomorphism. But the first and third conditions in the definition 
of a local operator ensure that J is an internal meet-semilattice in £, and so J°? 
is filtered; hence this is almost immediate from the definition of a tidy morphism. 
The only reason why it is not quite immediate is that, to compute the functor 
+ in F, we have to form a colimit over the internal category J’°?, rather than 
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f*J°P. However, if B is an arbitrary object of F and we form the appropriate 
diagram B of shape J’°?, it is easily verified from the description of A in the 
proof of A4.4.10 that 


— 


f*B —— f.(B) 


CRORES SO 


is a pullback, where the bottom edge of the square is the transpose of the canon- 
ical morphism f* J — J’ in F. So, to complete the proof, it suffices by B2.5.12 
to show that f*J — J’ is an initial functor. But that is precisely what 3.3.14 
tells us. O 


Theorem 3.4.10 Suppose given a pullback square 


k 
——> 

J 
——_—> 


men =a 


in Yop, where f is bounded and tidy, and g is bounded. Then k is tidy, and the 
Beck-Chevalley natural transformation g* f —> k,h* is an isomorphism. 


Proof By 3.4.8, we may reduce to the case when g is an inclusion; and the 
Beck-Chevalley condition for this case is proved in 3.4.9. So it remains to prove 
that k is tidy, which we shall do by showing that it is a stable right Beck- 
Chevalley morphism. 

Suppose given a bounded morphism r: G’ — G. Then the composite gr is also 
bounded, so we may factor it as 


/ 
Gita pie aA F 


where g’ is an inclusion and €’ œ [C, E] for some internal category C in E£. Now 
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we may form the cube 


H! k' > g' 
r 
S 
h' H ——— G 
k 
g' 

1 f' / 
ee | ee g 
q 4 i 

f 


F —— E 


in which all faces with horizontal edges are pullbacks, and the other two faces 
commute. Moreover, the Beck-Chevalley condition holds for the bottom face by 
3.4.8, and for the front and back faces by 3.4.9, so an easy diagram-chase shows 
that the canonical natural transformation r*k,h* — k! s*h* is an isomorphism. 
But h is an inclusion, so h* is essentially surjective on objects; hence r*k, — k! s* 
is an isomorphism. 

Thus we have shown that k is a RBC morphism in the sense of 2.4.16. But 
the same diagram shows that any pullback of k along a bounded morphism is 
expressible as a pullback of a tidy morphism along an inclusion; so it too is a RBC 
morphism. Hence k is a SRBC morphism; as we remarked after Definition 3.4.2, 
this trivially implies that it is tidy. 0 


Corollary 3.4.11 A bounded geometric morphism f: F — E is tidy iff it is a 
stable right Beck-Chevalley morphism in the sense of 2.4.16. 


Proof One direction is immediate from the definition; the converse follows 
from 3.4.10 (or from 3.4.7, if E has a natural number object). 0 


We do not yet have a ‘dual’ for the part of 3.3.15 which says that connected 
locally connected morphisms are stable under pullback. We could obtain this 
by observing that connectedness is always preserved under pullback along mor- 
phisms of the form [C, E] — €, and then using the Beck-Chevalley condition to 
show that connectedness of tidy morphisms is preserved under pullback along 
inclusions. However, we shall take an alternative route, which involves discussing 
tidiness in the context of the pure-entire factorization of a geometric morphism. 
We have not met this factorization before (though we met entire morphisms 
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in B4.4.21(a)), but it is easily described: recall that a morphism f: F — E is 
said to be entire if it is localic and the corresponding internal locale X in E 
is compact and zero-dimensional (i.e. O(X) is generated by its complemented 
elements). It is well known that compact zero-dimensional locales are coher- 
ent; in fact they are precisely those whose frames are of the form JB for a 
Boolean algebra B (cf. {520, II 4.2]). It thus follows immediately from 2.4.3 that 
entire morphisms are stable under pullback, and from 3.4.1(c) that they are 
tidy. 

We recall also that in any topos we have Q © J(2), where 2 = 1111 is the 
Boolean algebra of complemented elements of Q. We define a geometric morphism 
f: F — E to be pure if f, preserves 2; more explicitly, if the canonical morphism 
2¢ — f. (2) (the unit of (f* 4 f,) at the object 2) is an isomorphism. 


Lemma 3.4.12 
(i) A morphism f is pure iff fẹ preserves finite coproducts. 


(ii) Hyperconnected morphisms are pure. 
Proof (i) One direction is immediate since 2 = 1 H 1 and f, preserves 1. For 


the converse, we note first that a pure morphism f must be dense (i.e. f, must 
preserve 0), since f, preserves the pullback 


0—1 
vi 
1 ————> 2 


For binary coproducts, we similarly use the fact that f, preserves the pullbacks. 


Vy V2 
A — > AUB «——— B 


Vy V2 
| ——— 2 «|1 


(ii) If f: F — E is hyperconnected, then for any object A of E we have an 
isomorphism Sube (A) & Sub¢(f*A), which clearly restricts to an isomorphism 
between their sublattices of complemented elements. So morphisms A —> 2¢ 
correspond bijectively to morphisms A — f, (2). O 
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Proposition 3.4.13 
(i) Pure morphisms are orthogonal to entire morphisms: that is, given a 


commutative square 


eee 


H G 
i ! 
f 
F——>€E 


where h is pure and g is entire, there exists (uniquely up to canonical 
isomorphism) a morphism l: F — G making the two triangles commute. 


(ii) Every geometric morphism can be factored as a pure morphism followed 
by an entire morphism, and the factorization is unique up to canonical 
equivalence. 


Proof (i) Since entire morphisms are stable under pullback, it suffices to show 
that if g: G — E is entire and h: H — € is pure, then any morphism k: H —> G 
over € factors through a unique section of g. But this is easy, since k corresponds 
to a frame homomorphism g,(Qg) > hs (Nu) in E, and hence to a lattice homo- 
morphism g,(2g) > h.(Qy%); and the latter must factor (uniquely) through the 
Boolean algebra h, (24) = 2¢ of complemented elements of h,(Q7). 

(ii) Given an arbitrary morphism f: F — E, let X be the locale in E defined 
by O(X) = I(f.(2¢)), and let G = She(X). Then G is clearly entire over 
E€; and the canonical morphism f,(2¢-) — f,(Q¢) induces a frame homomor- 
phism I(f,(2¢)) > f.(Qs) (cf. 1.1.3) and hence a geometric morphism h: F — G 
over E. We claim that A is pure. 

To prove this, we may use 3.4.12(ii) to reduce to the case when f is localic, 
i.e. F ~ She(X) where X is the internal locale defined by O(X) = f,(Q#). In 
terms of this description, Q¢ is the sheaf on O(X) corresponding to the discrete 
fibration O(X),; — O(X) whose domain is the order-relation on O(X), and the 
mapping sends a pair (U, V) of opens in X with U < V to V (cf. A2.1.8). Clearly, 
the object 2, corresponds to the subobject 


{(U,V) | (AW: O(X))((UNW = 9) and (UUW =V))} 


of O(X),. Now the direct image of the geometric morphism h is simply given by 
pullback along the frame homomorphism I(f,(2-)) > O(X), which of course 
sends an ideal of complemented elements of O(X) to its join. If we use the 
alternative representation of objects of G as sheaves for the coherent coverage 
on f,(2) itself, we see that h,(2F) corresponds to the funétor whose value at 
a complemented open U is the set of complemented opens below U. But this 1s 
precisely the object 2g; so h is pure. 

This establishes the existence of the factorization; the uniqueness follows 
from (i). o 
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Note that it follows from 3.4.13 that a composite of entire morphisms is entire, 
as we claimed in B4.4.22. (In fact it would not be hard to prove this directly 
from the definition.) 

Generalizing 3.4.12(ii), connected morphisms are pure; in fact 


Lemma 3.4.14 For a geometric morphism f: F —€E, the following are 
equivalent: 


(i) f is connected. 
(ii) fẹ preserves E€-indexed coproducts. 


(iii) The pullback of f along any entire morphism with codomain E is pure. 


Proof (ii) => (i) is immediate, since the assertion that f, preserves €-indexed 
coproducts in particular says that, for any object A of £, the canonical mor- 
phism “ia(f/A)«(1f-a) — fe 2 ¢-a(1y-a) is an isomorphism; but this canonical 
morphism is just the unit map A — f, f* A. 

(i) = (ii): Conversely, if f is connected, then for any h: A > B in € the two 
ways round the diagram 


Zf 
F/F A —— F/f*B 


(F/A) (f/b). 


Lh 
EJA ——"- > EJB 


both send an arbitrary k: C — f*A to the composite h-(n4)~'- (fak), where n 
is the unit of (f* 4 fx). 

(i) + (iii): Let g: G — E be an entire morphism, corresponding to an internal 
Boolean algebra B in €, and form the pullback 


2 EG 


H 

i i 
i 

F —— E 


Then the assertion that k is pure is equivalent to saying that the top and 
right edges of the above square are the pure—æntire factorization of the diag- 
onal, i.e. that f,h.(27) = B as Boolean algebras in E. But h is also entire; 
indeed, by the proof of 2.4.3 we know that h,(2x) is the Boolean algebra f*(B). 
Hence (iii) is equivalent to the assertion that the unit map B —> f, f*(B) is an 
isomorphism for every object B of € which is the underlying object of a Boolean 
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algebra. But by A4.5.23 there exists a surjection p: B — E where B is a Boolean 
topos; so for any object A of € we have a monomorphism 


p»({}) 


A >——> p,p*(A) >————>  p, (2 ““) 


from A to the underlying object of a Boolean algebra, and by applying the same 
construction to the cokernel-pair of this morphism we may represent A as the 
equalizer of a pair of maps between the underlying objects of Boolean algebras. 
Since f* and f, both preserve equalizers, the assertion above is thus equivalent 
to saying that A — f,f*(A) is an isomorphism for all A. o 


The pure-entire factorization of an arbitrary morphism is not stable under 
arbitrary pullback; but if a pullback square 


EERE 


h 


ves maS 


Pels 


satisfies the Beck-Chevalley condition g* f, = k,h*, then in particular we have 
g* f.(2e) = kh* (27) = k(2n), and so the pure-entire factorization of f pulls 
back to that of k. Hence we may conclude 


Corollary 3.4.15 
(i) The pure-entire factorization of an arbitrary morphism is stable under 
pullback along locally connected morphisms. 


(ii) The pure—-entire factorization of a bounded tidy morphism is stable under 
arbitrary bounded pullback. Moreover, both halves of the factorization 
are tidy. 

(iii) A bounded tidy morphism is pure iff it is connected. 


(iv) Connected bounded tidy morphisms are stable under pullback. 


Proof (i) and the first assertion of (ii) are immediate from 3.3.15 and 3.4.10 
respectively. For the second assertion of (ii), we recall that entire morphisms are 
tidy in any case; for the pure part, let 


| Ley, en EE 
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be the pure—entire factorization of a tidy morphism f: F — E. Then we have a 
diagram 


ae 
(1, h) A, 
1 x 

F << — F xe G — > G xeG 


f T2 


E <____- G 


where the squares are pullbacks and the vertical composite in the middle is h; 
but f is tidy by assumption and A, is tidy because it is a closed inclusion. 
So the result follows from the stability of tidy morphisms under composition 
and pullback. 

(iii) now follows from (ii) and 3.4.14(iii), and (iv) is immediate from (ii) 
and (iii). oO 


Remark 3.4.16 An alternative proof of 3.4.15(iii), without the assumption 
of boundedness, may be given using 3.4.12(i) and 3.4.14(ii), plus the fact 
that arbitrary coproducts may be expressed as filtered colimits of finite ones 
(cf. D5.2.14). 


It is also possible to give an alternative, more direct (but rather more long- 
winded!), proof of the fact that the pure part of the pure-entire factorization of 
a tidy morphism is tidy. The argument relies on the following result, which is of 
interest in its own right — it extends to entire morphisms something which we 
already knew for closed inclusions, by A4.5.4. 


Lemma 3.4.17 Let f:F —€ be an entire morphism. Then f, preserves 
coequalizers of reflexive pairs; in particular, it is a regular functor. 


Proof Suppose F ~ Sh¢(B, P), where B is an internal Boolean algebra in E 
and P is its coherent coverage. We claim that the inclusion Sh¢(B, P) — [B°?, E] 
preserves reflexive coequalizers; since the direct image functor [B°?,€] — E is 
given by evaluation at the top element of B (and hence preserves all colimits), this 
will suffice. But the coverage P is generated by the empty cover of the bottom 
element 0 and all two-element covers (a — a V b,b — a V b) where aA b = 0 
(since any two-element cover may be refined to a disjoint one); hence we see 
that a diagram F on B® is a P-sheaf iff F(0) S 1 and F(a V b) S F(a) x F(b) 
whenever a and b are disjoint. Using the fact that binary products commute 
with reflexive coequalizers in a cartesian closed category (A1.2.12), it follows 
easily that if F = G is a reflexive pair of morphisms in Sh¢(B, P) then their 
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‘pointwise’ coequalizer (i.e. their coequalizer in [IB°P, €]) is a sheaf, and hence is 
their coequalizer in She(B, P). The second assertion of the lemma follows from 
the fact that any epimorphism in a topos may be presented as the coequalizer 
of its kernel-pair (A1.3.4). 0 


A further property of entire morphisms which we require is that, if f: F — E 
is entire and A is any well-supported object of F, then the counit map f* fA —> 
A is epic. For if we represent F as She(B, P) where B = (Bı = B) is a Boolean 
algebra and P is its coherent coverage, then the assertion that A — 1 is epic 
implies by the result just proved that A(1) is inhabited (where 1 denotes the 
top element of B). Hence we can represent any x € A(b) as the restriction to 
b of an element of A(1) = f(A), by choosing an arbitrary y € A(1) and then 
patching together x and the restriction of y to A(—b); this defines an element of 
f* f«(A)(1) whose restriction to f* f.(A)(b) maps onto x. Thus f* f,(A) — A is 
already epic in [B°?, E], and hence also in Sh¢(B, P). 

Now let f: F — E be an arbitrary tidy morphism, and let 


| LE TER 


be its pure-entire factorization. Suppose given a filtered internal category C in 
G; then by 3.4.17 the category g,C is filtered in E, and since filtered categories 
are necessarily well-supported the counit map g*g,C — C is epic, and hence 
final in the sense of B2.5.12. Hence the diagram 


lime 
G 


lim 9+ 9, 


[9"9«C, G] 


commutes, where the vertical arrow is induced by pullback along the counit. 
Now consider the diagram 


[A*C, F] ———> [C,9] 


[a*g"geC, F] —> [9"g-C,G] —> [g.C, E] 


lim lim lim 
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where the horizontal arrows are all direct image functors. If we chase a discrete 
opfibration F — h*C around the upper square, we obtain the pullbacks of h,F 
either way around the square 


g* 9 C ———> C 


h,h*g*g.C —> h,h*C 


formed by the unit and counit maps; but this square commutes by naturality of 
the unit, and so the top square in the large diagram commutes up to isomorphism. 
Also, the right lower square and the outside of the lower rectangle commute up to 
isomorphism, since gh and g are both tidy. So we conclude that, for any F — h*C, 
the Beck—Chevalley morphism limc¢[C, h].(F) > h,lim),-c(F) is mapped by g, 
to an isomorphism; i.e., considering it as a map between sheaves on a Boolean 
algebra B in €, it is ‘bijective on global sections’. Now each element b of B 
corresponds (via the principal ideal | (b)) to a clopen subtopos of G, and hence 
(via pullback) to a clopen subtopos F’ of F; the composite F’ — F — € is still 
proper, so on restricting to it we may similarly deduce that the Beck—Chevalley 
morphism is bijective on sections over b, for any b; i.e. it is an isomorphism in 
Sh(B, P) =G. 

We are still not finished; for we have to consider filtered categories in arbi- 
trary slices of G, not just in G itself. However, these can be dealt with in a 
similar manner, using the fact that every object of G is an E-indexed colimit of 
complemented subterminal objects. We shall not go into the details here. 

For geometric morphisms over a base topos S, local connectedness may be 
viewed as a strengthening of the property of being S-essential. In the same way, 
tidiness of a morphism f: F — E may be seen as a strengthening of the ‘relative’ 
condition that the direct image functor f, should preserve filtered S-indexed 
colimits (that is, the special case of Definition 3.4.2 where the object A and 
the filtered category C are assumed to come from S and S/A respectively). This 
relative notion of tidiness has also been studied by Moerdijk and Vermeulen [858]: 
in particular, they proved the dual of Pitts’s Theorem 2.3.17, which we state here 
(although we refer to [858] for the proof). 


Theorem 3.4.18 Suppose given a comma square 


(f 19) ——>G 
u—- g 
f 
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in BLop/S, where S is a topos with a natural number object. If g is tidy relative 
to S, then u is tidy, and the Beck-Chevalley natural transformation f* g, —> u.v* 
is an isomorphism. O 


Suggestions for further reading: Johnstone [519], Moerdijk & 
Vermeulen [857, 858]. 


C3.5 Atomic morphisms 


In this section we study those geometric morphisms whose inverse image functors 
are logical. Of course, we already know quite a bit about such morphisms: we 
saw in A2.3.2 that every local homeomorphism has logical inverse image, in 
A2.2.10(ii) that any such morphism must be essential, and in A2.4.8 that the 
left adjoint of its inverse image functor must preserve monomorphisms. Also, 
such a morphism must be locally connected, since its inverse image commutes 
with the functors Ilp. 

Traditionally, a geometric morphism with logical inverse image is said to be 
atomic. This is perhaps not a particularly good name, since it fails to suggest 
that the condition is a strengthening of ‘open’ and of ‘locally connected’; in fact 
it derives from a characteristic property of atomic Set-toposes (that is, toposes 
which possess an atomic geometric morphism to Set), as we shall see in 3.5.9 
below, and is not really appropriate for morphisms between non-Boolean toposes. 
Nevertheless, it is by now a well-established term, and we have not attempted 
to depart from it. 


Lemma 3.5.1 
(i) A composite of atomic morphisms is atomic. 


(ii) Let 


g 
H — F 


E 


be a commutative diagram of geometric morphisms, where h is atomic and 
g is an open surjection. Then f is atomic. 


Proof 

(i) is trivial. 

(i) is proved in exactly the same way as 3.3.2(iv), but using the power-object 
comparison map ġa: f*(PA) — P(f*A) instead of the exponential compari- 
son map. 0 
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Corollary 3.5.2 Let € be a Grothendieck topos with enough points. Then E is 
Boolean iff the unique geometric morphism y: E — Set is atomic. 


Proof Clearly, if y is atomic then € must be Boolean, since y* preserves the 
isomorphism (1 H 1) = Q. Conversely, if € is Boolean, we apply 3.5.1(ii) to the 
diagram 


Set /K vs E 


y 

Set 
where K is a sufficient set of points of E (cf. 2.2.12); note that g is open by the 
remark before 3.1.9. O 


For Grothendieck toposes without enough points, atomicity over Set is a 
stronger condition than Booleanness. Indeed, for localic toposes we have: 


Lemma 3.5.3 Let X be a locale. Then the following are equivalent: 
(i) X is discrete (that is, O(X) is a full power-set). 
(ii) Sh(X) is atomic over Set. 
(iii) Sh(X) is Boolean and has enough points. 


Proof (i) => (iii) follows from the fact that if X is discrete we have Sh(X) ~ 
LH/X = Set /X. 

(iii) = (ii) follows from 3.5.2. 

(ii) => (i): Let yı: Sh( X) — Set be the left adjoint of the inverse image func- 
tor. Then by A2.4.8 we have O(X) = Subsn(x)(1) = Subset (7:(1))- 0 


An atomic Set-topos which is not localic need not have enough points: we give 
a counterexample in D3.4.14. However, one part of the proof of 3.5.3 survives in 
general: if y: E — Set is atomic, then for every object A of E we have Sube(A) S 
Subget(71(A)), so subobject lattices in £ are full power-sets (that is, complete 
atomic Boolean algebras). This is the origin of the name ‘atomic topos’, as hinted 
earlier. 

The equivalence of (i) and (ii) in 3.5.3 remains valid over an arbitrary base: 


Lemma 3.5.4 
(i) A connected atomic morphism is hyperconnected. 
(ii) If f: F — E is atomic, then the factorization F — €/fil — E of 3.3.5(i) 
coincides with the hyperconnected-localic factorization of f, and both 
halves of it are atomic. 


(iii) A localic morphism is atomic iff it is a local homeomorphism. 


Proof (i) If f: F — € is atomic and connected, then for every object A of 
E we have a bijection Sub(f* A) — Sub( fı f* A) S Sub(A) induced by fi, whose 
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inverse must be induced by f*. In particular, every subobject of f*A is in the 
image of f*; i.e. f satisfies condition (ii) of A4.6.6. 
(ii) As in the proof of 3.3.5(i), we may factor the inverse image functor 
E/fil—F as 
(f/fil)* ni 


efi Fr iT 


where 7) is the unit of (fi 4 f*); hence it is logical. So by (i) the morphism 
F — E/ fil is hyperconnected; and since the other factor is localic by A4.6.2(a), 
the factorization is the hyperconnected-localic one. 

(iii) is immediate from (ii), since if f is localic then the first half of the 
factorization in (ii) is an equivalence. o 


For dealing with atomic morphisms over an arbitrary base, the following 
characterization is often useful. For the moment, let us write D: E — Loc(€) 
for the functor which sends an object A to the discrete locale on A (defined by 
O(DA) = PA), and let X + X, denote its right adjoint which sends a locale 
X to its object of points (i.e. the hom-object Frm(€) (O(X),Q) of the locally 
internal category Frm(€); cf. 1.2.2). 


Lemma 3.5.5 For a geometric morphism f:F > €, the following are 
equivalent: 

(i) f ts atomic. 

(ii) For any internal locale X in E, we have f*(Xp) = (f*X)p- 
(iii) The functor fi: Loc(¥) — Loc(E) preserves discrete locales. 

) 


For every object B of F, the composite F/B — F — E can be factored as 
a hyperconnected morphism followed by a local homeomorphism. 


(iv 


Proof (i) > (iv) follows immediately from 3.5.4(ii). 

(iv) > (iii): By the definition of fı on locales in Section C2.4, fi(DB) is 
the internal locale in € corresponding to the localic part of the composite 
Shz(DB) ~ F/B — F — E. Condition (iv) says that this locale is discrete. 

(iii) = (ii): Let X be an internal locale in E. Then, for any object B of F, we 
have bijections 


B — (fX) 
DB => fX 
fi(DB) — X 
fi(DB) — D(X) since fı( DB) is discrete 
DB — FDX) 
DB —> D(f*(X,)) since f* preserves discrete locales 
B — F(X») 


so (f*X)p S f*(Xp). 
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(ii) = (i): Given an object A of £, let S^ be the A-fold power of the Sierpiński 
locale, i.e. the locale such that O(S“) is the free frame FA generated by A 
(cf. 1.1.4). From the adjunction (f* 4 f,), it is clear that f*(F.A) S F(f*A), 
so on taking objects of points we obtain 


f*(PA) = f*((S4)p) S (f° (S4))p S (SIA) p S P(f* A), 
i.e. f* is logical. 0o 


Condition (iv) of 3.5.5 should be compared with 3.3.5(ii). In connection with 
3.5.5(ii), we remark that if f is atomic then the functor f*#: Frm(€) > Frm(F) 
is simply the functor f* applied to internal frames in €; the fact that logical 
functors preserve completeness of internal posets was proved in B2.3.10(i). Using 
this, it is not hard to give a direct proof of the implication (i) = (ii) in 3.5.5 (as 
is done, for example, in [560]). 

An interesting feature of atomic morphisms is that their one-sided inverses, 
if any, are necessarily open. 


Proposition 3.5.6 Let f: F — E be an atomic morphism, let A be an object 
of E and let g: E/A — F be a geometric morphism over E. Then 


(i) g is open; 
(ii) if f is connected and A — 1 is epic, g is surjective. 


Proof It is convenient to prove the second assertion first. 

(ii) Since both f, and f* preserve Q, we know that their left adjoints f* 
and f, both induce bijections on subobject lattices, by A4.6.6(vi) and A2.4.8. 
Moreover, an easy diagram-chase shows that, for any B € ob F, the composite 
bijection Sub( B) — Sub(f* fı B) is inverse to the operation of pulling back along 
the unit map B — f* fi B; so, by considering the image of the latter, we see that 
it must be an epimorphism. Now, for any B’ > B, we have a pullback square 


B' f* fiB' ; 


B 


ffs 
applying g* to this, we obtain a pullback square 


g* B! ——» A*fiB’ , 


g B 


A* fiB 
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since g* f* = A*: E — €/A. Hence g*, like fı, preserves properness of subobjects; 
but since it also (unlike fı — cf. A2.3.8) preserves equalizers, it is conservative by 
A1.2.4. 

(i) First we observe that we may reduce to the case A = 1: for we have a 
pullback 


F/F A — F 


f/A f 


a ERE 


in Top, where f/A is atomic by A2.3.7, and thus g induces a morphism €/A > 
F/f*A over E/A, whose openness implies that of g itself by 3.2.23. 

Let B be an object of F, and suppose we are given a subobject C — g*B. 
As in the proof of (ii), it is easy to prove that for any B’ — B the naturality 
square 


B as oe eu 


BS iB 


is a pullback (by considering the classifying maps of the two vertical morphisms), 
although its horizontal edges are no longer necessarily epic. Applying g* to this 
square, we obtain a pullback 


g* B’ —— > fiB’ : 


GB iB 


so if we take B’ — B to be the unique subobject such that f,B’ is the image of 
the composite C — g*B — fiB, we obtain the unique smallest subobject of B 
such that C’ < g* B’. 

Now by 3.5.5(iv) we have a connected atomic morphism F/B’ — €/fiB". 
Also, the composite €/C — €/g*B' — F/B’ is a morphism over €/f,B’; and 
since C — fı B’ is epic by the definition of B’, it follows from (ii) that this latter 
composite is a surjection. So the image of €/C — E/g* B — F/B (in the sense of 
the surjection—inclusion factorization) is F/B’; thus we have verified condition 
(vi) of 3.1.7 for g. 0 
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Remark 3.5.7 Using the descent theorem for open surjections, we shall see in 
5.2.12 below that if f: E — S is connected atomic and has a section, then € may 
be identified with the topos of continuous G-sets for a suitable localic group G in 
S. In particular, it is bounded over S. That is, a connected atomic morphism with 
a section is necessarily bounded. The assumption of having a section is essential 
here; recall that two of our three prime examples of unbounded toposes over Set 
(cf. B3.1.14) are the topos Unif(G) where G is a topological group having no 
smallest open subgroup as in A2.1.7, and the topos [G, Set] where G is a ‘large’ 
group as in A2.1.4. Both of these are easily seen to be connected and atomic, 
but neither has a point. (For both toposes, the forgetful functor to Set preserves 
finite limits and all colimits which exist, but fails to have a right adjoint — in 
one case because the topos fails to be cocomplete, and in the other because the 
solution-set condition fails.) 


Our next task is, as usual, to give a site characterization of bounded atomic 
morphisms. Since atomic morphisms are locally connected, we know that for 
any bounded atomic morphism p: E — S we can choose a site of definition for 
E consisting of objects A which are connected, i.e. satisfy pi(A) = 1. In the case 
when p is atomic, connected objects of € are generally called atoms; by 3.5.5(iii), 
an atom A has the property that p,(2) S Qs, and in particular that the map 
Subs(1) — Sube(A) sending U to A x p*U is bijective. (The name ‘atom’ is 
really appropriate only in the classical case S = Set, when this condition says 
that A has no subobjects other than itself and 0 (and these two are distinct), 
but, as we remarked earlier, it seems hard to avoid using it in the more general 
context.) 

We note that any morphism between atoms is an epimorphism in £; for if 
f: A— B had image B x p*U, then we should have A & A x p*U. In particular, 
therefore, if C is any internal full subcategory of E (over S) on a generating 
set of atoms, each morphism of C generates a covering sieve in the induced 
coverage on C. We note also that C satisfies the right Ore condition (so that, by 
A2.1.11(h), these sieves do indeed form a coverage): if f: A — C and g: B => C 
are two morphisms of C with common codomain, then the pullback A xc B in 
€ is nonzero, and so admits a morphism from some object of C. 

In 2.1.12(c) we considered two further conditions on a site (C,T) as above, 
designed to ensure that T coincides with the canonical coverage on C: namely, 
that every morphism of C is an effective epimorphism (i.e., if g: A — C coequal- 
izes every pair of morphisms coequalized by f: A — B, then g factors uniquely 
through f) and that C does not have a strict initial object. The first of these is 
true in general (since, for anyf: A — B, the family of morphisms from objects 
of C to the kernel-pair A xg A is epimorphic); but the second is special to the 
case S = Set (and even in that case it fails when the only atom is the terminal 
object 1, i.e. when € = Set). Also, the first condition is not stable under change 
of base; so we do not include it in our definition of an ‘atomic site’, even though 
it was part of Barr’s original definition [81]. 
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One further feature of the case S = Set deserves mention. In a bounded 
atomic Set-topos there are, up to isomorphism, only a set of atoms, since if 
{G; |i € I} is any generating set then each atom must be a surjective image of a 
single G;. Conversely, any epimorphic image of an atom is an atom; so, by taking 
an arbitrary generating subcategory of atoms and closing it under quotients in 
E, we may assume that our site contains (up to isomorphism) all the atoms of 
€. However, this property is not needed for our characterization. 


Theorem 3.5.8 For a bounded geometric morphism p: E —> S, the following 
are equivalent: 


(i) p is atomic. 
(ii) There exists an internal site (C,T) for E over S such that C satisfies the 
right Ore condition, and T consists either of all inhabited sieves on objects 
of C or of all sieves generated by single morphisms of C. 
(iii) There exists a site (C,T) as in (ii), but with the additional property that 
each morphism of C is an effective epimorphism. 


Proof (i) = (iii) is proved in the discussion above, and (iii) = (ii) is trivial. 

For (ii) => (i), we note first that a site as in (ii) is locally connected, as we 
saw in 3.3.11(b); so we already know that p* is a cartesian closed functor, and 
we need only show that it preserves Q. For this, the argument is similar to (but 
easier than) that of 3.1.19(i): recall from the proof of 3.3.10 that p* simply sends 
Qs to the constant diagram on C°P with value Qs, so we have only to verify that, 
for each object U of C, the object Svr(U) of T-closed sieves on U, as described 
in the proof of 3.1.19(i), is isomorphic to . But this is easy, since a T-closed 
sieve R is entirely determined by the truth-value of the assertion ‘R is inhabited’: 
since every inhabited sieve is T-covering (or at least contains a T-covering sieve), 
to the extent that R is inhabited it must contain all morphisms with codomain 
U. (Also, because all T-covering sieves are inhabited, the sieves 


{a: Ci | dola) =U Ap} 
(where p: Qs) are all T-closed.) o 


We note that 3.5.8 has a straightforward strengthening, similar to those for 
our site characterizations of openness and local connectedness: p is connected 
and atomic iff the site in (ii) or (iii) may be chosen to have a terminal object. 

Before going on to prove the pullback-stability and inverse limit theorems, we 
pause to discuss some important examples of atomic sites in the case S = Set. 


Examples 3.5.9 (a) Any group G (more generally, any small groupoid) is an 
atomic site when equipped with the coverage in which only the maximal sieves 
cover (as we noted in A2.1.11(¿), these are the only inhabited sieves in this case). 
So [G, Set] is an atomic topos (as we already knew from A2.1.4). The atoms of 
[G, Set] are the transitive G-sets (equivalently, the sets G : H of left cosets of 
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arbitrary subgroups H of G); so we may obtain an alternative site of definition 
for [G, Set] by taking the full subcategory of all such G-sets, with the coverage 
in which every morphism generates a covering sieve. (However, we note that in 
this case the coverage is rigid in the sense of 2.2.18, since there is a largest atom, 
namely G itself; so we could alternatively use the Comparison Lemma 2.2.3 to 
derive the equivalence of the topos of sheaves on this site with [G, Set].) 

(b) For a topological group G, the atoms of Cont(G) are again the transitive 
continuous G-sets, i.e. the G-sets G : H where H is an open subgroup. So we 
may take these as the objects of our atomic site; equivalently, as we observed in 
2.2.4(c), we may take the objects of the site to be the open subgroups themselves, 
with morphisms H — K labelled by those cosets gK of K for which H C gKg™! 
and the composite of gk: H — K and hL: K — L defined to be ghL. 

(c) As we remarked in 2.1.12(c), the category Set fe of finite sets and surjec- 
tions, and the dual of the category Sets, of finite sets and injections, are both 
atomic sites when equipped with the coverage consisting of all sieves generated 
by single morphisms. (In fact they are both special cases of the previous example, 
for suitable topological groups G; we prove this explicitly for Set, in D3.4.10, 
and for Setje in D3.4.12. We recall also that in A2.1.11(h) we showed that the 
sheaves for the atomic coverage on Seti, are exactly the pullback-preserving 
functors; so we can now conclude that the category of pullback-preserving functors 
Set sm — Set is an atomic topos. 

(d) A similar argument shows that if Ordfm denotes the category of finite 
totally ordered sets and order-preserving injections, then the pullback-preserving 
functors Ords, — Set are exactly the sheaves for the atomic coverage on 
Ordy in» so they form an atomic topos. Once again, we shall meet this topos 
again in Section D3.4, where we shall see that it too can be identified with a 
topos of continuous G-sets. For more information about this topos, see [529]. 


7 


From 3.5.8, we may obtain a characterization of connected atomic mor- 
phisms in terms of fibrations of sites. In this context, we shall say that a 
functor F: C — D between the underlying categories of two sites creates cov- 
ers ìf, given any sieve R on an object U of C, R is covering iff the sieve generated 
by {Fa | a € R} covers FU in D; note that this implies that F both preserves 
and reflects covers as defined in Section C2.3, but it is stronger than either of 
these conditions. 


Proposition 3.5.10 Let f: F — E be a geometric morphism between bounded 
S-toposes. Then f is connected and atomic iff it can be defined by a fibration of 
sites (P, T): (D, K) — (C, J) in S such that P creates covers. 


Proof First suppose f is connected and atomic. Choose any site (C, J) for 
E over S; let (B, H) be an atomic site (with a terminal object) for F over E, 
and externalize it as in 2.5.7 to obtain a continuous fibration. The assertion that 
the H-covering sieves are exactly the inhabited ones means that we can simplify 
the description of the coverage K = J x H which we obtain on D = C x B: a 
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sieve S on (U,V) is covering iff the set of all a: U’ — U for which there exists 
some b with (a,b): (U’,V’) — (U,V) € S is J-covering. But this is exactly the 
statement that P creates covers. 

Conversely, suppose P creates covers. We claim first that (P,T) satisfies 
the condition of 3.3.12; in fact, if we have a K-covering sieve S on V and a 
commutative square 


ag 
U —> Pvp 


a Pbz 


Pb, 
PV, ———> PV 


with b;,b2 € S, then U can be J-covered by morphisms c: PV3 — U for which 
the composites a;c (i = 1,2) are equal to Pd; for some morphisms d; making the 
square 


V2 Sy 


dı bo 


b 
Sy 


commute in D. To prove this, choose prone liftings of a), az and the diagonal of 
the first square above, to obtain a diagram 


U; ———> U s——_- 0, 
by bo 


VY. ———> V <— 2 


where e; and eg are vertical (that is, their images under P are the identity on 
U). Since P creates covers, each of e and e2 generates a K-covering sieve; hence, 
by the remark after 2.1.8, the intersection of these two sieves (i.e. the sieve of all 
morphisms which factor through both e, and e2) is also K-covering. The image 
of this sieve under P is the cover of U which we seek. 

We therefore know that f is connected and locally connected, so it remains 
to show that f* preserves Q. But we may do this by direct calculation, as in the 
proof of 3.1.21: we have to show that the mapping dy : Ne(PV) — Q¢(V), which 
sends a J-closed sieve R on PV to the K-closed sieve {b: V’ > V | Pb € R}, 
is bijective. But it is easy to see that a K-closed sieve S is generated by the 
prone morphisms which it contains, since (as we already observed) every vertical 
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morphism in D must generate a K-covering sieve, and hence we may uniquely 
recover R from S, as the sieve of all morphisms U’ — PV whose prone liftings 
lie in S. m 


Corollary 3.5.11 Let 


fs fo fi 


Eg ————> E ——— > Eo 


be an inverse sequence in BLop/S, where each f; is connected and atomic. Then 
the legs f} : Exo —> En of the limit cone in BYop/S are connected and atomic. 


Proof As usual, we represent the fn by an inverse sequence of fibrations 
of sites satisfying the condition of 3.5.10. We must show that the projections 
P% : Coo —> Cp (in the notation of 2.5.11) create covers; but we already know that 
they preserve them, by 3.1.22. Suppose given a sieve S on an object (Um | m > 0) 
of Cæ, whose image under P% generates a J,-covering sieve. Then, for any 
n’ > n, its image under pr must generate a Jw-covering sieve, since P”, creates 
covers; and we can hoo n’ so large that Uj+ı © Tj+ıU; for all j > n’, i.e. so 
that (Um | m > 0) S T (Un ). But now TE preserves covers; so on applying it 
to our covering sieve on Uw we get a covering sieve generated by the pr -prone 
parts of all the morphisms in S. Hence, to prove that S is covering, it r panics 
by the local character axiom (L) to verify that the sieve generated by a single 
pr -vertical morphism is covering. But this follows by a similar argument: the 
marp hisi in question is in the image of Te for some n” > n’, and the morphism 
in Cy” which maps to it is iss ‘vertical: and hence generates a covering sieve 
there. D 


We are now ready to prove the pullback-stability theorem for atomic mor- 
phisms. As usual, it has two forms, depending on which side of the pullback 


square is assumed to be bounded. 
Theorem 3.5.12 Let 

H G 
F E E 


= 


be a pullback square in Top, where f is atomic and either f or g is bounded. 
Then k is atomic. 


Proof (i) In the case when f is bounded, we may represent F as She(C,T), 
where (C,T) is a site as in 3.5.8(ii), and then H ~ Shg(g*C,g#T). Now g*C 
clearly inherits the right Ore condition from C, since this may be expressed 
by a regular sequent (it is part of the assertion that C°P is weakly filtered, 
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cf. B2.6.2). The condition that T consists of all inhabited sieves is not in general 
preserved under the passage to g#T, since the comparison map g*(P+C,) — 
P*(g*C,) need not be epic; but the condition that T consists precisely of the 
sieves generated by single morphisms of C clearly is preserved. So the result is 
immediate from 3.5.8. 

(ii) In the case when g is bounded, we as usual consider the two sub-cases 
when g has the form [C,€] — E and when it is an inclusion. For the first of 
these, we recall from the proof of 3.1.24 that it is possible to give an explicit 
construction in terms of E of the power object of an internal diagram F on C; 
this construction uses only structure preserved by f*, and so it is itself preserved 
by [C, f]*, i.e. the latter is logical. For the case when g is the inclusion of a 
subtopos sh;(€) — E, we note that by 3.1.25 we have a pullback 


f*(Q5) — 9y 


where 7’ is the pullback local operator on F; but ¢1 is an isomorphism in this 
case, and so the top edge of the pullback square is also an isomorphism, i.e. k* 
(which is the restriction of f* to sheaves, by the second proof of 3.3.15) preserves 
Q. But it is also a cartesian closed functor by 3.3.15(ii), and hence logical. O 


Remark 3.5.13 An alternative proof of the inclusion case of 3.5.12(ii) may 
be given using the criterion of 3.5.5(iv). We know that the pullback k is locally 
connected and hence essential, by 3.3.15; now, given any object B of H, we set 
A =h,B and consider the cube 


HIB SOB 


G 

F/A —— | —— EJ fiA g 
h 

E 


| 
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Here the left vertical face is a pullback since h is an inclusion and so h*A = B, 
and the right vertical face is a pullback since the Beck—Chevalley condition for 
locally connected morphisms yields g* fA & kıh* A & kB. Since the front face is 
a pullback by assumption, the back face is too; hence from the fact that F/A — 
€/fiA is hyperconnected we may deduce that H/B — G/k:B is hyperconnected. 
So k satisfies the condition of 3.5.5(iv). 


We may now deduce another characterization of (bounded) atomic mor- 
phisms, paralleling the characterization of discrete locales which we gave 
in 3.1.15. 


Corollary 3.5.14 A bounded geometric morphism f: F — E is atomic iff both 
f and the diagonal morphism F — F xe F are open. 


Proof First suppose f is atomic. Then f is certainly open, and the projections 
F xe F 3 F are both atomic by 3.5.12. But the diagonal is a section of either 
of these, so it is open by 3.5.6. 

For the converse, we note first that the property of having open diagonal 
is stable under composition of geometric morphisms, and is inherited by both 
halves of the hyperconnected-—localic factorization, by arguments exactly like 
those which we gave in the ‘proper’ case in 3.2.25. Hence in particular, for any 
object B of F, the localic part of the composite F/B — F — E must be open 
with open diagonal; so by 3.1.15 it corresponds to a discrete internal locale in €. 
So we have verified the condition of 3.5.5(iii), and f is atomic. o 


Suggestions for further reading: Barr & Diaconescu [81], Johnstone [541], 
Joyal & Tierney [560]. 


C3.6 Local maps 


In Section C3.3, we studied those geometric morphisms f: F — E for which f* 
has an €-indexed left adjoint; in the present section our concern is with those f 
for which f, has an €-indexed right adjoint. At this point, the ‘duality’ between 
the odd- and even-numbered sections of this chapter finally breaks down: by the 
characterization of bounded atomic morphisms in 3.5.14, their ‘proper’ dual is 
the class of proper separated maps, which neither contains nor is contained in 
the class of local maps which we study in this section. In fact, as we shall see, the 
‘open’ dual of the notion of local map is a different strengthening of the notion of 
locally connected morphism, which we shall characterize in 3.6.16 below under 
the name ‘totally connected morphism’. 

Of course, f, has a right adjoint iff it is the inverse image of a geometric 
morphism c: E — F, iff f has a left adjoint c in the 2-category Yop. But this 
condition alone will not ensure that the adjunction (c* 4 cą) is indexed over € 
(though it will, of course, be indexed over F}; so our first concern is to determine 
conditions under which this happens. 
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Theorem 3.6.1 For a geometric morphism f:F — €E, the following are 
equivalent: 
(i) fa has an E-indezed right adjoint. 


(ii) f has a left adjoint when regarded as a morphism f — 1e in the 2-category 
Yop /E. 


(iii) f is connected and has a left adjoint in Yop. 

(iv) fx has a right adjoint which is full and faithful. 

(v) fx has a right adjoint which is a cartesian closed functor. 
) 


There exists a geometric morphism c: E — F such that fc = le and, for 
every geometric morphism g: G — E, the composite cg is an initial object 
of Lop/E (g, f). 

(vii) There exists a geometric morphism c: E — F such that fc = le and there 
exists a geometric transformation a: cf —> lF over E. 


(vi 


Proof (i) => (ii): If c, is the right adjoint of f,, then (as observed above) it is 
the direct image of a geometric morphism, which is left adjoint to f in Top. But 
since the adjunction is €-indexed, for any object A of E we have 


c* f(A) = ce f*(ZaA*(1)) 
Da((c*)4(f*)4.A*(1)) since c* and f* preserve 


€-indexed coproducts 


IIe 


A*(f*c*(1)) since f* and c* are €-indexed functors 
2y4A*(1) SA. 


So fe = le, ie. c is a morphism lg — f in Top/E; and it is still left adjoint to 
f in this 2-category, since the unit and counit of the adjunction (f* 4 c*) are 
€-indexed natural transformations (cf. B3.1.5). 

(ii) > (i): Since Top/E is fully embedded in the 2-category Tope of E-indexed 
toposes by B3.1.5, this is immediate. 

(ii) = (iii): Condition (ii) says that fẹ has a right adjoint and that the 
composite f,f* is isomorphic to the identity functor on E£; condition (iii) says 
additionally that the isomorphism le = f,f* is the unit of the adjunction 
(f* 4 f.). But, by Al.1.1, this extra condition is automatically satisfied when 
(ii) holds. 

(iii) (iv): Given adjunctions (f* 4 f, 4 f*), it is well known that f* is full 
and faithful iff f# is, since the composite f,f* is left adjoint to f, f*. 

(iv) = (v) is immediate from A4.2.9. 

(ii) > (vi): Suppose (ii) holds; let c: € — F denote the left adjoint of f. Since 
c is a morphism of Yop/E€, we certainly have fc = le. Now let g: G — E be an 
€-topos, and h: G — F an arbitrary geometric morphism over €. Then 2-cells 
cg — h in Top/E correspond to €-indexed natural transformations g*c* — h*. 
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by B3.1.5, and these in turn correspond to €-indexed natural transformations 
fe =c* — g,h*. But since f, is the €-indexed representable functor represented 
by ly, these in turn correspond to morphisms le — g,h*(1¢) = le, by the 
€-indexed Yoneda lemma. So there is a unique such 2-cell. 

(vi) = (vii) is immediate since (vi) includes the statement that cf is initial 
in Top/E (F, F). 

(vii) = (ii): Since a is a 2-cell in Yop/E, the isomorphism le — fe and 
a satisfy one of the two triangular identities for an adjunction in Yop/E€; in 
the terminology of B1.1.8, c is weak left adjoint to f in this 2-category. But 
idempotent 2-cells split in Top, by A4.1.15, and an examination of that proof 
will show that they also split in Top/E€ for any £; so by B1.1.8 f has a left adjoint 
in Top/E. oO 


Remark 3.6.2 In the case when f is bounded, we may add a further equivalent 
condition to 3.6.1, namely 


(viii) f. preserves arbitrary E-indezed colimits. 


For this condition is always implied by (i), by B1.4.14; and the converse holds 
if F has an €-indexed separating family, by B2.4.6. 


We say that f is a local geometric morphism, or that F is a local €-topos, if it 
satisfies the equivalent conditions of 3.6.1. (The name could perhaps have been 
better chosen, given the slightly awkward clash with ‘localic morphism’; but 
both terms are too well established to be easily abandoned.) In the particular 
case when € is Set (or whatever base topos we happen to be working over at 
the moment), we simply say F is a local topos. The geometric morphism c of 
3.6.1(vi) is often called the centre of the local morphism f, for reasons which 
will become clear shortly. 


Examples 3.6.3 (a) As we saw in 1.5.6, if X is a locale then Sh(X) is local 
(that is, the unique morphism Sh(X) — Set is local) iff X has a focal point, i.e. 
a point whose only neighbourhood is the whole space. More generally, suppose 
f: X —Y is a surjective continuous map of (sober) spaces such that, for each 
point y of Y, the fibre f~'(y) has a focal point c(y), and that the assignment 
y +» c(y) is continuous as a function Y — X. Then the continuous map c is a 
left adjoint right inverse for f in the 2-category Loc (ie. fe = ly and cf < 1x), 
and since the assignment X ++ Sh(X) is 2-functorial by 1.4.5, we deduce that 
the geometric morphism Sh(X) — Sh(Y) induced by f is local. For example, 
if L is a normal distributive lattice as defined in [520, II 3.6], and we take X 
and Y to be the spaces of prime and maximal ideals of L respectively, then the 
inclusion Y — X has a (continuous) left inverse r: X — Y sending each prime 
ideal to the unique maximal ideal which contains it, and the induced geometric 
morphism Sh( X) — Sh(Y) is local. 

(b) Similarly, if f:C — D is any functor between small categories which is 
a coreflector (i.e. has a left adjoint right inverse), then the induced morphism 
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[C, Set] — [D, Set] is local by A4.1.4. (Indeed, there is a converse result, by 
A4.1.5: if C is Cauchy-complete and the morphism induced by f is local, then f 
is a coreflector, since the left adjoint in Yop is automatically essential.) In partic- 
ular, if C is a small category with an initial object, then the geometric morphism 
[C, Set] — Set is local (and the converse holds provided C is Cauchy-complete). 

(c) It follows from the preceding sentence that if T is any cartesian theory, 
then the classifying topos Set[T] is local (over Set); for D3.1.1 shows that Set[T] 
may be identified with [(Cr)°?, Set], and the syntactic category Cr has a terminal 
object by D1.4.2. More generally, for any geometric theory T, a term model of T 
is defined to be a model M (in Set) such that, for each sort A of the signature of 
T, each element of MA is the interpretation in M of some closed term of sort A, 
and two such interpretations sm and tm are equal iff the sequent (T Fy) (s = t)) 
is provable in T. It is easy to see that if M is a term model of T, then p*M is 
initial in T-Mod(€) for any Set-topos p: E — Set, and hence by 3.6.1(vi) that 
the classifying topos Set[T] is local. (For a cartesian theory T, the term model 
is the free model generated by the empty set of generators.) 

(d) Generalizing the last assertion of (b) in a different direction, let (C,T) 
be a small site such that C has a terminal object 1 which is T-irreducible in the 
sense of 2.2.18(a) (i.e. its only T-covering sieve is the maximal sieve Mı). (Such a 
site is often called a local site.) Then we claim that Sh(C, T) is a local topos. For 
the direct image functor Sh(C, T) — Set is given by evaluation at the terminal 
object; we know that this has a right adjoint as a functor [C°?, Set] — Set, 
namely right Kan extension along the functor 1 — C which picks out the terminal 
object of C; and the irreducibility of the latter ensures that this functor takes 
values in Sh(C, T). (Explicitly, its value at a set A is the functor U ++ AC (V), 
which is a sheaf because C (1, —) maps covers to epimorphic families; cf. 2.4.10.) 
Conversely, suppose € is any bounded local Set-topos, and let C be a small full 
subcategory of € which contains a generating set of objects as well as the terminal 
object 1. Since the direct image functor €(1,—): E€ — Set is also an inverse 
image functor, it preserves epimorphic families; that is, if (A; — 1 | ż¿ € J) is an 
epimorphic family in €, then at least one A; — 1 must be a split epimorphism. It 
follows easily that 1 is irreducible for the coverage on C induced by the canonical 
coverage on €. Thus we conclude that a Grothendieck topos is local (over Set) 
iff it can be generated by a local site. The foregoing arguments are constructive, 
and so can be interpreted in the internal logic of an arbitrary topos S to give a 
characterization of bounded local morphisms with codomain S. 

(e) If E is the topos of sheaves on a small full subcategory C of Sp as in 
A2.1.11(d), then we saw in 2.3.23 that, for any object X of C, there is a local 
geometric morphism €/I(X) — Sh(X); and if X is a smooth manifold (resp. an 
affine scheme of finite type over a field k), then there is a similar local morphism 
to Sh(X) from the appropriate slice of the topos of A2.1.11(e) (resp. (f)). 

(f) We recall from A2.1.12 that the Sierpiński cone on a locally small topos € 
is the topos obtained by glueing along the functor E (1, —): E — Set. More gen- 
erally, given a geometric morphism p: E — S, we define the Sierpiriski cone on € 
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over S to be the topos obtained by glueing along p,: E — S; we shall denote it 
by scns(€). By B3.4.2, it may be viewed as the cocomma object (pT 1¢) in Top; 
that is, geometric morphisms scns(€) — F correspond to triples (f,g,a) where 
f:E —F,g9:S — F and a: pg — f is a geometric transformation. In particu- 
lar we have a morphism q: scns(€) — S corresponding to the triple (p, 1s, 1p); 
we claim that this is always local. For its inverse image is easily seen to be 
given by q*(I) = (p*I,I,nr) where 7 is the unit of (p* 4 p.); hence morphisms 
q*(I) — (A,J, f) are pairs (u,v) where u: p*(I) > A and v: 1 — J make the 
square 


ni . 
I ——> pp (I) 
v px(u) 


J ——> p,(A) 


commute. But this is equivalent to saying that the transpose of u is the composite 
fv, so the pair (u,v) is uniquely determined by v, and we see that the right 
adjoint q, of q* coincides with the projection scns(€) — S, i.e. the inverse 
image of the closed inclusion c: S — sengs(€) (cf. A4.5.6). We note also that the 
composite of q with the open inclusion d: E — sens (£E) is our original morphism 
p; thus we have shown that every geometric morphism may be factored as an 
open inclusion followed by a local morphism. 


Following on from the last example above, we note 
Lemma 3.6.4 For an S-topos p: E — S, the following are equivalent: 


(i) p is local. 


(ii) E is a retract of a local S-topos. 


(iv) E€ is an adjoint retract (cf. B1.1.9(c)) of sens(F) for some S-topos F. 
(v 


Proof (i) > (v): Suppose £ is a local S-topos with centre c. Then the mor- 
phisms 1¢ and c, together with the geometric transformation cp —> 1e of 3.6.1(vi), 
induce a morphism scns(&) — E over S, which is a splitting for the open inclu- 
sion. Moreover, it is left adjoint to d: since c* & p,, its inverse image may be 
identified with the functor A > (A,p,.A,1,,4), which is the direct image of d. 
(v) => (iv) is clear, since any left adjoint for an inclusion is necessarily a one- 
sided inverse for it (up to isomorphism). (iv) => (iii) is trivial; and (iii) = (ii) is 
immediate from 3.6.3(f). Finally, (ii) = (i) follows easily from B1.1.10 (and the 
splittability of idempotent 2-cells in Top/S, cf. A4.1.15). O 


) 
) 
(iii) E is a retract of sens(F) for some S-topos F. 
) 
) 


The open inclusion d: E — sens (E) has a left adjoint. 
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Remark 3.6.5 By applying the argument in the proof of 3.6.4(i) > (v) to local 
S-toposes of the form scns(F¥), we obtain the multiplication of a KZ-monad 
structure on the functor scns(—): Yop/S — Yop/S, whose unit is the open 
inclusion. Lemma 3.6.4, in conjunction with B1.1.14, now tells us that the alge- 
bras for this monad are exactly the local S-toposes (and morphisms of algebras 
are morphisms between local S-toposes which commute up to isomorphism with 
their centres). 

If the ideas in the preceding paragraph seem reminiscent of the notion of 
hyperlocal topos which we studied in Section B4.4, this is no accident. Of course, 
it follows immediately from B4.4.19(vi) and 3.6.1(ii) that hyperlocal S-toposes 
are local. But it is also the case that scns(—) may, like the lower bagdomain 
functor Bz, be regarded as a partial product functor. We note that, for any 
S-topos F, a morphism F — scng(€) over S corresponds, via pullback along 
the open inclusion d, to a pair (U,v) where U is a subterminal object of F 
and v is a morphism F/U — € over S; in other words, it is just a ‘bag of 
T-models’, where T is the geometric theory classified by €, whose indexing 
object is subterminal. (Equivalently again, it corresponds to a pre-geometric 
morphism (A4.1.13) F — E over S whose ‘inverse image’ functor preserves binary 
products as well as pullbacks.) From this description, it is easy to see that we 
have scns(—) ~ Pe(—, dı), where dı: S —> [2, S] ~ scng(S) is the open point of 
the Sierpinski topos over S (the classifying topos for the theory of subterminal 
objects, cf. B3.2.11). 


From another of the examples in 3.6.3, we obtain 


Scholium 3.6.6 A geometric morphism f: F — E between bounded S-toposes 
is local iff it can be induced by a fibration of sites (P,T): (D, K) > (C, J) in S 
such that T reflects (as well as preserves) covers. 


Proof If F is local, we may choose a local site (B, T) for it in E£, by 3.6.3(d). 
If we then express € as Shs(C, J) and externalize the site (B,T) to obtain a 
fibration of toposes as in 2.5.7, the condition that the terminal object of B is 
T-irreducible easily translates into the assertion that T reflects covers. 

Conversely, if f is induced by a fibration of sites as in the statement, then 
since T is both cover-reflecting and a morphism of sites it induces an adjoint 
pair of geometric morphisms as in 2.3.23; also, since T is full and faithful, the 
left adjoint c: E — F is an inclusion (equivalently, f is connected). So f is local 
by 3.6.1(iii). 


Next, we investigate the stability properties of local maps. 


Lemma 3.6.7 
(i) Local maps are tidy. 


(ii) A composite of local maps is local. 
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(iii) In a commutative triangle 


E 


of geometric morphisms, if h is local and g is connected, then f is local. 
Moreover, if c is the centre of h, then gc is the centre of f. 


(iv) Let 


jsa 


f 


F — > E 


be a pullback square in Lop with f local. Then k is local. Moreover, if c 
is the centre of f, then the centre of k is the morphism into the pullback 
induced by cg and lg. 
(v) Let 
fs f fi 
—> Ez —> E —> Eo 

be an inverse sequence of toposes and local morphisms having a (Cat- 
enriched) limit in Yop (or in BYop/S for some S). Then the legs 
fZ: Eœ > En of the limit cone are local. 


Proof (i) is immediate from 3.6.2, and (ii) is immediate from either (iii) or 
(iv) of 3.6.1. 

(iii): Since the unit of (g* 4 g,) is an isomorphism, we have f,(B) = 
f.9%9*(B) S h.g*(B) & c*g* (B) for any object B of F. So f, is indeed an inverse 
image functor; and f is connected, since we have f, f* & c*g*f* =c*h* S 1g. 

(iv): For this we use 3.6.1(vi). Let d: G + H be the morphism induced by 
cg and 1g. Then, for any l: L — G, composition with h induces an equivalence 
between Yop/G (L, k) and Top/E (gl, f); so from the fact that cgl S hdl is initial 
in the latter we deduce that dl is initial in the former. 

(v) Let cn: En—1 >En denote the centre of fn, and, for n < m, let 
Cm: En — Em denote the composite of the c} with n +1 < j < m. Since the 
f, are all connected, it is easy to verify that for each n we have a (pseudo-)cone 
over the diagram with vertex En, whose mth component is c}, if m > n, and 
fr" (the corresponding composite of the f,) if m < n. These induce a mor- 


phism c2: En —> Ew; it is clear that fZ c2, is isomorphic to the identity, and 
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using the fact that £ is a Cat-enriched limit it is easy to construct a geometric 
transformation from c2 f% to the identity, which is the counit of an adjunction 
(ct, 4 f%). o 
Remarks 3.6.8 (a) The hypothesis ‘g is connected’ in 3.6.7(iii) cannot be 
weakened to ‘g is surjective’, even if we add the further hypothesis that g is open 
(or even atomic). Let C be the finite category with three objects U, V, W and two 
non-identity morphisms U — V — W, and let D be the quotient of C obtained 
by identifying the objects U and W (but not the two non-identity morphisms). 
The quotient functor C — D is surjective on objects and a discrete fibration, 
so the induced geometric morphism [C°?, Set] — [D°P, Set] is a surjective local 
homeomorphism. However, [C°?, Set] is local over Set, since C has a terminal 
object; but [D°?, Set] is not local, since D is Cauchy-complete but does not have 
a terminal object. (In fact the direct image functor [D°P, Set] — Set sends a 
diagram of shape DP (that is, a parallel pair) to its equalizer, and it is easily 
seen that this functor fails to preserve epimorphisms. ) 

(b) Similarly, 3.6.7(iii) does not have a ‘dual’: even if we assume that both f 
and h are local in the diagram of 3.6.7(iii), and further that g has a right inverse, 
there is still no reason why g should be local. To see this, we use 3.6.3(c): we 
take E to be Set, and F and G to be the classifying toposes for the theories 
of abelian groups and of rings (not necessarily with 1), respectively. Then both 
F and G are local over E€, and we have a morphism g: G — F classifying the 
underlying additive group of the generic ring, which has a right inverse obtained 
by equipping the generic abelian group with the zero multiplication. However, 
g is not connected, and therefore not local; it does have a left adjoint in Yop, 
corresponding to the free ring generated by the generic abelian group, but the 
unit of the adjunction is not an isomorphism. 

(c) It will be noted that, for once, we have been able to prove the pullback- 
stability theorem (3.6.7(iv)) without worrying about which of the maps in the 
pullback is bounded. However, if in the situation of 3.6.7(iv) we happen to know 
that f is bounded as well as local, we can give an alternative proof of the result 
using the site characterization of bounded morphisms given in 3.6.3(d), since (as 
we observed in the proof of 3.2.21) the assertion that a particular object of a 
site is irreducible is stable under change of base. Note also that the assertion 
in 3.6.7(iv) that the centre of a local morphism is ‘stable under pullback’ easily 
yields the fact that local morphisms are SRBC morphisms in the sense of 2.4.16 
(which we knew in any case, since local morphisms are tidy). 

(d) Similarly, in proving the stability of local maps under inverse limits, we 
did not need to use our characterization of local maps in terms of fibrations of 
sites (3.6.6), although it could no doubt be derived using the latter. We note 
also that the argument of 3.6.7(v) works for limits over more general cofiltered 
categories, although we shall also be able to derive the latter fact by way of 
the descent theorem for open surjections (see 5.1.13 below), as we do for other 
properties of geometric morphisms. 
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Next, we note a useful ‘orthogonality property’ of local morphisms. We say an 
S-topos p: E — S (or the geometric morphism p) is grouplike if, for any S-topos 
F, the category Yop/S (F, E) is a groupoid. If £ is bounded over S, this says that 
it classifies a geometric theory T such that every homomorphism of T-models (in 
arbitrary S-toposes) is an isomorphism. 


Examples 3.6.9 (a) Let G be an internal group in a topos S. In B3.2.4(b) we 
gave a description of the notion of G-torsor, from which it follows that [G, S] 
is a grouplike S-topos. The same is true of [G, S] for any internal groupoid G 
in S. (However, the result does not extend to Cont(G) for a topological group 
G: for example, if G is the group of permutations of N, with the topology of 
pointwise convergence, then Cont(G) classifies the theory of infinite decidable 
objects (D3.4.10), and so any injection N — N which is not surjective defines a 
non-invertible endomorphism of a point of Cont(G).) 

(b) If X is a Ty-locale in a topos S (cf. 1.2.17), then Shs(X) is a grouplike 
S-topos, since geometric morphisms € — Shs(X) over S correspond to locale 


maps Y — X, where Y is (the locale in S corresponding to) the localic reflection 
of E. 


Proposition 3.6.10 Local morphisms are orthogonal (in the 2-categorical 
sense) to grouplike morphisms: that is, if p: E —> S is a grouplike S-topos and 
f:G — F is a local morphism of S-toposes, then the functor 


Vop/S (F,E) —> Top/S (G,E) 


induced by composition with f is part of an equivalence of categories. 


Proof The displayed functor has a right adjoint, induced by composition with 
the centre of f. But any adjunction between groupoids is an equivalence. O 


As an example of the applications of 3.6.10, we give 


Corollary 3.6.11 Let f: F —E be a local geometric morphism over some 
base topos S with a natural number object. Then f, preserves the object Ra 
of Dedekind real numbers (cf. D4.7.5). 


Proof Let A be an object of €. By definition, morphisms A — Ra in £E 
correspond to Dedekind real numbers in €/A, and hence to geometric morphisms 
E/A — Shs(Ry) over S, where Ry is the locale of ‘formal real numbers’ in S 
(cf. D4.7.4). But Ry is regular and hence Ty by 1.2.17, and F/f*A > E/A is 
local by 3.6.7(iv), so these correspond to morphisms F/f*A — Shs(Rp) over 
S, and hence to morphisms f*A — Ry in F. Thus f, Ra represents the same 
functor as the Dedekind real number object in £. oO 


We turn now to the notion of localization. Given a category C and an object 
U of C, we have a universal way of ‘making U into an initial object of C’: namely, 
forming the co-slice category U\C. (Formally, the forgetful functor U\C — C is 


704 CLASSES OF GEOMETRIC MORPHISMS C3.6 


universal amongst functors from categories with initial objects to C, which send 
the initial object to U; cf. A1.2.9.) In the same vein, given a space X and a point 
x of X, there is a largest subspace of X which contains z as a focal point, namely 
the subspace of all points y such that x < y in the specialization ordering. These 
observations inspire the proof of the following result: 


Theorem 3.6.12 Given a bounded S-topos p: E —> S and a particular sec- 
tion s: S — E of p, there exists a bounded local S-topos p,: Es > S (called the 
localization of E at s) equipped with a geometric morphism Es — E over S 
which carries the centre of E, into the chosen section s, and which induces an 
equivalence 


BLop/S (F, Es) ~ sq\(BLop/S (F, E)) 
for any bounded S-topos (q: F > S). 


Proof We need to recall a result from Part B: namely, the fact that the 
2-category of bounded S-toposes admits cotensors with 2 (B4.1.4). That is, for 
any bounded S-topos E, there exists a topos G = 2 +h € equipped with a diagram 


which is universal amongst 2-cells with codomain €. Now the identity 2-cell 
on the identity geometric morphism € — € induces a geometric morphism 
A: E —G, which clearly satisfies djA = d^ ©& lg; and there are canonical 
geometric transformations Adı —> 1g — Ado, which make A left adjoint to dı 
and right adjoint to dọ in Lop/S. In particular, dı is a local morphism, with 
centre A. Thus if we form the pullback 


fs ae 


Ds dı 


Soa ag 
then £, is local over S by 3.6.7 (iv). Moreover, for any bounded S-topos q: F — S, 
morphisms f: G — E, over S correspond to morphisms g = sf: F > 2h E sat- 
isfying djg = sq, and hence to morphisms h = dog: F — E equipped with a 
geometric transformation sq — h over S. 0 


There are two alternative ways of representing E, as the solution of a universal 
problem. For the first, let pSop/S denote the 2-category whose objects are 
bounded S-toposes (p: E — S) equipped with a specified point s (that is, a 
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section of p), and whose 1-cells are geometric morphisms over S preserving the 
specified points; 2-cells between morphisms f, f’: (p,s) 3 (p’, s’) are geometric 
transformations a: f — f’ such that both p’ oa and aos are (the canonical) 
isomorphisms. 


Corollary 3.6.13 The assignment (p:E—>S,s) = E is a functor 
pLop/S — BLop/S, right adjoint to the functor which sends p:E— S to 
scns(E) equipped with its closed point. 


Proof For an S-topos (q: F + S), morphisms F — E, over S correspond to 
morphisms f: F — 2h E such that d;f = sq, and hence to pairs (g, œ) where 
g = dof is a morphism F — € over S, and a is a geometric transformation 
sq — g. But these in turn correspond to geometric morphisms scns(F) — E 
over S carrying the closed point of sens(F) to s. 


Another, more ‘relaxed’, way of making pointed bounded S-toposes into a 
2-category is to allow the 1-cells (p, s) — (p’,s’) to be pairs (f,a) where f isa 
morphism over S (that is, p' f S p) but a: s’ > fs is a geometric transformation 
over S which is not assumed to be invertible. With the appropriate relaxation of 
the notion of 2-cell, this yields a 2-category which we shall denote by PTop/S. 
We write LocLop/S for the full sub-2-category of BLop/S whose objects are 
local S-toposes; by 3.6.1(vi) we see that the operation of equipping a local topos 
with its centre defines a full embedding of 2-categories CocLop/S — PLop/S. 


Corollary 3.6.14 The assignment which sends (p: E > S,s) to E, is a functor 
PBLop/S — LocTLop/S, right adjoint to the full embedding just defined. 


Proof First we indicate how (E, s) ++ E, becomes a functor on PTop/S. Let 
(f,a): (F,t) > (E,s) be a 1-cell in this 2-category; then we have a diagram 


Fy 3 20mF 


1 
t 
Ta f 
S e 


which on pasting yields a diagram of the required form to induce a morphism 
Fi — E, over S. The definition of the localization functor on 2-cells of PTop/S 
is straightforward. 

To establish the adjunction, let (q: F — S) be a local S-topos, with centre 
c, and suppose we are given (f,a):(F,c)— (E€, s) in PTop/S. Then we may 
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obtain a morphism F — E, from the diagram 


er, oe eee 
T Ta 
C 
q S 
S 


Conversely, given g: F — E, over S, the diagram 


à 
EREE a Se Se 


t 
C q dı 


S —— S —— E 


yields a morphism (F,c) — (E, s) in PTop/S. The rest of the proof is straight- 
forward verification. o 


Examples 3.6.15 (a) If p: E > S is a grouplike S-topos, then the domain 
map 2 h € — £ is an equivalence — indeed, this property characterizes grouplike 
S-toposes, since a category C is a groupoid iff the domain map [2,C] — C is an 
equivalence. It is therefore immediate from the definition that any localization 
of a grouplike S-topos is equivalent to the base topos S. 

(b) If p: E > S is already local, and c is its centre, then the localization Ee is 
simply (equivalent to) € itself, by 3.6.14. 

(c) If C is a small category and U is an object of C, then the localization of 
[C, Set] at the point induced (as in A4.1.4) by the functor U: 1 > C is simply 
the functor category [U\C, Set]; for it is easy to verify that the notion of a 
C-torsor F equipped with a natural transformation from the functor C (—,U) to 
F (corresponding to an element x € F(U)) is equivalent to that of a torsor G on 
U\C, by an argument like that in the proof of B4.1.2. 

(d) If X is the spectrum of a commutative ring A in Set (ie. the space 
of prime ideals of A, equipped with the topology generated by the open sets 
D(a) = {Pe X | a ¢ P}, a € A — it is well known that this topology is 
isomorphic to the frame of radical ideals of A) and P is a particular prime ideal, 
then Sh(X)p may be identified with the spectrum of the ring Ap (the localization 
of A at P) obtained by formally adjoining inverses for all elements of A \ P. Of 
course, the topos-theoretic notion of localization was originally inspired by this 
algebraic example. 

(e) Finally, we mention an example from [545] of one way to define the notion 
of ‘the germ of a manifold at a point’ in topos-theoretic terms. Let M be asmooth 
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manifold, and x a point of M; if we simply localize the topos Sh( M) at the point 
x, we will obtain the topos Set by (a) above, since M is a Hausdorff space and so 
Sh(M) is grouplike. However, we have another topos associated with M, namely 
the slice €/1(M), where € is the topos constructed as in A2.1.11(e) from a small 
full subcategory C of Mf containing M, and l: C — E is the canonical functor. 
As we saw in 3.6.3(e) above, there is a local morphism €/1(M) — Sh(M)); if c 
denotes the centre of this morphism, then the point cx of E/M corresponds to the 
(C/M)°?-torsor whose value at f: N — M is simply the set f~!(x). Localizing 
at this point is clearly unlikely to tell us anything about the topological structure 
of M at x; and indeed, it is easy to show that (€/I(M))-z =~ Sh(M), ~ Set. 
However, we have another point ž of €/1(M), corresponding to the torsor which 
sends f: N — M to the set of germs at x of continuous sections of f. Clearly, 
evaluation of germs at x yields a natural transformation from this functor to 
the one considered previously, and hence to a geometric transformation č — cx 
(so that cx will ‘appear’ as a point of the localization (€/1(M))z). In [545] it is 
shown, by constructing an explicit site of definition for this localization, that it 
does indeed represent ‘the germ of M at x’, at least in the sense that, for any 
object N of C, the category BTop/E ((E/l(M))z,E/1(N)) is equivalent to the 
(discrete) set of germs at x of smooth maps M — N. 


As promised, we conclude this section by briefly reviewing the duals of local 
maps. Paralleling 3.6.1, we have 


Theorem 3.6.16 For a geometric morphism f: F — E, the following are 
equivalent: 


(i) f is locally connected and the left adjoint fi of f* is cartesian. 

(ii) f has a right adjoint when regarded as a morphism f — le in Yop/E. 
(iii) f is connected and has a right adjoint in Lop. 
(iv) There exists a geometric morphism d: E > F such that fd ~ lg and, 
for every E-topos g: G — E, the composite dg is a terminal object of 
Lop/E (g, f). 
(v) f has a section d for which there exists a geometric transformation 

B: 1f —> df over E. 


Proof (i)> (iii): If (i) holds, then f is connected by 3.3.3; and f, is the inverse 
image of a geometric morphism, which is right adjoint to f in Yop. 

(iii) > (ii) is similar to the implication (iii) = (ii) of 3.6.1. 

(ii) > (i): If f has a right adjoint d in Top/E, then d* is an €-indexed left 
adjoint for f*. So f is locally connected, and (i) holds. 

(ii) > (iv) > (v) = (ii) is exactly dual to the proof of (ii) > (vi) > (vii) > (ii) 
in 3.6.1. o 


We say that a geometric morphism is totally connected if it satisfies the equiv- 
alent conditions of 3.6.16. Note that the morphism d of 3.6.16(iv) is necessarily a 
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dense inclusion (indeed, a flat inclusion in the sense of 1.1.16(d)), since d, = f* 
is full and faithful and preserves finite unions. 


Examples 3.6.17 (a) For a space X, we saw that Sh(X) is local over Set 
iff X has a focal point, i.e. one whose only neighbourhood is the whole space. 
Similarly, Sh(X) is totally connected iff X has a dense point d — one whose 
closure is the whole space. This condition implies that every inhabited open set 
is connected (which explains the use of the term ‘totally connected’); and the 
converse holds provided X is sober, since the condition ‘every inhabited open 
set is connected’ implies that the inhabited open sets form a completely prime 
filter in O(X), and the corresponding point of X is clearly dense. To see that 
this implies that Sh(X) is a totally connected topos, observe that it clearly 
implies that X is locally connected, and hence so is the domain of any local 
homeomorphism & — X; now each connected component of E must meet the 
fibre over the dense point d in exactly one point, and so we may identify d* with 
the ‘connected components’ functor 7 of 1.5.9. 

(b) Paralleling 3.6.3(b), we see that if a small category C has a terminal 
object, then [C, Set] is a totally connected Set-topos. However, this condition is 
not necessary, since the right adjoint d need not be induced by a functor 1 > C 
(that is, an object of C). In fact, as we know from B2.6.8, a diagram category 
[C, S] is totally connected over S iff C is filtered. 

(c) Generalizing the last example, let (C, J) be an internal site in a topos S, 
such that C°P is filtered (for example, it would be sufficient for C to be carte- 
sian) and every J-covering family is connected. Then by 3.3.10 we know that 
constant diagrams of shape C°P are sheaves, and hence the left adjoint p of the 
inverse image functor p*: S —> Shs(C, J) is simply limcop applied to sheaves 
on C: in particular, it is cartesian by B2.6.8 and the fact that the inclusion 
Shs(C, J) — [C°?, S] preserves finite limits. So Shs(C, J) is totally connected 
over S. Conversely, if S has a natural number object, then every totally con- 
nected bounded S-topos € can be generated by a site of this form, since the 
connected objects of E (i.e. those objects A satisfying p(A) = 1) are closed 
under finite limits. 

(d) We may also dualize 3.6.3( f): if we glue along the inverse image p*: S — E 
of a geometric morphism p, instead of the direct image -— that is, if we form the 
cocomma object (le fp) in Top, instead of (pT1le) — then we obtain a topos 
which is readily seen to be totally connected over S. In particular, any geometric 
morphism can be factored as a closed inclusion followed by a totally connected 
morphism. (It will be noted that we have already exploited this factorization in 
the proof of 3.3.14.) And, dualizing 3.6.4, we see that p itself is totally connected 
iff € is a retract of (1e Tp) in Lop/S, iff it is a coadjoint retract of (1¢ fp). Note 
also that the functor (p: E — S) — (le Tp) may be regarded as a contravariant 
partial product functor in the sense of B4.4.13, using the closed inclusion S ~> 
[2, S] in place of the open inclusion which we used in 3.6.5. In particular, since 
closed inclusions are entire, one may deduce that any bounded S-topos which 
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admits an algebra structure for the upper bagdomain monad of B4.4.21 is totally 
connected. 

(e) Our final example has no parallel in the local case. Let p: E > S be a 
bounded S-topos, where S has a natural number object. We recall that in B4.5.8 
we constructed the ‘measure topos’ M€, which has the property that geometric 
morphisms F — ME over S correspond to cocontinuous S-indexed functors 
E — S, for any bounded S-topos F. Suppose p is locally connected; then, for 
any S-topos q: F — S and any cocontinuous S-indexed functor 0: E — F, there 
is a unique S-indexed natural transformation 0p* — q* (since each object of S 
is an S-indexed coproduct of copies of 1, and q* preserves 1), and hence there 
is a unique S-indexed natural transformation 0 — q*pı. So q*p: is terminal in 
LocPress (E, F); equivalently, the section d of ME — S which corresponds 
to pı has the property that dq is terminal in Top/S (F, ME). Thus we deduce 
that if E is a locally connected bounded S-topos, then ME is totally connected 
over S. In fact the converse is also true: for the canonical inclusion 6: E — ME 
is S-essential by the remark before B4.5.8, and hence so is its composite with 
ME — S, even if we merely assume that the latter is locally connected. Note 
also that, if E is connected and locally connected over S, then the topos PE of 
probability measures on € (cf. B4.5.12) is totally connected. 


From 3.6.17(c), we may obtain a characterization of totally connected mor- 
phisms in terms of fibrations of sites: a morphism between bounded S-toposes 
(where S has a natural number object) is totally connected iff it can be induced 
by a fibration of sites (P,T): (D, K) — (C, J) such that C and D are cartesian 
and P is a morphism of sites (i.e. it is cartesian and preserves covers) — cf. the 
remark after 3.1.21. 


Lemma 3.6.18 
(i) A composite of totally connected morphisms is totally connected. 


(ii) In a commutative triangle 


where h is totally connected and g is connected, f is totally connected. 
Moreover, if d is the right adjoint of h in Yop, then gd is the right adjoint 
of f. 

(iii) Totally connected morphisms, and their right adjoints, are stable under 
pullback. 
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(iv) Given an inverse sequence of totally connected morphisms whose (Cat- 
enriched) limit exists in Yop, the legs of the limit cone are totally 
connected. 


(v) Totally connected morphisms are orthogonal to grouplike morphisms (cf. 
3.6.9). 


Proof (i) is immediate from either (i) or (iii) of 3.6.16. 

(ii) If we define fi = hig*, then it is clearly cartesian; and it is left adjoint to 
f* since g* is full and faithful. So f has a right adjoint in Top; but it is clearly 
connected, since g and h are. 

(iii) and (iv) are proved in a similar manner to 3.6.7(iv) and (v); and (v) is 
exactly like 3.6.10. O 


Similarly, for bounded toposes over a base, we have the dual of the notion of 
localization. 


Theorem 3.6.19 Given a bounded S-topos p: E —> S and a particular section 
s: S — E ofp, there exists a bounded totally connected S-topos p°: E° — S (called 
the colocalization of E at s) equipped with a geometric morphism E° — E over 
S which carries the dense section of E° into the chosen section s, and which 
induces an equivalence 


BTop/S (F, E) ~ (BLop/S (F, E))/sq 


for any bounded S-topos (q: F — S). 


Proof E° is constructed in the same way as the localization €,, except that 
we interchange the rôles of the domain and codomain maps 2 h € = E. We have 
already observed in the proof of 3.6.12 that do: 2 h E — E is totally connected, 
its right adjoint being given by the diagonal A; so if we form the pullback 


fi Se 
p° do 
S22 Se 


and define €° — E to be the composite d;§, then the stated properties of E° all 
follow by arguments identical to those in the proof of 3.6.12. o 


Suggestions for further reading: Bunge & Funk [197, 198], Johnstone & 
Moerdijk [545]. 


C4 


LOCAL COMPACTNESS AND 
EXPONENTIABILITY 


C4.1 Locally compact locales 


Our principal goal in this chapter is to establish the characterization of 
exponentiable objects in the 2-category BYop/S, for a general base topos S, 
as those which are ‘locally compact’ in an appropriate sense. We have already 
made some observations about exponentiability in BTop/S in Section B4.3; and 
we Shall assume that the reader of this chapter is familiar with them. In this and 
the next section we shall take our base topos to be Set, in order to simplify the 
notation; but virtually all that we do can be extended to a general base topos S 
with a natural number object, using an appropriate internal finiteness notion in 
S to interpret the finiteness conditions which appear below. 

The notion of local compactness for locales was not considered in 
Section C1.5 (or in Chapter C3): this was a deliberate omission, so that we 
could begin the present chapter by discussing it, and thereby provide the moti- 
vation for the notion of continuous category which underlies the definition of 
local compactness for toposes. 

First we need to recall the notions of continuous poset and continuous lattice. 


Definition 4.1.1 Let A be a poset with directed joins (that is, a domain in 
the sense of 1.1.3). 


(a) Given elements a and b of A, we say a is way below b (and write a << b) if, 
for every directed S C A with V S > b, there exists s € S with s > a. 


(b) We say A is a continuous poset if, for every a € A, the set {b € A | b << a} 
is directed and has join a. 


(c) We say A is a continuous semilattice (resp. continuous lattice) if it is a 
continuous poset and has finite meets (resp. joins). 


Note that a poset with finite joins and directed joins has all joins, and is thus 
a complete lattice. It is also easy to see that the way-below relation is stable 
under finite joins, to the extent that they exist in A (that is, if a; << bı and 
a2 << be, then (a; V a2) << (bı V b2) provided the joins exist), so that in a 
complete lattice the directedness of the set {b | b << a} is automatic. Another 
important property of the way-below relation on a continuous poset is the so- 
called interpolation property: if a << b, then there exists c satisfying a << c and 
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c << b. To prove this, we simply note that the set {d | (Sc)(d << c << b)} is 
directed and has join b. 


Lemma 4.1.2 A continuous semilattice is a preframe. Hence a continuous 
lattice is a frame iff it satisfies the finite distributive law. 


Proof Let S be a directed subset of a continuous semilattice A, a an element 
of A. The inequality V{aAs|s E S} <a ^V 8S is always true, so it suffices to 
show the reverse. But if b << (aA V S), then b < a and b < s for some s E S, 
and hence b < V{aAs|s E€ S}; since a A V S is the join of all such b, the result 
follows. 

The second assertion is immediate from the first, since a frame is precisely a 
preframe which is a (finitely) distributive lattice. oO 


We shall make extensive use of the Scott topology on continuous posets: recall 
that a subset U of a poset A with directed joins is said to be Scott-open if it is an 
upper set and inaccessible by directed joins (i.e. whenever the join of a directed 
set S is in U, then S meets U). The following facts about Scott-open subsets are 
easily established: 


Lemma 4.1.3 
(i) For a poset A with directed joins, the set XA of all Scott-open subsets of 
A is a topology on A. 


(ii) A subset U of a domain A is Scott-open iff its characteristic function 
A —> Q is a morphism of domains. 


(iii) If A is a continuous poset, then the sets f(a) = {b E€ A | a << b} are 
Scott-open for alla € A, and they form a base for the Scott topology on A. 


(iv) Let f: A— B be a map between posets with directed joins. If f preserves 
directed joins, then it is continuous for the Scott topologies on A and B. 
The converse holds if B is continuous. 


Proof (i) and (ii) are trivial. For (iii), the openness of the sets f(a) follows 
from the interpolation property for <<, mentioned earlier; and they form a base 
because, if U is Scott-open and a € U, we can find b << a with b € U, and hence 
a € f(b) CU. The first half of (iv) is trivial; for the second, suppose S C A is 
directed and b << f(V S) in B. Then V S € f~!(f(b)), which is Scott-open if f 
is continuous, so there must exist s € S with f(s) > b. Hence 


Vi f(s) | s € 5} > VEV S) = FOV 9): 
the reverse inequality is trivial. oO 


There are two alternative characterizations of continuous posets, which will 
be of importance in what follows. For the first, we recall from 1.1.3 that an ideal 
in a poset is a subset which is (upwards) directed and also closed downwards; 
we write TA for the set of ideals of A, ordered by inclusion. The mapping which 
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sends an element a of A to the principal ideal | (a) is an order-preserving map 
A—> IA. 


Proposition 4.1.4 The following conditions on a poset A are equivalent: 


(i) A is continuous. 
(ii) | A— IA has a left adjoint, which in turn has a left adjoint. 
(iii) A is a retract in Dom of a poset of the form IB. 
(iv) A is a retract in Dom of a continuous poset. 
Proof (i)> (ii): If A has directed joins, then the mapping V: JA — A which 
sends an ideal to its join is easily seen to be left adjoint to |. The definition of the 
way-below relation says that the left adjoint of \/, if it exists, necessarily sends 
an element a to {(a) = {b € A | b << a}; the definition of continuity says that 
the map | does indeed take values in JA, and that the unit of the adjunction 
exists (the existence of the counit is automatic from the definition of <<). 

(ii) > (iii): Reversing the first part of the previous argument, if | has a left 
adjoint \/, then the latter necessarily sends an ideal to its join in A; hence A 
has directed joins, since the join of any directed set coincides with that of the 
ideal which is its downward closure. Now V and its left adjoint both preserve 
any joins which exist; and they express A as a retract of JA, since | is injective 
and hence \/ is surjective. 

(iii) = (iv): We have to show that a poset of the form IB is continuous. But 
IB certainly has directed joins, since they are just unions; and any principal 
ideal | (b) is inaccessible by directed joins, i.e. satisfies | (b) << | (b). Since any 
ideal J is the (directed) union of the principal ideals which it contains, it follows 
easily that J << J iff J is contained in | (b) for some b € J. It is now immediate 
that |(J) is directed and has T as its union. 

(iv) = (i): Suppose B is continuous, and let i: A — B, r: B — A be directed- 
join-preserving maps with ri = 14. (It is then clear that A has directed joins.) 
Given a € A, we claim first that r(b) << a in A whenever b << i(a) in B: 
for if S C A is directed with V S > a, then b < i(s) for some s € S, whence 
r(b) < ri(s) = s. Since a = ri(a) is the join of all r(b) with b << i(a), it follows 
as in the previous paragraph that a’ << a in A iff a’ < r(b) for some b << i(a), 
and hence that A is continuous. O 


Remark 4.1.5 We note that, if A is expressible as a retract in Dom of a 
poset of the form IB, then it is expressible as a coadjoint retract (in the sense of 
B1.1.9(c)) of such a poset; i.e. we can find i: A — IC and r: IC — A such that 
i is left adjoint to r and the unit 14 — ri is an equality. Equivalently, if we start 
from the full subcategory of Dom whose objects are those of the form JB, then 
splitting all the idempotents in this category has no more effect than splitting 
the idempotent comonads, i.e. those idempotents e: IB — IB satisfying e < lrg. 
Results of this type will form a recurring theme throughout this chapter. 
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If the remarks in the previous paragraph seem reminiscent of B1.1.15, that 
is no accident: of course, what is going on here is that we are dealing with 
a KZ-monad on the (locally ordered) 2-category Poset, whose functor part is 
given by A> JA, whose unit is the principal ideals map |: A — J. A, and whose 
multiplication is the join map I(IA) — JA. The algebras for this monad are 
exactly the domains; so the equivalence of (ii) and (iii) in 4.1.4 is more or less a 
direct transcription of B1.1.15 in this particular case. 


For the second characterization (which was Scott’s original justification for 
studying continuous lattices [1102]), we restrict our attention to lattices. Note 
that, in the characterization of 4.1.4(iii), continuous lattices are just the retracts 
in Dom of posets of the form IB where B is a join-semilattice. 


Proposition 4.1.6 For a complete lattice A, the following are equivalent: 


(i) A is continuous. 


(ii) (A, EA) is a sober space, and is injective with respect to subspace inclusions 
in the category Sob. 


(iii) A is the set of points of a locale X which is injective with respect to inclu- 
sions in Loc, ordered by the ‘specialization ordering’ (that is, the canonical 
partial ordering on maps 1 —> X in Loc, as defined in 1.1.6). 


(iv) (A, EA) is a retract of a power of the Sierpiński space S (cf. 1.1.4). 


Proof (i) = (ii) and (iii): First consider the case when A = IB for a join- 
semilattice B. Then a Scott-open subset U C A is determined by the set {b € 
B | |(b) € U}, since any element of A is a directed join of principal ideals; but 
the latter set may be any upper subset of B, so XA is isomorphic to the lattice 
UB of all upper sets in B. But the latter may also be regarded as the lattice 
L(B°?) of lower sets in the meet-semilattice B°P, and we saw in 1.1.3 that this 
is the free frame generated by B®. Thus a frame homomorphism ¢: UB >Q 
corresponds to a meet-semilattice homomorphism B°? — Q; but the latter is the 
same thing as the characteristic function of a filter of B°?, i.e. of an ideal of B. 
So we may identify (A, XA) with the space of points of the locale X defined by 
O(X) = UB. In particular, it is sober, and the locale X is spatial. 

To verify that it is injective, we observe that for any locale Y (in particu- 
lar for any sober space), a continuous map f: Y — X corresponds to a frame 
homomorphism f*: L(B°?) = O(X) — O(Y) and hence to a meet-semilattice 
homomorphism BP — O(Y). Now if Y is a sublocale of Z, then O(Y) is a 
retract of O(Z) in mSLat, since frame homomorphisms and their right adjoints 
are both meet-semilattice homomorphisms; hence we may extend any continuous 
map Y ~ X toa map Z — X. Thus we have verified that (ii) and (iii) hold 
whenever A is of the form IB. 

The general case follows easily from 4.1.4(iii): for if A is a retract of IB by 
maps preserving directed joins, the latter are continuous with respect to the 
Scott topologies by 4.1.3(iv). But any retract of a sober space is sober, since 
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Sob is reflective in Sp and hence closed under equalizers; and a retract of an 
injective object is injective in any category. 

(ii) > (iv): Any sober space (indeed, any To-space) X may be embedded as a 
subspace of a power S” of Sierpinski space: just take B to be the set O(X), and 
map z € X to the point whose Uth coordinate is 1 iff x € U. If X is injective, 
this embedding must have a retraction. 

(iii) > (iv): Similarly, we saw in 1.1.4 that the free objects of Frm are exactly 
the topologies of powers of Sierpiński space. It follows that the projectives (with 
respect to regular epimorphisms) in Frm are exactly the retracts of such powers 
(in particular, they are all spatial, which explains the equivalence of (ii) and (iii)). 

(iv) > (i): First we observe that the product topology on any power S? of 
Sierpinski space coincides with the Scott topology for the pointwise ordering on 
SP (where S is given the obvious ordering 0 < 1). But SP S PB ~ I(KB) is 
clearly a continuous lattice. If (A, £A) is a retract of SP, then the inclusion and 
retraction must preserve directed joins, since they are continuous for the Scott 
topologies; so A is continuous by 4.1.4(iv). O 


We remark that it follows from 4.1.6 that, for any injective sober space X, 
the specialization ordering on points of X is a complete-lattice ordering. (Recall 
that it always has directed joins, by 1.1.6; so the force of this observation is that 
it has finite joins.) 

In the lattice O(X) of open subsets of a topological space X, it is easy 
to see that the relation U << V holds if there is a compact set K C X with 
U C KC V. (In particular, U << U holds iff U itself is compact.) Hence if X is a 
locally compact space (i.e. one in which every open set is covered by the interiors 
of its compact subsets), then O(X) is a continuous lattice. Conversely, it can be 
shown using the axiom of choice that if a continuous lattice A is distributive, 
then the locale X defined by O(X) = A is spatial, and its space of points is 
locally compact. (We shall not give the proof here; see [452] for the details.) 
We therefore define a locally compact locale to be one whose frame of opens is a 
continuous lattice. 

Given a nucleus j on a frame O(X), it is easy to see that the sublocale 
corresponding to the fixset O(X), is compact iff the filter F = {U € O(X) | 
j(U) = X} is Scott-open. In general, not every filter in a frame is representable 
as the filter of elements mapped to the top element by a nucleus; but it turns 
out that every Scott-open filter is so representable (see [526]). Using this, plus 
the result of [45] that distributive continuous lattices have ‘enough Scott-open 
filters’, it is possible to give an alternative characterization of locally compact 
locales as those which have ‘enough compact sublocales’, which looks more like 
the usual definition of local compactness for spaces. However, we shall not need 
this characterization here; and it is in any case non-constructive (the second 
ingredient mentioned above requires the axiom of countable dependent choices), 
so we cannot use it when dealing with internal locales in a topos. 
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For future reference, we note a couple of stability properties of local 
compactness. 


Lemma 4.1.7 
(i) A locally closed sublocale of a locally compact locale is locaily compact. 


(ii) If f: X > Y is a triquotient map in the sense of 3.2.7 (for example, an 
open or a proper surjection) and X is locally compact, then so is Y. 


Proof (i) Since a locally closed sublocale is an open sublocale of its closure, it 
suffices to prove separately that local compactness is inherited by open and by 
closed sublocales. But each of these is easy, since principal ideals and principal 
filters in a continuous lattice are continuous (either directly from the definition, 
or using 4.1.4(iv)). 

(ii) We can express O(Y) as a retract of O(X) by maps preserving directed 
joins. So this is immediate from 4.1.4(iv). oO 


Local compactness is also inherited by finite products in Loc: it is not hard 
to prove this directly from the definition, but we do not need to do so, since 
it follows from 4.1.9 below and A1.5.1(i). On the other hand, we shall give a 
constructive proof that spatial and localic products coincide if one of the factors 
is locally compact. (For more information on when spatial and localic products 
coincide, see [478] and [978].) 


Lemma 4.1.8 Suppose X and Y are spatial locales, and X is locally compact. 
Then the locale product X x Y is spatial. 


Proof Let W and W’ be opens in X x Y such that every point of W belongs 
to W’; we must show that W < W’, for which it is enough to show that every 
open rectangle U x V < W satisfies U x V < W’. For any open V’ < Y, let 
Uy = U{U' € O(X) | U' x V' < W’}; and for any y in V, let U, = U{Uv | 
V' € O(Y),y € V’}. For each (x,y) € U x V, we have (z, y) € W’, so there must 
be a rectangular neighbourhood U’ x V’ of (x,y) contained in W’, and hence 
x € U}. Since X is spatial, U is covered by its points, and so this implies U < Uy 
for each y € V; but the join in the definition of U, is directed, and hence for’ 
each U' << U we may conclude that every y € V has a neighbourhood V’ such 
that U’ x V' < W’. Since V is also covered by its points, this in turn implies 
that we have U’ x V < W’ for each U’ << U; finally, local compactness of X 
implies that we have U x V < W’. O 


A well-known application of local compactness in ‘classical’ topology is that 
locally compact spaces are exactly those for which the compact-open topology 
on function spaces ‘works well’ (cf. [367]): in the terminology introduced in 
Section A1.5, they are exactly the exponentiable objects of Sp. The correspond- 
ing result for locales was first proved by M. Hyland [461]: 


Theorem 4.1.9 A locale is erponentiable in Loc iff it is locally compact. 
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Proof First suppose X is exponentiable. Since the functor (—) x X: Loc > 
Loc preserves inclusions (as they are just regular monomorphisms), its right 
adjoint (—)* preserves injectives; in particular, the exponential S* must be 
injective, where S is the Sierpiński space. But points of S* correspond bijectively 
to locale maps X — S, and hence (since O(S) is the free frame on one generator) 
to elements of O(X); we claim that this correspondence maps the specialization 
ordering on points of S* to the given ordering on the frame O(X), so that the 
latter must be a continuous lattice by 4.1.6. 

To verify the claim, note that the correspondence sends a point p: 1 — S* 
to g*(U) € O(X), where q is the composite 


pxl 
Ga aa E aE 


and U is the generator of O(S) when the latter is regarded as the free frame 
on one generator. Clearly pı < p2 implies q) < q2 and hence gjf(U) < q3(U). 
Conversely, if Vi < V2 in O(X), consider the element 


W =(S@Vi)U(U 8 V2) 


of O(SxX) ¥ O(S)@O(X); this element corresponds to a locale map Sx X = S, 
whose transpose w: S — S* has the property that w-0 and w- 1 are the points 
corresponding to V; and V2 respectively. (Here 0 and 1 are the two points of S.) 
But 0 < 1 in the specialization order on S, sow-0<w-1. 

Conversely, suppose X is locally compact. Let Y be the locale defined by 
O(Y) = 4(O(X)); then Y is an injective locale by 4.1.6(iii), and it is clearly the 
only possible candidate for the exponential S* in Loc. We first verify that it does 
indeed have the universal property of an exponential S*. The evaluation map 
ev: Y x X — S must correspond to an element of O(Y x X) = L(O(X)) @O(X); 
for this we take 


E =| J{Î(U) 8U) | U € O(X)}. 


Given a locale Z and a map h: Z x X — S, corresponding to an open H € 
O(Z x X) = O(Z) 8 O(X), we define h: Z => Y by 


A (V) =({W €O(Z) | GU € V)\(W &U < H)}. 


(It is straightforward to verify that h is a frame homomorphism.) We claim that 
the diagram 


hx1 
Z x X —— Y xX 
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commutes; for this, it clearly suffices to show that (h x 1)*(E) = H. But by 
definition, we have 


es =| J U) @U |U €O(X)} 


and 
h ÂU) SU =| ]J{W @U | GU' € Î(U))(W 8 U’ < H)}. 


The union of the right-hand side of the latter equation is clearly below H, since 
U << U' implies W QU < W@U’; so (h x1)*(E) < H. But H can be written as 
a union of ‘open rectangles’ W QU, since these generate O(Z x X) as a complete 
join-semilattice, and hence by continuity of O(X) it can also be written as a join 
of rectangles W @ U for which there exists U’ with U << U’ and W QU’ < H. 
So the reverse inequality also holds. 

We must next show that if k: Z— Y is any locale map which satisfies 
ev(k x 1)=h, then k = h. Since the Scott-open sets of the form f(U) form 
a base for ©(O(X)) by 4.1.4(iii), it suffices to show that k* and h’ agree on all 
such sets. But we have 


H = (k x 1)*E = |_J{k*(Î(U)) x U | U € O(X)}, 


and hence for any U’ with U << U’ we have 


_—* 


h (f(U)) = k*(F(U')) 


by the definition of h . But f(U) is the union of all f(U’) with U << U’, by the 
interpolation property; so it follows that we have k*(f(U)) < A’ (f(U)) for all U. 

For the reverse inequality, suppose U << U’ and W € O(Z) are such that 
WxU' < H. Then W xV” is covered by the opens k*(f(V)) x V, V € O(X). Now 
the set {V € O(X) | k*(#(V)) > W} is directed, since we have k*(Î(Vi U V2)) = 
k*(F(Vi) N F(V2)) = k*(Î(V1)) N k*(Î(V2)), and it covers U’, so there exists 
V > U with k*(f(V)) > W. But this implies that h’(f(U’)) < k*(f(U)), whence 
h (f(U)) < k*(Î(U)) follows as before. 

Having thus constructed the exponential S*, we may clearly now define 
(S4)X, where A is any set, as (S*)4 (that is, the product in Loc of A 
copies of S*). And any continuous map f: S4— SP gives rise to a map 
f*:(94)* — (S8)*, namely the transpose of 


f 


(S4)* x x “Ys s4 —-_» SP; 


that is, (—)* becomes a functor on the full subcategory of Loc whose objects are 
the powers of S. But any locale Y is a sublocale of a power of S, as we observed 
in the proof of 4.1.6; and locale inclusions are regular monomorphisms, so we 
can express Y as the equalizer of a pair of maps S4 = SP. Since the functor 
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(—)* is a right adjoint, it should preserve equalizers; so we have merely to define 
Y~ to be the equalizer of the corresponding pair of maps (S4)* = (S8)*, and 
to verify that this has the right universal property. 


We remark that it is possible to give a direct construction of the exponential 
Y~* for an arbitrary Y and locally compact X, by constructing a presentation for 
its frame of opens from the elements of O(Y) and O(X): this is done in Hyland’s 
original paper [461]. However, the fact that the existence of the particular expo- 
nential S* is sufficient to construct Y* for an arbitrary Y is of interest in its own 
right — note that we have seen the topos-theoretic version of this locale-theoretic 
fact in B4.3.1. 

To conclude this section, we briefly discuss stable local compactness. We have 
seen that the way-below relation in a complete lattice is always stable under finite 
joins; but it need not be stable under finite meets, even if the lattice is continuous 
and distributive: 


Example 4.1.10 As in 3.4.1(a), let X be the space obtained from two copies 
of the closed unit interval [0,1] by identifying the two copies of the open interval 
(0,1). Then the two copies of [0,1] are open in X and compact, so they define 
elements U and V of O(X) satisfying U << U and V << V. But UNV = (0,1) 
is not compact, so we do not have (UNV) << (UNV). 


We say a continuous (semi)lattice A is stably continuous if the way-below 
relation on A is stable under finite meets (including the empty meet; i.e. the top 
element 14 is compact); and we call a locale X stably locally compact if O(X) is 
a stably continuous lattice. (If we merely wish to assert that << is stable under 
binary meets, without assuming compactness of the top element, we shall say 
that X is locally stably compact.) 


Lemma 4.1.11 Any compact Hausdorff locale is stably locally compact. 


Proof We recall from 3.2.10(ii) that a compact Hausdorff locale X is regular; 
that is, every open U is the join of the opens V such that there exists an open W 
with VAW =0 and UUW = X - as in 1.2.17, we denote this relation between 
U and V by V SU. Since the relation < is clearly stable under finite meets, it 
suffices to show that it coincides with <<. 

But < is also stable under finite joins, so the join appearing in the definition 
of regularity is directed; thus in a regular locale the relation V << U implies 
V < W for some W 4U, and hence V <U. And any closed sublocale of a compact 
locale is compact, so in a compact locale the relation V 4U implies V << U. So 
the result is established. O 


For future reference, we also note: 


Lemma 4.1.12 Let X be a compact Hausdorff locale. Then a sublocale of X 
is locally compact iff it is locally closed. 
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Proof One direction is immediate from 4.1.11 and 4.1.7(i). Conversely, sup- 
pose Y is a locally compact sublocale of X. By replacing X by the closure of 
Y, we may assume Y is dense in X; we then have to prove that it is open. We 
first note that if V << Y in O(Y), then the closure V = C(-V) of V in Y is 
compact; for if we have a directed family of opens U; with (J; U: > V, then 
U.(U; U =V) = Y, whence if V << W << Y we have U; U ~V > W for some i. 
But Y is regular since it is a sublocale of X; so, as in the proof of 4.1.11, the 
relation V << W implies V «W, and hence U; UnV > W implies U; > V. 
Now by 3.2.10(i) we know that all compact sublocales of X are closed, so V 
must actually be closed in X. Suppose it equals CU where U € O(X), and let 
V € O(X) be such that V NY = V. Then VN YNU = f, whence VNU =9 
since Y is dense, so V < CU = V. But V < Y by definition, so V < Y, and thus 
V = V is open in X. Since Y is locally compact, we can express it as a union 
of opens V which satisfy V << Y and are therefore open in X; so Y is open 
in X. o 


We recall from 1.1.3 that if B is a distributive lattice, then its ideal lattice 
IB is (distributive, and hence) a frame; as in 2.4.3, we call a locale X coherent 
if O(X) = IB for some distributive lattice B. (Of course, B is determined up to 
isomorphism by X: it is the lattice of compact open sublocales of X.) 


Proposition 4.1.13 For a locale X, the following are equivalent: 


(i) X is stably locally compact. 
(ii) X is a retract in Loc of a coherent locale. 


(iii) X ts a coadjoint retract in Loc of a coherent locale. 


Proof (i) = (iii): Suppose X is stably locally compact. Then, as in 4.1.4, 
we know that O(X) is a retract of I(O(X)) by maps |: O(X) — I(O(X)) and 
V: I(O(X)) — O(X) preserving arbitrary joins; we claim that these maps also 
preserve finite meets, i.e. they are frame homomorphisms. For \/, this is obvious, 
since it is right adjoint to |; and the preservation of finite meets by | is exactly 
the definition of stability for <<. Thus we have expressed X as a retract of the 
coherent locale Y defined by O(Y) = [(O(X)); moreover, since | is left adjoint 
to \/, it is actually a coadjoint retract as defined in B1.1.9(c). 

(iii) = (ii) is trivial. For (ii) => (i), we show that coherent locales are stably 
locally compact and that retracts of stably locally compact locales are stably 
locally compact. The first of these is easy, since in the frame IB (for a distributive 
lattice B) we have I << J iff there exists a principal ideal | (b) with Z C | (b) C J; 
and the set of principal ideals is closed under finite meets. For the second, suppose 
X is a retract of Y by locale maps i: X — Y andr: Y — X; then, from the proof 
of (iv) > (i) in 4.1.4, we know that U << V in O(X) iff U < i*(W) for some 
W << r*(V) in O(Y). Since Y is stably locally compact and both r* and 2” 
preserve finite meets, it is easy to see that this relation is stable under finite 
meets. 0 
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We remark that, by essentially the same proof, locally stably compact locales 
are exactly the retracts in Loc of locally coherent locales, the latter being defined 
as those whose frames are of the form IB where B is a ‘distributive pre-lattice’; 
i.e. it satisfies all the axioms for a distributive lattice, except that it may lack a 
top element. (Given a locally stably compact locale X, we may express O(X) as 
a retract of IB where B = {U e O(X) | U << X}.) 

We may now prove a result promised in Section C3.4: 


Corollary 4.1.14 Stably locally compact locales (in particular, compact Haus- 
dorff locales) are strongly compact. 


Proof If X is a coherent locale, say O(X) = IB for a distributive lattice B, 
then the Comparison Lemma 2.2.3 yields an equivalence Sh(X) ~ Sh(B,C), 
where C is the coherent coverage on B considered as a coherent category 
(cf. A2.1.11(b)). So X is strongly compact by 3.4.1(c). But if X is a coadjoint 
retract of a strongly compact locale Y, consider the diagram 

i 


Sh(X) z > Sh(Y) 


Yx Ôx 


Set 


where 7 and r are the inclusion and the retraction. Given a filtered diagram in 
Sh(X) with vertices (A; | j € J), we have 


limyyx(Aj) = lims6ui.(A,) = 8, (limyi.(A,)) = yar, (limi. (Ay) 
X ,(limyr.i.(A,)) = v (limyA)) , 


where the fourth isomorphism uses the fact that rą is left adjoint to ią. So X is 
strongly compact. O 


Another application of 4.1.13, sometimes known as ‘Joyal’s Lemma’ (see [520, 
III 1.11]), is of use in various contexts. 


Corollary 4.1.15 
(i) A locale X is stably locally compact iff it is injective with respect to flat 
anclusions of sublocales. 


(ii) Compact Hausdorff locales are orthogonal to flat inclusions; i.e. if X is 
compact Hausdorff and i: Y — Z is a flat inclusion of locales, then any 
locale morphism Y — X extends uniquely to a morphism Z —> X. 


Proof (i) First suppose X is stably locally compact. Recall from 1.1.16(d) 
that a locale inclusion i: Y —> Z is said to be flat if ią: O(Y) — O(Z) preserves 
finite joins - equivalently, if it is a lattice homomorphism. But the assignment 
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At I(A) is a functor DLat — Frm - indeed, it is left adjoint to the forgetful 
functor, as we saw in 1.1.3. So, given a frame homomorphism f*: O(X) — O(Y), 
we may define g*: O(X) — O(Z) to be the composite frame homomorphism 


O(X) ee IO(X) a IO(Y) ea, IO(Z) Aa O(Z) . 


We must verify that i*g* = f*; but 


rg =i V, IGAO 
Vi 1 TG) LF") 
= V I(f"){ 
EPN 


Aa 


where the last step uses 4.1.13, and the others are instances of the functoriality 
of I or the naturality of V. 

For the converse, we recall from 1.1.16(e) that any locale X is a flat sublocale 
of a coherent locale; so if X is injective with respect to flat inclusions it must be 
a retract of a coherent locale. 

(ii) Since compact Hausdorff locales are stably locally compact by 4.1.11, the 
existence of the extension follows from (i). For the uniqueness, we use the fact 
that if X is Hausdorff then the equalizer of any two locale morphisms Z 3 X 
is a closed sublocale of Z, because it is a pullback of the diagonal X — X x X; 
but flat inclusions are dense, so if the equalizer contains Y then it must be the 
whole of Z. 0 


Suggestions for further reading: Banaschewski [45], Gierz et al. [404], 
Hofmann [450], Hofmann & Lawson [452], Hyland [461], Johnstone [520, 523], 
Niefield [894], Scott [1102]. 


C4.2 Continuous categories 


In this section, we develop an analogue for categories of the notion of continu- 
ous poset which we recalled in the previous section. For this purpose, it is most 
convenient to start from the characterization provided by 4.1.4: the fact that 
continuous posets are the retracts of posets of the form IB by maps preserving 
directed joins. Obviously, when we generalize from posets to categories, we need 
to replace directed joins by filtered colimits; and the analogue of the ideal com- 
pletion IB (the poset obtained by freely adding directed joins to a poset B) is 


C4.2 CONTINUOUS CATEGORIES 723 


the inductive completion Ind-C of a category C. We therefore begin by reviewing 
this notion. 

Throughout this section, we shall work explicitly with Set-based categories; 
but all our results may be reinterpreted in the context of S-indexed categories 
for an arbitrary base topos S, using the techniques developed in Part B, and 
when we come to apply the results of this section to toposes in the subsequent 
sections of this chapter, we shall in fact do so at that level of generality. 

If C is locally small, then it is fully embedded (by the Yoneda embedding) in 
the functor category [C°?, Set], and the latter may be viewed as the free cocom- 
pletion of C, as we saw in B2.5.8. If we merely wish to adjoin filtered colimits, we 
may restrict our attention to the full subcategory C of those functors which are 
expressible as (small) filtered colimits of representable functors. (If C is small, 
the objects of C are just the C-torsors as defined in B3.2.3, i.e. those functors 
F: CP — Set for which the domain F of the corresponding discrete fibration 
F — C is filtered.) We note that this category has the following properties: 


(i) There is a full and faithful functor y: C > Ĉ. 


(ii) C has filtered colimits, and every object of C is expressible as a (small) 
filtered colimit of objects in the image of y. 


(iii) The objects in the image of y are finitely presentable as defined in 
D2.3.1(a); that is, the functor C (y(A), —) preserves filtered colimits for 
every A € ob C. 


(These three properties should be compared with those which we used to char- 
acterize the regularization Reg(C) of a cartesian category C in Section A1.3.) 
The first two properties are obvious from the definition of C ; for the third, we 
note that as an object of [C°P, Set] the functor represented by y(A) preserves all 
small colimits, by the Yoneda lemma, and the inclusion C [C°P, Set] creates 
filtered colimits. D 

In fact properties (i)-(iii) suffice to characterize C up to (weak) equivalence, 
and provide the key to an alternative definition which is in many ways more 
convenient to use. To see this, note that condition (ii) says that we might as well 
take the objects of C to be small filtered diagrams in C, i.e. functors D: I > C 
where I is a small filtered category. (We shall normally denote such a diagram 
by (A; | i € I), where A; is the object D(z) of C, leaving the reader’s imagination 
to supply names for the edges of the diagram.) For two such objects (A; | 7 € J) 
and (B; | j € J), the class of morphisms between them must satisfy 


C((Ai | i € 1), (By | 3 € J)) S limie€ (y( Ai), (By |j € J) 
= lime slimje J (y( Ai), y(B,)) 


= lim;erlim;e JC (Aj, B,) 
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using conditions (iii) and (i), plus the fact that (A; | i € T) is intended to be the 
colimit in Ĉ of the y(A;). Accordingly, we now define Ind-C to be the category 
whose objects are all small filtered diagrams in C, and whose morphisms are 
defined by the ‘double-limit’ formula in the last line of the above display: in 
other words, a morphism (A; | i € I) > (B; | j € J) is a family (@; | i € I), 
where each ¢; is an equivalence class of morphisms f: A; — B; (two morphisms 
f: Ai > B; and f’: A; — By being equivalent iff there exist morphisms j — j” 
and j’ — j” in J such that 


f 


B; ——> B, 


commutes), subject to the obvious compatibility conditions. The composite of 
two morphisms 


(gili € J) (Y |j € J) 


(Aye Gd Bd Sd) Cee) 


is (xi | i € I), where x; is the element of lim,e«C (Ai, Ck) containing all the 
composites gf such that f: A; — B; is in ¢; and g: B, — Ck is in w,. The 
embedding y: C — Ind-C sends an object A to itself regarded as a diagram over 
the terminal category 1; it is clear that this is indeed full and faithful. 


Remark 4.2.1 If C is a preorder, then so is Ind-C: for if the hom-sets C (A,, B,) 
each have at most one element, then so do the filtered colimits lim,¢ jC (A, B,), 
and hence so does the double limit lim,¢;limj¢yC (A;,B;). In particular, if 
B is a (small) poset, then Ind-B is equivalent to the ideal completion IB: in 
one direction, the equivalence maps a filtered diagram (a; | i € I) to the ideal 
{b € B| (at € T)(b < a;)}, and in the other it sends an ideal J C B to the inclu- 
sion J — B regarded as a filtered diagram in B. This ensures that the definition 
of continuity for categories, which we are about to give, genuinely extends that 
for posets which we studied in the last section. (We should mention in pass- 
ing that J. Koslowski [650] has proposed an alternative definition of ‘continuous 
category’, but it fails to have this ‘conservativity’ property.) 


Proposition 4.2.2 
(i) For any locally small C, the category Ind has filtered colimits. 
(ii) An object of Ind-C is finitely presentable iff it is a retract of an object of 
the form y(A), A € ob C. 
(iii) If C is Cauchy-complete, then it is recoverable (up to equivalence) from 
Ind-C, as the full subcategory (Ind-C),, of finitely presentable objects. 
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(iv) If C is cocartesian then so is Ind-C, and the embedding y: C — Ind 
preserves finite colimits. 


(v) If C is small and cocartesian, then Ind-C is complete and cocomplete. 


Proof (i) Let (A; | i € J) be the vertices of a filtered diagram in Ind-C, and 
suppose each A; is the ind-object (Bi, | j € Ji). We define a small category 
K as follows: its objects are pairs (i,j) with i € J and j € Ji, and mor- 
phisms (i,j) — (i’,j’) are pairs (a, f) where a: i —> 7 in I and f: By, > Biy 
is a representative for the jth component of the ath edge of the filtered dia- 
gram. Composition of morphisms of K is defined in the obvious way; we claim 
that it is a filtered category. We shall verify the third condition for filteredness 
(cf. B2.6.2(a)) — the other two are similar. Let 


(a, f) 
Cp os G7) 
(anh) 


be a parallel pair of morphisms in K. Since I is filtered, we can find 8: i’ — i” 
such that Ba = Ba’. Let g: Bij: > Bij be any representative of the j’th 
component of the Bth edge of the diagram: the composites gf and gf’ need 
not be equal, but they both represent the jth component of the Bath edge of 
the diagram, so they are coequalized by some B,v,” — By jy. Composing this 
morphism with g, we obtain a morphism g’ such that (G, 9’) coequalizes (a, f) 
and (a’, f’) in K. 

The assignment (2,7) ++ Bi, becomes a functor K — C, and hence an object 
of Ind-C, in an obvious way. Moreover, we have a morphism of ind-objects (Bi; | 
J € Ji) — (Bi; | (i,j) € K) for each i, whose jth component is represented by the 
identity morphism on B;;. It is straightforward to verify that these morphisms 
form a cone under the original filtered diagram, and that it has the universal 
property of a colimit. 

(ii) It follows easily from the construction of filtered colimits in (i) that they 
are preserved by the functors Ind-C (y(A),—). Thus any y(A) is finitely pre- 
sentable; hence so is any retract of such an object. Conversely, if A = (A; |i € I) 
is finitely presentable in Ind-C, then by expressing it as the colimit of the y(A;) 
we see that the identity morphism on A must factor through one of the maps 
y(A,) — A. 

(iii) follows at once from (ii) and the fact that y is a full embedding. 

(iv) It is clear that if 0 is initial in C then y(0) is initial in Ind-C. If (A, | i € J) 
and (B; | 7 € J) are two ind-objects, we define their coproduct to be the I x J- 
indexed diagram whose (i, 7)th vertex is A, + B,; the coprojection (A; | i € I) > 
(Aj + Bj |i € I,j € J) has as its ith component the equivalence class containing 
the coprojections A, — A; + B, for all 7, and it is straightforward to verify the 
Universal property of the coproduct (and that y( A+ B) = y(A)+ y(B)). To form 
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the coequalizer of two morphisms 


$ 
(Ai |i € 1) = (B; |j € J), 
Y 


we first form the category K whose objects are quadruples (i, j, f, g) where i € I, 
j € J and f, g: A; 3 B; represent the ith components of ¢ and w respectively, 
and whose morphisms (7,7, f,g) — (t, J’, f’, g') are pairs (a: i > it', 8: j > J’) 
such that the squares 


f 
A; 2—3 B; 
g 


Aa Bg 
F 
Ay ———$ By 
g' 


both commute; for k = (i,j, f,g) € K, we define Ck to be the coequalizer of f 
and g in C. Then it is straightforward to verify that K is filtered, and that the Ck 
define an ind-object (Cp | k € K). Not every object j of J necessarily occurs as 
the second component of an object of K, but for each j we can find a morphism 
j — j’ such that 7’ occurs in an object (i, 3J’, f,g), and thus we have a composite 
morphism B, > By — Cu,j',f,g); all such morphisms are moreover equivalent 
as morphisms into the ind-object (Ck | k € K), and we take their equivalence 
class to be the jth component of a morphism (B, | 7 € J) — (Ck | k € K). This 
morphism is readily seen to have the universal property of a coequalizer of ¢ 
and w. And it is again easy to see that the embedding y preserves coequalizers. 

(v) Cocompleteness follows from (i) and (iv). For completeness, it is easier to 
use the alternative representation of Ind-C as the closure of the full subcategory of 
representables under filtered colimits in [C°P, Set]; for in this case we may identify 
the latter with the category Cart (C°P, Set) of cartesian functors, by B3.2.5, and 
this is clearly closed under limits in [C°P, Set]. (Alternatively, we could use the 
Adjoint Functor Theorem to construct right adjoints for the diagonal functors 
Ind-C — [D,Ind-C] for arbitrary small categories D: note that the objects in the 
image of y form a separating set for Ind-C.) O 


Of course, the categories Ind-C where C is small (resp. small and cocartesian) 
are exactly the finitely accessible (resp. locally finitely presentable) categories 
studied in Section D2.3. 

We digress for a moment to fulfil a promise made in Section C2.2. First 
we need: 


Lemma 4.2.3 Let C be a small cocartesian category with pullbacks, such that 
finite colimits are stable under pullback in C. Then Ind-C is locally cartesian 
closed. 
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Proof We use the identification of Ind-C with Cart (C°P, Set). We recall from 
A1.1.7 that, for any functor F: C°P — Set, the slice category [C°P, Set]/F may 
be identified with [F°?, Set], where F — C is the discrete fibration corresponding 
to F. If F is cartesian, then it is easily verified that F inherits finite colimits and 
pullbacks, and the stability of the former under the latter, from C. Moreover, 
given a natural transformation G — F between functors C°P 3 Set, the functor 
G is cartesian iff the corresponding functor FP — Set is cartesian; so we may 
identify Cart (C°?, Set)/F with Cart(F°?, Set). Thus it suffices to show that 
Cart (C°P, Set) is cartesian closed; we shall do this by showing that it is an 
exponential ideal in [C°P, Set]. 

Suppose F is a cartesian functor C°P — Set, and let G be an arbitrary func- 
tor; then by the proof of A1.5.5 we know that the exponential FC in [C°P, Set] 
may be defined by taking F°(A) to be the set of natural transformations 
h(A) x G — F, for each object A of C. Now suppose A is the colimit of a finite 
diagram with vertices A;, and suppose we are given a compatible family of natu- 
ral transformations 0;: h(A;) x G — F. We may construct 0: h( A) x G — F as 
follows: given an element x € G(B) and a morphism B — A, we pull back the 
morphisms A; — A along the latter to obtain a colimit diagram B = lim B;, 
and then restrict x to a family of elements x; € G(B;). Then we obtain a family 
of elements 6;(z;,B; — A;) E F(A;,); but since F is cartesian these correspond 
to a unique element of F(A), which we define to be 6(z, B — A). It is straight- 
forward to verify that this does define a natural transformation @ as required, 
and that it sets up a natural bijection between F(A) and lim F°(A;). o 


Example 4.2.4 We may now give our promised example (due to F. Borceux 
and M. C. Pedicchio [156]) of a locally cartesian closed, locally finitely pre- 
sentable category which is not a quasitopos. Let C be the category Set, of finite 
or countably infinite sets: C clearly satisfies the hypotheses of 4.2.3, and so Ind-C 
is locally cartesian closed. We claim next that Ind-C is balanced (so that, by 
A2.6.3(ii), it is a quasitopos iff it is a topos); for this we need only the fact 
that every epimorphism in C is regular. For if A >> B is both monic and epic 
in Ind-C, then so is its pullback along any y(C) — B, and since B is a col- 
imit of objects in the image of y (and this colimit is stable under pullback along 
A >» B) it suffices to show that these pullbacks are all isomorphisms. So consider 
a morphism ø: (A; |i € I) —» y(C) which is both monic and epic. By the con- 
struction of 4.2.2(iv), we may identify the cokernel-pair of @ with the ind-object 
(C+,4, C |i € I) whose ith vertex is the cokernel-pair of the ith component of 
¢; so this ind-object must be isomorphic to y(C), and since the latter is finitely 
presentable it follows that the coprojections C 3 C +4, C must be equal for 
some i, i.e. some A; — C is (regular) epic. But then y(A;) — y(C) is regular 
epic in Ind-C, since y preserves coequalizers; and since it factors through the 
monomorphism @¢, the latter must be an isomorphism. 

Now we observe that the set N of natural numbers satisfies Freyd’s axioms 
for a natural number object (A2.5.5) in C, so by 4.2.2(iv) y(N) must satisfy them 
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in Ind-C, and if the latter were a topos then y(N) would be its natural number 
object. But y(N) is finitely presentable in Ind-C; and the natural number object 
cannot be finitely presentable in any non-degenerate cocomplete topos, since it 
is a countable copower of 1 and thus expressible as the filtered colimit of the 
finite copowers of t. So Ind-C is not a topos; hence it is not a quasitopos, either. 


The above counterexample should be compared with D3.3.9. In the latter 
example, the codomain of the Yoneda embedding is a topos, but the embedding 
does not preserve coequalizers; here, the embedding preserves coequalizers, but 
its codomain is not a topos. 

We now return to the general study of ind-completions. The mapping C +> 
Ind-C is functorial, and indeed 2-functorial, on CAT, in an obvious way; and the 
embedding y: C — Ind-C is the unit of a monad structure on this 2-functor. In 
fact this monad is a KZ-monad, as defined in B1.1.11; so an algebra structure 
for it on a category C is the same thing as a left adjoint for y. Explicitly, we have 


Lemma 4.2.5 The embedding y: C — Ind has a left adjoint iff C has filtered 
colimits. 


Proof Suppose the left adjoint exists; let us call it F for the moment. For any 
object (A; | 7 € I) of Ind-C and any object B of C, we have 


C (F(A; |i € I), B) S Ind-C ((A; | i € J), y(B)) = limjerC (Aj, B), 


so F(A, | i € I) must be an actual colimit in C of this filtered diagram. Con- 
versely, if C has filtered colimits, then by defining F(A; | i € I) to be such a 
colimit we clearly obtain a functor Ind-C — C left adjoint to y. CJ 


In view of the above result, we denote the left adjoint of y when it exists 
by lim. 


Corollary 4.2.6 Ind is the free filtered-colimit completion of C; that is, for 
any category D with filtered colimits, any functor F:C — D eztends, uniquely 
up to canonical isomorphism, to a filtered-colimit-preserving functor Ind-C — D. 


Proof Uniqueness of the extension is immediate from the fact that every 
object of Ind-C is a filtered colimit of objects of C. For the existence, consider 
the composite 
Ind-F lim 
Ind-C £25 Ind-D ——> D; 
note that a functor of the form Ind-F always preserves filtered colimits, from the 
way in which we constructed them in 4.2.2. O 


Definition 4.2.7 We define a category C to be continuous if it has filtered 
colimits and the functor lim : Ind-C — C itself has a left adjoint. We shall denote 
this left adjoint by w if it exists. 
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Paralleling 4.1.4, we have what is essentially a special case of B1.1.15: 


Proposition 4.2.8 For a category C with filtered colimits, the following are 
equivalent: 


(i) C is continuous. 


(ii) C is a coadjoint retract of a category of the form Ind-D by functors 
preserving filtered colimits. 


(iii) C is a retract of a category of the form Ind-D by functors preserving filtered 
colimits. 


(iv) C is a retract of a continuous category by functors preserving filtered 
colimits. 


Proof (i) > (ii) is trivial: note that the unit of the adjunction (w 4 lim) is 
necessarily an isomorphism, since the counit of (lim + y) is an isomorphism. 

(ii) = (iii) is trivial. 

(iii) > (iv): We have to show that categories of the form Ind-D are continuous; 
we have already observed that they have filtered colimits and that functors of 
the form Ind-F preserve them, so that y: Ind-D — Ind-Ind-D is right adjoint to 
lim : Ind-Ind-D — Ind-D, naturally in D. It now follows from the general theory 
of KZ-monads (see B1.1.12) that Ind-y: Ind-D — Ind-Ind-D is left adjoint to 
lim. In fact it is not hard to show this directly: let (B; | i € I) be a filtered 
diagram in Ind-D, and suppose that B = lim(B; | i € J) is the filtered diagram 
(A, | 7 € J) in D. Then, because the y(A;) are finitely presentable in Ind-D, the 
canonical morphisms y(A,) — B in Ind-D each factor in an essentially unique 
way through some leg B; — B of the colimiting cone. In this way we obtain a 
well-defined morphism 


Ind-y(B) = (y(Aj) |j € J) — (Bi | i€ TD) 


in Ind-Ind-D, whose image under lim is the identity map on B. It is now straight- 
forward to verify that this morphism is a component of a natural transformation 
from Ind-y- lim to the identity functor on Ind-Ind-D, which satisfies the triangu- 
lar identities with the isomorphism ljng-p & lim -Ind-y. So we have the required 
adjunction. 

(iv) = (i): Suppose C is a retract of a continuous category D. Then we may 
apply B1.1.10 to the diagram 


gae e ean O Wg 
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whose commutativity is just the statement that r and i preserve filtered colimits. 
(To do this, we need to consider the diagram above as lying in a (meta-)2- 
category in which idempotent 2-cells split; CAT will not serve for this purpose, 
but in fact the diagram lies in the full sub-2-category CAL of Cauchy-complete 
categories (cf. A1.1.10), since all the categories in it have filtered colimits. Recall 
that splitting an idempotent may be regarded as forming the colimit of a diagram 
over the two-element monoid M = {1,e} with e? = e, and this monoid is a 
filtered category.) O 


The next two results are sometimes helpful in identifying which categories 
are continuous. 


Lemma 4.2.9 If C is a continuous category, then so is C/A for any object 
A of C. 


Proof From the ‘double-limit’ definition of morphisms in Ind-C, it is easy 
to see that a morphism from an arbitrary ind-object (B; | i € I) to y(A) is 
the same thing as a cone under (B; | i € J) with vertex A, and hence that 
Ind-(C/A) is isomorphic to Ind-C/y(A). It is now easy to see that if C is contin- 
uous then we may ‘lift’ the left adjoint w of lim : Ind-C — C to a left adjoint for 
lim : Ind-C/y(A) > C/A. o 


Proposition 4.2.10 Let C be a category with filtered colimits and pullbacks. 
Then C is continuous iff 


(i) filtered colimits are stable under pullback in C, and 


(ii) for each object A of C, the category Ind-C4 of ind-objects in C with colimit 
A, and morphisms between them inducing the identity morphism on A, has 
an initial object. 


Proof First we show that if C is continuous and has pullbacks then it nec- 
essarily satisfies (i). (This is, of course, the analogue for categories of the first 
assertion of 4.1.2, that a continuous semilattice is a preframe.) Suppose given an 
ind-object (B; | i € 7), and a morphism f: A — lim; B,; = B. Then we may make 
the pullbacks A x g B; into the vertices of an J-indexed diagram, with colimit A’ 
say. and we clearly have an induced morphism g: A’ — A. But we may transpose 
f toa morphism w(A) — (B, | i € J) in Ind-C; and each representative A, > B, 
for the jth component. of this morphism may be factored through the correspond- 
ing A xg B,, yielding a morphism of ind-objects w(A) > (Axp B, |i €T) 
Applying the functor lim to this morphism yields a morphism h: A — A’ in C. 
Now the composite gh: A — A’ — A is the identity, since we may regard it as 
the effect of applying lim to the composite w(A) — (A x p B; |i € I) > y(A): 
hence in particular h is split monic. But since fgh = f, every morphism in the 


colimit cone (A xg B; — A’ | i € I) factors through h; so g and h are inverse 
isomorphisms. 
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Given that (i) holds, it is now easy to see that the functor lim: Ind-C — C 
is a fibration in the sense of B1.3.4: to obtain a prone lifting of a morphism 
A — B = lim(B; | i € J) in C, we form the ind-object (A xg B; | i € I), 
and the morphism from this ind-object to (B; | i € I) whose ith component 
is the equivalence class of the projection A xg Bi — B;. And it is well known 
that specifying an initial object in each of the fibres of a fibration II: S — T is 
equivalent to specifying a left adjoint for I such that the unit of the adjunction 
is an isomorphism. O 


Corollary 4.2.11 Let C be a continuous category with pullbacks and images. 
Then, for any object A of C, Subc(A) is a continuous semilattice. 


Proof By 4.2.9, it suffices to consider the case (when C has a terminal object 
and) A = 1. In this case, we know that the support functor 0: C — Subc(1) 
preserves filtered colimits, since it is left adjoint to the inclusion; we claim that 
the inclusion does so too (i.e. that a filtered colimit in C of subterminal objects 
is subterminal), from which the result will follow by 4.2.8 (plus 4.2.1). But if 
(U; | i € I) is a filtered diagram of subterminal objects, we have 


lim, U; x lim;U; = limy x1(Uj x U;) 


by two applications of condition (i) of 4.2.10; and the latter is isomorphic to 
lim ;(U; x Uj) since the diagonal I — I x I is a final functor. So lim ;U; is 
subterminal. O 


Next, we introduce the analogue for continuous categories of the way-below 
relation which we used in defining the notion of continuous poset. We note that, 
in a continuous poset B, the relation a << b holds iff | (a) G {(6) in IB; this 
provides the motivation for the following definition. 


Definition 4.2.12 Let C be a continuous category, A and B two objects of C. 
By a wavy arrow from A to B in C, we mean a morphism y(A) — w(B) in Ind-C. 
We write f: A~ B for ‘f is a wavy arrow from A to B’. 


Equivalently, if w(B) is the ind-object (B; | i € T), then a wavy arrow A ~ B 
is an equivalence class of ‘straight’ arrows A — B; in C, where the equivalence 
relation identifies (A — B;) with (A — B; ) iff there exist i — i” and i’ > i” in 
I such that the diagram 


A ———> B; 


By —> By 


commutes. 
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Since lim y(A) = A and lim w(B) = B, applying lim to a wavy arrow 
f: A~ B yields an underlying straight arrow A — B, which we denote by e(f). 
However, the mapping f + e(f) is not injective in general; thus we have to think 
of waviness as an additional structure carried by a morphism of C, rather than 
as a property of such morphisms. 


Example 4.2.13 Let G be a group, and consider the category [G, Set] of left 
G-sets. As in any locally finitely presentable category (cf. the proof of (iii) = (iv) 
in 4.2.8), the ind-object w(B), for a G-set B, may be identified with the filtered 
diagram of all morphisms from finitely presentable G-sets to B; in particular, 
w(1) is the filtered diagram of all finitely presentable G-sets. And it is easy to 
see that a G-set is finitely presentable iff it has finitely many orbits, and the 
stabilizer of each element is a finitely generated subgroup of G (cf. D2.4.1). In 
particular, if G itself is not finitely generated, then the projections G x G 3 G 
both define wavy arrows G x G ~ 1 in [G,Set], which are not equal since the 
projections cannot be coequalized by any map from G to a finitely presentable 
G-set. But they clearly have the same underlying straight arrow. 


A wavy arrow f: A~» B may be composed with a straight arrow g: A’ > A 
or h: B — B', by forming f - y(g) or w(h)- f as appropriate; these compositions 
are associative and commute with each other, so that the function W sending 
(A, B) to the set of all wavy arrows A ~> B may be viewed as a profunctor 
C % C, as defined in Section B2.7. In addition, we have 


Lemma 4.2.14 Let f: A~ B and g: B~C be two wavy arrows in a con- 


tinuous category C. Then the composites g - e(f) and e(g)- f are equal as wavy 
arrows A ~ C. 


Proof It is easy to see that each is the composite 


EE E ie 


in Ind-C. where i is the transpose across (w 4 lim) of the isomorphism B = 
lim y(B) (equivalently, the transpose across (lim 4 y) of lim w(B) = B}. E 


Lemma 4.2.14 tells us that we have a well-defined composition for pairs of 
wavy arrows; and it follows easily from the definitions that the eight possible 
associative laws h-(g- f) = (h-g)- f, where each of f, g and h may be either wavy 
or straight, are all satisfied. In particular, taking g to be straight and the other 
two to be wavy, we should be entitled to regard composition of wavy arrows as à 
morphisin of profunctors u: W Qe W > W, if only the domain of this morphism 
were defined. The problem is that, in order to form (W @c W) (A, B), we have 
to form a suitable quotient of the disjoint union Leeon ce W (C, B) x W (A, C): 
and since C is not a small category, the latter will be a proper class in general. 

However, we note that each wavy arrow C ~ B can be factored as C > 
B, ~> B, where B, is one of the vertices of the ind-object w(B) = (B, |i € J) 
and the wavy arrow B, ~ B corresponds to the identity morphism on B,. Since 
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there is only a set of such B;, for each B, it follows that we may reduce the 
proper class above to the set ] J -; W(B;, B) x W (A, Bi); more explicitly, we may 
define (W @c W) (A, B) to be lim;e;W (A, B;). Thus u becomes a well-defined 
morphism of profunctors. 

The following result is the analogue for continuous categories of the interpo- 
lation property for the way-below relation in a continuous poset. 


Proposition 4.2.15 The morphism u: W &c W —> W just defined is an iso- 
morphism of profunctors. 


Proof To show that u is surjective, we must show that each wavy arrow can 
be factored as a composite of two wavy arrows. But if w(A) is the ind-object 
(A; | i € I), and w(A;) = (B;; | j € J;) for each i, then the w(A;) form a diagram 
of shape J in Ind-C, and we can form its colimit (B; | (i,j) € K) as in 4.2.2(i). It 
is clear that a cone under this filtered diagram in C must factor uniquely through 
each A; and hence through A; that is, lim(B,; | (i,j) € K) = A. Transposing, 
we get a morphism of ind-objects w(A) — (Bi; | (i,j) € K); hence every A; > A 
factors through some B; — A. This suffices to prove surjectivity. 
For injectivity, suppose we have a commutative square of wavy arrows 


f 
Lg th 
? i 
Ù p oY 


Since w(D) is filtered, we may respresent h and k by morphisms h;: B — D; and 
kı: C — D; into the same vertex D; of w( D). Now the square 


e(f) 


A B 
| (9) h 
k; 


a 


C ——} D, 


| 


need not commute, but both ways round represent the same wavy arrow A ~ D, 
so we can find D; > Dy in w(D) coequalizing them. Even so, the square 


Í 


Å nannnnn n> 


B 
0g hj 
l 

y ku 
C D 


p 
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need not commute at the wavy level; but since the two ways round it have the 
same underlying straight arrow, it follows from 4.2.14 that they have the same 
composite with (the underlying straight arrow of) any wavy arrow with domain 
Di. So we may use the first part of the proof to factor the wavy arrow Dy ~ D 
as a composite D; ~ Dy ~> D, and replace hy and ky by their composites 
with the underlying straight arrow of D; ~ Div. We thus have diagrams 


A wrna > Dy and 


f? 
? 
Y 
B 


in which all cells commute at the wavy level, which shows that (h, f) and (k,g) 
are equal as elements of W & W (A, D). 


In view of 4.2.15, we may consider the inverse of j: as a morphism of pro- 
functors W — W & W. Since u is associative, p7! is coassociative; and from the 
way in which jz was defined it is easy to see that e: W — C (—, —) is a counit for 
u! (recall from B2.7.3 that the unit for profunctor composition is the Yoneda 
profunctor C (—, —)). We thus have 


Theorem 4.2.16 For any continuous category C, the structure (W,€, #7") 15 
an ıdempotent profunctor comonad on C. O 


Of course, 4.2.16 has to be read cum grano salis, since profunctor composition 
for profunctors between large categories is not defined in general (as we saw, it is 
merely a happy accident that W@W is defined), and so we do not have a genuine 
bicategory for this comonad to live in. However, 4.2.19 below provides a more 
legitimate version of the theorem, applicable to all the continuous categories we 
shall be interested in. 


A further property of wavy arrows which we shall require is the following: 


Lemma 4.2.17 LetC be a continuous category. Then any finite colimits which 
exist ın C are also ‘wavy colimits’; that is, given a finite (straight-edged) diagram 
in C unth vertices (A,,...,An) and a colimit cone (Aj: A> L| 1 <1 n), if 
we are given any ‘wavy cone’ under the diagram (that is, a family of wavy arrows 
(fi: A~ B|1<i< n) which commute with all the edges of the diagram), then 
there exists a unique wavy arrow f: L~ B such that fr, = f; for allt. 


Proof Suppose given a wavy cone as above. Let (B, | j € J) be the ver 
tices of the ind-object w(B); then we can represent each f; by a straight arrow 
gı: A; — B, for some j, and since there are finitely many of them we can choose 
the same j for each i. Even so, given an edge A; — Aj; of the diagram, the 
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triangle 


need not commute; but the two ways round it are coequalized by some B, > By, 
since they represent the same wavy arrow A; ~ B. Again, since there are finitely 
many edges in the diagram, we can choose a single B, — Bj coequalizing all such 
triangles; then we have a straight-edged cone under the diagram with vertex B;/, 
and hence an induced morphism g: L — By, which we can regard as representing 
a wavy arrow f: L~ B. 

This establishes the existence of f; for the uniqueness, suppose we had 
another such morphism f’. We could represent f’ by a straight arrow 
g': L — By for some j”; then, for each i, the composite g'A; would represent 
the same wavy arrow A; ~ B as gi, and so we could find morphisms j! — j’” 
and j” —> j/” in J such that 


gi 
A; ————> By 


g' Xi 


B,» > Bjr 


commutes. Doing this for all the A;, we obtain a vertex B,» of w(B) such 
that, when we compose g and g’ with the morphisms into B,, they become 
factorizations of the same cone through L, and therefore equal; so f and f’ are 
equal as wavy arrows L ~ B. O 


We shall be interested in two ‘refinements’ of 4.2.8, dealing with the cases 
when C is a retract of Ind-D for some small category D, and when C is cocom- 
plete. In connection with the first of these, we note that if C S Ind-D where D 
is small, then C contains a small full subcategory such that every object of C is 
expressible as a filtered colimit of objects of the subcategory. We shall say that 
a category C with filtered colimits is ind-small if it contains a small full subcate- 
Bory D with this property; more precisely, if there exists a functor h: C - Ind-D 
such that the composite 

h lim 
C ——+> Ind-D —— > Ind-C ———+>C 
is isomorphic to the identity. A sufficient condition for this is that D should be a 
dense full subcategory (i.e. one such that, for each A € ob C, the canonical cone 
under the forgetful diagram (D| A) — C is colimiting) for which the comma 
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category (D | A) is filtered for all A. (If C is cocartesian, the latter condition 
will hold provided D is closed under finite colimits in C. ) However, we do not 
actually require D to be dense — though it is so when (as in the case C = Ind-D) 
its objects are finitely presentable in C, since then the obvious functor from (the 
index category of) h(A) to (D | A) is final. 


Proposition 4.2.18 For a locally small category C, the following are 
equivalent: 


(i) C is continuous and ind-small. 


(ii) C expressible as a retract, by filtered-colimit-preserving functors, of a cat- 
egory of the form Ind-D where D is small (equivalently, of a finitely 
accessible category). 


(iii) C ıs expressible as a coadjoint retract of a category as in (ii). 


Proof (iii) > (ii) is obvious. For (ii) = (i), we have already observed that a 
category of the form Ind-D with D small is (continuous and) ind-small, so we 
need only show that this property is inherited by retracts. So suppose we have 
filtered-colimit-preserving functors 


(EE, , HERE EY 6 


with ri = 1, and a small subcategory D of C with the required property. Let D 
be the full subcategory of C on the objects r(B), B € ob D. Then the composite 


Dae EREE ce iy PO aa 


is easily seen to have the required property, using the fact that r preserves filtered 
colimits. 

(i) = (iii): Suppose C is ind-small. Let D be the small full subcategory of C 
provided by the definition, and consider the composite 


Ind-h lim 
C2 Ss Inde SS natn D — SnD 

that is. given an object A of C, if w(A) is the ind-object (A; | i € J) and 
h(A,) = (Bi, | 9 € Ji) for each i, we consider the colimit of the (Biz), con- 
structed as in 4.2.2(i), as an ind-object k(A) in D. For each pair (i,j), W 
have a morphism B,, — A;, and these morphisms clearly define a a hie 
k(A) — w(A) in Ind-C; but it is also easy to see that we have lim k(A) = A. 
so the adjunction yields a morphism w(A) — k(A) in Ind-C. A further straight- 
forward verification shows that these two morphisms are inverse to each other. 
so we have shown that (up to isomorphism) the functor w:C — Ind-C factors 
through the full embedding Ind-D — Ind-C induced by D C C. As such, it is 
clearly still left adjoint to the restriction of lim: Ind-C — C to Ind-D, and so it 
preserves filtered colimits; but so does the restriction of lim, since the inclusion 
Ind-D — Ind-C preserves hei: 
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Remark 4.2.19 In the situation of 4.2.18, we may restrict W to a profunctor 
D +} D; and, given that we have seen that the functor w: C — Ind-C may be 
taken to factor through Ind-D, it is clear that the proof of its idempotency which 
we gave in 4.2.14 is still valid. That is, we now have a (legitimate) profunctor 
comonad (W,,1~') on D. We shall see in the next section that the continu- 
ous category C may be reconstructed, up to equivalence, from D together with 
this data. 


Concerning the second refinement mentioned above, we know from 4.2.2(iv) 
that Ind-C is cocomplete if C is locally small and cocartesian. The converse holds 
if C is Cauchy-complete, since in any category D with filtered colimits the full 
subcategory D,, of finitely presentable objects is closed under whatever finite 
colimits exist in D. 


Proposition 4.2.20 For a locally small category C, the following are 
equivalent: 


(i) C is continuous and cocomplete. 


(ii) C is a retract, by functors preserving filtered colimits, of a category of the 
form Ind-D where D is cocartesian. 


(iii) C is a coadjoint retract of a category as in (ii). 


Proof (i) > (iii) since continuity of C implies that it is a coadjoint retract of 
Ind-C. (iii) = (ii) is trivial; and for (ii) = (i) we have merely to observe that C 
inherits cocompleteness from Ind-D — which is, once again, an easy application 
of B1.1.10, given that C necessarily inherits filtered colimits from Ind-D, and is 
therefore Cauchy-complete. O 


We may also combine 4.2.18 and 4.2.20, as follows: 


Corollary 4.2.21 For a locally small category C, the following are 
equivalent: 


(i) C is continuous, ind-small and cocomplete. 


(ii) C is a retract, by functors preserving filtered colimits, of a category of the 
form Ind-D where D is small and cocartesian (equivalently, of a locally 
finitely presentable category). 


(ili) C is a coadjoint retract of a category as in (ii). 


Proof (iii) = (ii) is as usual trivial, and (ii) => (i) follows immediately from the 
corresponding parts of 4.2.18 and 4.2.20. For (i) = (iii), suppose C is ind-small 
and cocomplete; let D be a small full subcategory satisfying the condition for 
ind-smallness. Then the closure D of D under finite colimits in C is still small, 
and it also satisfies the condition; so, working through the proof of 4.2.18(i) > 
(iii), we deduce that C is a coadjoint retract of Ind-D. g 


Suggestion for further reading: Johnstone & Joyal [542]. 
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C4.3 Injective toposes 


In this section, we consider the notion of injectivity for toposes corresponding to 
that for locales (or sober spaces) which we met in 4.1.6. (Throughout this and 
the next section, we shall work over a base topos S, which is assumed to have a 
natural number object; but we shall generally treat it notationally as if it were 
the classical category of sets.) First, we must pay attention to the fact that we 
are working in a 2-category, and therefore need to take some care in defining the 
notion of ‘injective object’. If we are given a class M of 1-cells in a 2-category KR, 
the simplest notion of ‘injective object with respect to M’ would be an object 
A such that, for every diagram 


B—~>C 


A 


with m € M, there exists a morphism g: C — A and a 2-isomorphism gm = f. 
We shall call this notion weak injectivity with respect to M. However, it seems 
reasonable that the operation of ‘extending along m’ should be defined for 2-cells 
as well as 1-cells with codomain A, which leads to the following stronger notion: 
A is strongly injective with respect to M if, for every m: B —> C in M, we are 
given a functor m* : R(B, A) + £ (C, A) and a natural isomorphism fitting into 
the triangle 


R(B, A) as R(C, A) 
(—)om 


RK (B, A) 

A still stronger notion is that of a complete injective, which is dual to the notion 
of cocomplete injective that we met in B1.1.16. Explicitly, we say that A is 
a complete M-anjective if, for every morphism f: BC of &, the functor 
R(C, A) — R(B, A) obtained by composition with f has a right adjoint f*. 
the counit of the adjunction being an isomorphism whenever f € M. 

Fortunately, in the 2-category BTop/S of bounded toposes over a base & 
with natural number object, the three notions of injectivity coincide: 


Theorem 4.3.1 Let S be a topos with natural number object. For a bounded 
S-topos E, the following are equivalent: 


(i) E 28 a retract in BLop/S of a functor category (CoP, S] where C us 
cartesian. 
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(ii) E is a complete injective (with respect to the class of inclusions) in 


BTop/S. 
(iii) E is strongly injective in BLop/S. 
(iv) E is weakly injective in BLop/S. 


Proof The implications (ii) > (iii) and (iii) > (iv) are clear. (iv) = (i) follows 
from the fact that every bounded S-topos E is a subtopos of [C°?, S] for some 
cartesian C — if E is weakly injective then this inclusion must split. So it suffices 
to prove (i) => (ii), for which we show first that toposes of the form [C°?, S] are 
complete injectives and then that the class of complete injectives is closed under 
retracts. 

For the first assertion, we recall from B3.2.5 that, for a cartesian internal 
category C, C°P-torsors are the same thing as cartesian functors defined on 
C: thus, for any geometric morphism g: G — F over S, geometric morphisms 
F — [C°P, S] over S correspond to cartesian S-indexed functors C — F, and 
the operation of composing a geometric morphism with g corresponds to that of 
composing a cartesian functor with (the indexed version of) g*: F — G. But the 
latter has an S-indexed right adjoint, namely gx: G — F; so composition with 
the latter yields the functor g? which we seek. And the fact that, when g is an 
inclusion, we have gt(f)g & f is immediate from the definition of an inclusion. 

Now suppose € is a retract of a complete injective €. Given a geometric 
morphism g: G —> F, we may apply B1.1.10 to the diagram 


BTCop/S (F,E) —> BTop/S (F,E) —> BTop/S (F,E) 


BTop/S(G,E) —> BTop/S (G,E) —> BTop/S (G, €) 


where the vertical arrows are induced by composition with g and the horizontal 
ones by composition with the retraction morphisms E S E. (As in the proof of 
4.2.8, we have to regard this diagram as lying in the meta-2-category of Cauchy- 
complete categories; but this is permissible, by A4.1.15.) This yields the right 
adjoint g* for the topos E, and the fact that the adjunction is a reflection if g is 
an inclusion. O 


We pause to note a couple of elementary consequences of this result in the 
case S = Set: 


Corollary 4.3.2 Suppose E is an injective Grothendieck topos. Then 


(i) E satisfies De Morgan’s law (cf. D4.6.2); 
(ii) the terminal object of E is an indecomposable projective (cf. A1.1.10). 
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Proof (i) If C is cartesian, then the functor category [C°P,Set] satisfies De 
Morgan’s law by D4.6.3(a); and De Morgan’s law is inherited by retracts in Top, 
by D4.6.7(ii). 

(ii) Again, the fact that the terminal object of [C°?, Set] is an indecomposable 
projective when C is cartesian (or, more generally, when it has a terminal object) 
follows from A1.1.10; and it is easy to see that these properties are inherited by 
retracts in Yop. o 


Note that part (ii) of the corollary ensures that we have no hope of strength- 
ening De Morgan’s law to Booleanness in (i): the only Boolean injective topos 
(over Set) is Set itself. 

The connection between injectivity and continuous categories, as studied in 
the last section, is given by the observation that the category of points of an 
injective topos must be continuous. (Note that in this section, by ‘continuous cat- 
egory’ we invariably mean ‘continuous S-indexed category’; but, in dealing with 
continuous categories, we shall continue to argue as if our base topos were Set.) 

Given a bounded S-topos E€, we shall write Pt(€&) for the category 
BLop/S(S,E) and call it the category of points of E€. By B3.4.8, we know 
that this category has filtered S-indexed colimits. Moreover, the proof of B3.4.8 
shows that, given a geometric morphism f: F — E between bounded S-toposes, 
the functor Pt(#) — Pt(€) induced by composition with f preserves filtered 
S-indexed colimits. That is, 


Lemma 4.3.3 The assignment E ++ Pt(E) defines a 2-functor from BLop/S 
to the meta-2-category FILVs of S-indexed categories with filtered colimits. 
functors preserving filtered colimits and (arbitrary) natural transformations. U 


Now if C is an internal category, we saw in B3.2.7 that the category Pt([C, S]) 
is equivalent to the category Tors(C) C [C°?, S] of C-torsors; but, as we observed 
at the start of the last section, the latter is in turn equivalent to Ind-C. We may 
thus conclude 


Proposition 4.3.4 Let E be a topos which ıs expressible as a retract in 
BLop/S of a functor category |C, S]. Then Pt(€) is an ind-small continuous 


category over S. If E is injective in the sense of 4.3.1, then Pt(€) ıs also 
S-cocomplete. 


Proof In the case when £ is of the form [C, S], then Pt(€) = Ind-C is contin- 
uous by 4.2.8, and ind-small by 4.2.18. The general case follows from 4.3.3 and 
4.2.18. And if £ is injective, then it can be expressed as a retract of [C, S] where 
C is cocartesian, so that Ind-C and its retract Pt(€) are cocomplete by 4.2.21. 


But there is also a converse to 4.3.4. Recall from 4.2.2(iii) that, provided C is 
Cauchy-complete, it may be recovered up to equivalence from Ind-C as the full 
subcategory of finitely presentable objects. Hence the functor category [C, 5] 
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(which, as we saw in A1.1.9, depends only on the Cauchy-completion of C — 
as does Ind-C) may also be recovered from Ind-C. In fact we have 


Lemma 4.3.5 For any internal category C in S, {C,S] 1s equivalent to the 
category $ JLT (Ind-C,S). 


Proof This is just a particular case of the fact, established in 4.2.6, that 
Ind-C is the free filtered-colimit-completion of C, since S has filtered S-indexed 
colimits. o 


We note that finite limits and arbitrary colimits in the category 
SILT ys (Ind-C,S) may be constructed pointwise, from which it follows that, 
for any filtered-colimit-preserving functor F: Ind-C — Ind-D, the functor 
SILT s (Ind-D,S) + FILTs (Ind-C,S) induced by composition with F is geo- 
metric, and hence an inverse image functor by 2.2.10. That is, FILTs (—,S) 
may be regarded as a functor R + BTop/S, where £ is the full sub-2-category 
of FILTs whose objects are finitely accessible S-indexed categories (i.e. cate- 
gories of the form Ind-C with C small). On splitting the idempotents in &, we 
immediately deduce 


Corollary 4.3.6 Let P be an ind-small continuous S-indexed category. Then 
there is a bounded S-topos E, determined up to equivalence by P, which is a 
retract of a functor category {C,S] and satisfies Pt(E) ~ P. If in addition P is 
S-cocomplete, then E is injective. 


Proof We have seen that the 2-functors Pt and FILTs (—,S) define an equiv- 
alence between the 2-category & and the full sub-2-category of BYop/S whose 
objects are functor categories [C,S]; so they must extend to an equivalence 
between the Cauchy-completions of these two 2-categories. For the final asser- 
tion, we use 4.2.21 and 4.3.1(i). oO 


We note that, since every ind-small continuous category appears as a coad- 
joint retract of an object of £, it follows that every topos which occurs as a retract 
of a functor category [C, S] is actually a coadjoint retract of such a category. This 
is clearly related to the equivalence of the injectivity notions considered in 4.3.1 — 
though it does not seem to be directly deducible from the latter, even in the case 
when C has finite colimits. 

Suppose now that € is a retract in BYop/S of a functor category [C, S}. Then 
the retraction r: [C, S] — E€ and the inclusion i: E — [C, S] compose to yield an 
idempotent endomorphism of [C, S]. However, the image of this endomorphism, 
in the sense of the factorization of A4.2.10, need not be equivalent to €; nor need 
it be injective if C is cocartesian. 


Example 4.3.7 Let C be a small category, A an object of C with more than one 
endomorphism, and let E:C — C be the constant functor with value A. Then 
E induces an endomorphism e of the topos [C, Set], which is clearly idempotent 
since E is; but its image in the sense of A4.2.10 is the topos [M, Set] where 
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M is the monoid of all endomorphisms of A in C, by A4.2.12(b) — and [M, Set] 
is not injective for any nontrivial M, since its terminal object is not projective 
(cf. 4.3.2(ii)). On the other hand, the image of e in the idempotent-splitting 
sense is simply Set, since the image of E in this sense is the terminal category 1. 


Since BTop/S has finite limits, we know that any idempotent 1-cell in this 
2-category does have a splitting. Moreover, we have 


Lemma 4.3.8 Let E be a bounded S-topos, and e: E — E a geometric mor- 
phism which carries an idempotent monad or comonad structure in BYop/S 
(cf. B1.1.9(c)). Then the image of e in the idempotent-splitting sense coincides 
with its image in the sense of A4.2.10. 


Proof We shall deal with the comonad case; the other is very similar. Let 


Ecm eS 
be a splitting of the idempotent e. Then, as we saw in B1.1.9(c), r is right 
adjoint to i in BLop/S (that is, r* is left adjoint to 7*), and the unit 1y —> i*r* 
is an isomorphism. It follows that the counit i*i, — lp of (i* 4 ią) is also an 


isomorphism, i.e. 7 is an inclusion. So the displayed diagram is also the surjection— 
inclusion factorization of e. d 


Combining 4.3.4, 4.3.6 and 4.3.8, we have: 
Theorem 4.3.9 For a bounded S-topos E, the following are equivalent: 
(i) E is a retract of an S-topos of the form |C, S]. 
(ii) E is a coadjoint retract of a topos of the form [C, S]. 
(iii) There exists a local morphism [C,S] — E over S, for some internal 


category C in S. 


Moreover, E satisfies the injectivity conditions of 4.3.1 iff it satisfies any of the 
above conditions with the additional hypothesis that C is cocartesian. 


Proof The equivalence of (i) and (ii) follows as already indicated from 4.3.4, 
4.3.6 and 4.2.18. Since the left adjoint of a local morphism is necessarily an 
inclusion by 3.6.1(iv), condition (iii) is equivalent to saying that £ is the image 
in the sense of A4.2.10 of an idempotent comonad on [C, S]. So 4.3.8 implies that 
this is equivalent to (ii). The final assertion is immediate from 4.3.1. z 


An S-topos which is a retract of one of the form [C, S] is called quasi-injectwe. 
Using 4.3.9, we may deduce various properties of such toposes: for example, 
Corollary 4.3.10 Let p: E — S be an S-topos. 

(i) If E is quasi-injective, then p rs locally connected. 


(ii) If E is injective, then p is totally connected. 
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Proof (i) We already know that toposes of the form [C, S] are locally connected 
over S; so the result follows by applying 3.3.2(iv) to a triangle 


E 
S 
where f is local (and hence connected). 
(ii) In this case, we may choose C to be cocartesian and hence filtered; so 
the left adjoint of p*, which is the composite limc f* by the proof of 3.3.2(iv), 
is cartesian by B2.6.8. o 


[C, R 


We remark that injective S-toposes are also local over S, by 3.6.3(b) and 
3.6.4(ii). Indeed, they are actually hyperlocal (cf. B4.4.20(a)), and even carry 
algebra structures for the ‘measure topos’ monad of B4.5.8, as we observed in 
B4.5.10. 


Remark 4.3.11 In connection with 4.3.9(iii), we note that € admits a totally 
connected morphism from [C, S] iff it is an adjoint retract of [C, S], by 4.3.8. 
However, not every quasi-injective S-topos can be represented in this way. In fact, 
if there exists a totally connected morphism (or even an S-essential surjection) 
f: [C, S] — E over S, then € must itself be of the form [D, S], by 2.2.20: for fı 
preserves S-indecomposable projectives since it has an S-indexed right adjoint 
which preserves epimorphisms and S-indexed coproducts, and the images under 
f; of the representable functors form a separating family for € since the right 
adjoint of f, is faithful (cf. B2.4.2(d)). Thus we have shown that all idempotent 
monads (unlike the idempotent comonads) in the 2-category DTop/S of diagram 
toposes over S are already split. 


To conclude this section, we give a characterization of quasi-injective toposes 
in terms of sites, which we shall require in the next section. Let e: [C, S] — [C, S] 
be an idempotent geometric morphism. By the description on geometric mor- 
phisms between functor categories in terms of profunctors given in B3.2.8(c), we 
have an idempotent profunctor W: C +} C such that W is a torsor for the right 
action of C, which induces the functor f* in the sense that for any discrete opfi- 
bration F — C we have f*(F) = W @cF. If we think of the elements of W (U, V) 
as wavy arrows from U to V, as before, the condition that W should be a torsor 
is equivalent to saying that, for each V, the ‘wavy slice category’ (CY V) whose 
objects are the wavy arrows with codomain V, and whose morphisms are straight 
arrows between their domains forming commutative triangles, is filtered. 

We now assume (as we are entitled to do, by 4.3.9) that W is actually an 
idempotent profunctor comonad. This means that each wavy arrow f: U ~ V 
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has an underlying straight arrow e( f): U — V, and that € preserves composition 
in the appropriate sense. Given such a W, we define a coverage Jw on CP as 
follows: a cosieve on U is Jw-covering iff it contains the underlying straight 
arrows of all wavy arrows with domain U. (Thus each object U has a smallest 
J-covering cosieve Ry, consisting precisely of the underlying straight arrows of 
all wavy arrows with domain U; however, Jw is not in general a rigid coverage 
in the sense of 2.2.18 — cf. 4.3.13 below.) It is clear that this definition satisfies 
the stability property (C) of 2.1.1; but it also satisfies the properties (M) and 
(L) of 2.1.8 (the latter since the idempotency of W means that every wavy arrow 
can be factored as the composite of two wavy arrows), so it is a Grothendieck 
coverage. 


Lemma 4.3.12 Under the above assumptions, the topos Shs(C°?, Jw) is the 
image (in either of the senses of 4.3.8) of the idempotent geometric morphism 
e: [C, S] — [C, S] corresponding to W. 


Proof To identify the image of e in the sense of A4.2.10, we have to determine 
which subobjects R — C (U, —) (i.e. which cosieves on an object U of C) are 
mapped to isomorphisms by the functor W @¢ (—). But 


W &c R >> W&c C(U, -) = WU, -) 


is an isomorphism iff, for each wavy arrow f: U ~ V, there exists g: U — W in 
R and a wavy arrow h: W ~ V such that h-g = f. Clearly, this condition implies 
that R contains e(f) = e(h) - g for any such f; but conversely, if R contains all 
the e(f), then we may factor any wavy f: U ~ V as a composite hg of two wavy 
arrows, and then e(g) € R, so the condition is satisfied. Thus the image of e is 
precisely Shs(C°P, Jw). 

By 4.3.8, we know that Shs(C°?, Jw) is also the image of e in the idempotent- 
splitting sense. However, we may alternatively prove this by a direct calculation. 
which we shall in any case need in the next section (cf. the proof of 4.4.4). It 
is sufficient to show that if F: C — S is any Jw-sheaf then e,(F) S F. But 
since each object of C has a smallest Jw-covering sieve, the assertion that F is a 
Jw-sheaf means simply that, for each U € Co, the canonical morphism F (U) > 
lim suver, F(V) is an isomorphism. And e,(F) is by definition W thc F, 
where W the (—) denotes the right adjoint of W @c (—); from this, we may 
readily deduce the formula e,(F)(U) = lim F(V), where this limit is taken (not 
over Ry but) over the category (UC) whose objects are wavy arrows U ~ V 
and whose morphisms are straight arrows between their codomains inducing 
commutative triangles. It is clear that the canonical mapping from the first limit 
above to the second (which sends (z+ | f € Ry) to (zep) | f: U ~ V)) is monic: 
to show that it is an isomorphism, we have to show that if (zp | f:U~4V) is 
any element of the second limit, and f and g are two wavy arrows with the same 
underlying straight arrow, then necessarily x; = xg. But if f and g have the 
same underlying straight arrow, then they are coequalized by any wavy arrow 
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whose domain matches their codomain (V, say), and hence z+ and x, have the 
same image in e,(F)(V). Since we have already observed that F — e,(F) is 
monic, this forces £f = Z,, as required. o 


It follows from 4.3.12 that the idempotent endomorphism e of [C, S], and 
hence the idempotent profunctor comonad W, depend only on the image of the 
counit map €, since the knowledge of which morphisms of C underlie wavy arrows 
suffices to define the coverage Jw. However, it does not seem easy to give a direct 
construction of W from Jw, except in the particular case when € is monic (in 
which case, of course, W (U,V) may be taken to be the set of those morphisms 
U — V which belong to Ry). 


Example 4.3.13 If P is a continuous poset (in Set, for simplicity), then 4.3.12 
allows us to express it as the category of points of a coadjoint retract of the topos 
[P, Set]: specifically, of Sh( P°?, Jez), where J<< is the (Grothendieck) coverage 
in which a cosieve on an element p of P is covering iff it contains all elements q 
with p << q. For example, if P is the ordered set (—oo,0] C R (which is easily 
seen to be continuous, since its way-below relation coincides with the strict 
ordering <), we obtain precisely the non-rigid coverage on [0,00) = (—co,0]°? 
mentioned as a counterexample after 2.2.18. On the other hand, if P is the poset 
IQ of ideals of a poset Q, then we obtain the rigid coverage associated (as in 
2.2.4(d)) with the inclusion of the sub-poset of principal ideals. 


Suggestions for further reading: Johnstone [517], Johnstone & Joyal [542], 
Scott [1102]. 


C4.4 Exponentiable toposes 


We now turn from the topos analogue of 4.1.6 to that of 4.1.9: the determina- 
tion of which bounded S-toposes are exponentiable in the 2-category BYop/S. 
We recall that we made some preliminary investigations of exponentiability in 
Section B4.3; in particular, we recall the following result, which we restate as 
the starting-point of our present investigation: 


Theorem 4.4.1 Let p:€ — S be a bounded S-topos. Then E is exponentiable 
in BUop/S iff the particular exponential S[O]E exists, where S[O] is the object 
classifier for toposes defined over S (cf. D3.2.1). g 


This of course corresponds to the fact, which we observed in the proof of 4.1.9, 
that the existence of the particular exponential S*, where S is the Sierpiński 
space, implies the exponentiability of X in Loc. And we shall use it in much 
the same way, to establish the link between injectivity and exponentiability in 
BTop/S. Note that S[O] is an injective S-topos by 4.3.1, since we may identify 
it with the functor category [S;,S] and Sy is cocartesian; and since the functor 
(—) x E£: BIop/S — BTop/S preserves inclusions by the proof of B3.3.6, it 
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follows that its right adjoint, if it exists, must preserve injectives. But we have 
Pt(S[O}*) ~ BTop/S (E, SIO] = E 


by the defining property of the object classifier; so from 4.3.4 we immediately 
deduce 


Proposition 4.4.2 IfE is an erponentiable S-topos, then it is continuous as 
an S-indexed category. o 


Our main task in this section is to establish the converse of 4.4.2. We know 
from 2.2.8 that any bounded S-topos € has a small dense subcategory, which 
may be taken to be closed under finite colimits in €; so if E is a continuous 
category then it is necessarily ind-small, as well as cocomplete. Hence by 4.3.6 
it is (equivalent to) the category of points of an injective S-topos F, which is 
determined up to equivalence by £, and which is clearly the only possible candi- 
date for the exponential S[O]®. As in the proof of 4.1.9, the hard work consists 
in showing that it does indeed have the universal property of this exponential. 

Before we embark on it, let us fix some notation. We shall take C to be a 
small full generating subcategory of E, which is closed under finite limits as well 
as colimits, and is therefore a pretopos. We shall write K for the Grothendieck 
coverage on C induced by the inclusion C — E, i.e. the collection of all sieves 
on C which map to epimorphic families in E. We shall write W both for the 
‘wavy arrow’ profunctor € % € and for its restriction to C, and Jw for the 
coverage which it induces on C°P as in the last section. (Thus F is the topos 
Shs(C°?, Jw).) 

We also need a description of K in terms of W. Note that, as usual, K 
contains the coherent coverage P on C, and so by 3.2.18 it may be expressed 
as the join PV Ka, where K4 consists of all the dm-sieves in K. But from the 
definition of wavy arrow, every Ky-covering sieve on an object U of C contains 
the underlying straight arrows of all wavy arrows with codomain U. The sieve Su 
consisting of all such straight arrows is not in general a dm-sieve, or even filtered 
(though it will be filtered if the ‘underlying straight arrow’ map is injective), 
but it is K-covering since £ is a continuous category. So we see that K may be 
generated as a Grothendieck coverage by P together with the coverage Kw whose 
only covers are the sieves Sy, U € Co. (Note that the latter is indeed a coverage. 
since if f: V — U in C then Sy C f*Sy. Indeed, if we modify the definition to 
include all sieves which contain Sy for some U, we get a Grothendieck coverage: 
the ‘local character’ axiom (L) follows from the idempotency of W, as it did for 
Jw.) In particular, a functor C°P — S is a K-sheaf (that is, an object of £) iff it 
is a P-sheaf and satisfies the sheaf axiom for the sieves Sy. 

Our basic strategy for proving the exponentiability of £ is to show that the 
equivalence between E (that is, Shs(C,K)) and the category of points of F 
(i.e. of cartesian Jw-cover-preserving functors C° — S) can be derived from 
properties of the pair (C, W) which are preserved by inverse image functors. 
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Then for any S-topos r: G — S we shall have 


BTop/S (G xs E, SO] >G xs E 
~ Shg(r*C,r*K) 
~ BTop/G (G, Shg(r*C?,r# Jw)) 
~ BTop/S (G, F) 


yielding the required adjunction. (It should be noted, however, that Shs(C, K) 
and the category of cartesian Jw-cover-preserving functors C°P — S do not 
coincide as subcategories of [C°?,S]; indeed, as subcategories of Ind-C ~ 
Carts (CP, S), they are embedded via the right and left adjoints respectively 
to the colimit functor Ind-C — E.) 

One further ingredient is needed before we can actually prove such a theorem. 
In general, if C is a pretopos, then functors on C°? which are sheaves for the 
coherent coverage are ‘almost cartesian’ in the sense of the remark before A1.4.8, 
but they need not be cartesian: they send coequalizers of equivalence relations to 
equalizers, but they do not necessarily send arbitrary coequalizers to equalizers, 
since they do not preserve the infinite unions involved in the definition of the 
equivalence relation generated by a parallel pair. Since we wish this to be true 
in our case, we have to build it into the properties of W, as follows. We shall 
say that the pair (C, W) is essentially finitary if, given any object U of C and 
any relation T — U x U, with equivalence closure R = U2, T (n) defined as in 
3.4.5, each wavy arrow V ~ R in C factors through T™ — R for some finite n. 
Clearly, if (C,W) is obtained as above from a topos € which is a continuous 
category, then it is essentially finitary; for the T™ then form a directed diagram 
in € whose colimit is R, and so there must be a morphism of ind-objects from 
w(R) to this diagram. 

The key ‘change of base’ result is thus 


Lemma 4.4.3 Let C be an internal pretopos (with arbitrary coequalizers) in 
a topos S, and W: C%C an idempotent profunctor comonad which is a right 
torsor, such that the pair (C, W) is essentially finitary. Then, for any geometric 
morphism q: F — S, the pair (q*C,q*W) is essentially finitary. 


Proof The assertion that (C, W) is essentially finitary may be expressed as 
a geometric sequent in the internal language of S, provided the infinite union 
a T™ always exists in C. But, since we have assumed that C has arbitrary 
coequalizers, we may construct it as the kernel-pair of the coequalizer of T 3 U. 
These constructions are preserved by the inverse image functor q*; hence it 


preserves the validity of the geometric sequent. o 


Proposition 4.4.4 LetC be an internal pretopos with arbitrary coequalizers in 
a topos S, and let W be an idempotent profunctor comonad on C which is a right 
torsor, such that the pair (C, W) is essentially finitary. Let K be the coverage 


748 LOCAL COMPACTNESS AND EXPONENTIABILITY C4.4 


on C generated by the coherent coverage together with the sieves Sy, and let 
Jw be the coverage on C°P consisting of the sieves Ry defined in 4.3.12. Then 
Shs(C, K) is equivalent to the category of points of the topos Shs(C°?, Jw). 


Proof The ‘essentially finitary’ assumption ensures that any K-sheaf on C, 
considered as a functor C°P — S, preserves equalizers, since it implies that the 
kernel-pair of the coequalizer of an arbitrary pair T = U is K-covered by the 
sieve generated by the morphisms T™) — R. So we may regard Shg(C, K) 
as a subcategory of the category Cart (C°P, S) œ~ Ind-C — in fact a reflective 
subcategory thereof, since it is reflective in [C°?, S]. 

On the other hand, the category of points of Shs(C°?, Jw) may also be 
regarded as a full subcategory of Cart (C°P, S), consisting of those cartesian func- 
tors which map Jw-covering sieves to epimorphic families. As we have already 
remarked, these two subcategories of Cart (C°?,S) do not coincide in general. 
However, since W is a right torsor, the functor (—) @c W: [C°?, S] — [C°P, S] 
maps the subcategory Cart(C°?,S) into itself, and it inherits an idempotent 
comonad structure from W. By 4.3.12, we know that its image is exactly the 
subcategory of functors which preserve covers in Jw, since it corresponds to the 
image of the idempotent endomorphism of [C, S] induced by W. 

As a functor on [C°P, S], (—) @c W has a right adjoint W h€ (—), by B2.7.8. 
The latter may be defined more explicitly by 


W h? F(U) = lim F(V), 


where the limit is taken over the category (CY U) whose objects are all wavy 
arrows V ~+ U and whose morphisms are straight arrows between their domains. 
Clearly, W h? (—) has an idempotent monad structure, and, by the argument 
in the proof of 4.3.12 (applied with all arrows reversed), its image (as a functor 
on [C°P, S]) consists precisely of those functors C°? — S which satisfy the sheaf 
axiom for the sieves Sy defined earlier. But the fact that W is a right torsor 
ensures that this functor also preserves sheaves for the coherent coverage, since 
the latter can be defined as functors which preserve certain finite limits; so 
its image as a functor on Cart(C°?,S) is exactly the category of K-sheaves. 
Moreover, the adjunction between (—) @c W and W h€ (—) is itself idempotent, 
so it restricts to an equivalence between the category of K-sheaves and the 
category of cartesian functors which preserve Jy-covering sieves. 0 


Theorem 4.4.5 A bounded S-topos p: E — S is exponentiable in BTop/S iff 
its underlying category is continuous (as an S-indexed category). 


Proof One direction is 4.4.2. For the converse, suppose € is a continuous 
category; then it suffices by 4.4.1 to construct the exponential S[O]®. We define 
the latter to be Shs(C°?, Jw), where C is a generating subcategory for € closed 
under finite limits and colimits, W is the restriction to C of the wavy-arrow 
profunctor on €, and Jw is the coverage defined from W as in 4.3.12. We note 
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that W is a right torsor, since C is closed under finite colimits in €, and by 4.2.17 
these are also ‘wavy colimits’, i.e. W(—,V) preserves finite limits for any V. 

Now let q: F — S be an arbitrary S-topos. Then the pair (q*C, q*W) is 
essentially finitary by 4.4.3; moreover, the constructions of the coverages Jw and 
Kw from W make it clear that we have q# (Jw) = Jọw and q# (Kw) = Kaw. 
So, applying 4.4.4 in the context of F-indexed categories, we have 


BLop/S (F xs ESO) ~ F xs E 
~ Shz(q*C, q* K) 
~ BUop/F (F, Shr(q*C?, g* Jw)) 
~ BTop/S (F,Shs(C°?, Jw)), 


as required. o 


Examples 4.4.6 (a) In Section B4.3, we were mainly concerned with two par- 
ticular classes of exponentiable S-toposes, namely those of the form [C, S] for an 
(arbitrary) internal category C in S and those which are coherent over S. It is 
easy to see that both classes consist of exponentiable toposes using 4.4.5, since 
the underlying categories of both types are locally finitely presentable (for the 
local finite presentability of coherent toposes, cf. D3.3.12). 

(b) From 4.2.9, we may deduce that if € is an exponentiable S-topos, then 
so is €/A for any A. (However, this could also be deduced from (a) and the 
composability of exponentiable morphisms, as we saw in B4.3.6.) 

(c) As a special case of (b), any open subtopos of an exponentiable topos is 
exponentiable. But any closed subtopos E’ of an exponentiable topos £ is also 
exponentiable by 4.2.8, since the inclusion E’ — E preserves filtered colimits 
by (the S-indexed version of) A4.5.4, and so does its left adjoint. (Like (6), 
this could alternatively have been deduced from the second case of (a) and the 
composability of exponentiable morphisms; for a closed subtopos of € is coherent 
as an E€-topos.) Combining the two, we deduce that any locally closed subtopos 
(that is, any subtopos which is the intersection of an open and a closed subtopos) 
of an exponentiable topos is exponentiable. Conversely, if an inclusion S’ — S is 
exponentiable in BYop/S, then it is locally closed; this follows from (f) below 
and 4.1.12, since the terminal object of Loc(S) is compact and Hausdorff. 

(d) Paralleling 4.1.7(ii), if a geometric morphism f: F — € over S is con- 
nected and either locally connected or tidy, and F is exponentiable in BYop/S, 
then £ is also exponentiable, since we can use f* and its left (resp. right) adjoint 
to express € as a retract of F by functors preserving filtered colimits. (However, 
exponentiability does not ‘descend’ along arbitrary open or proper surjections, 
as we shall see in 4.4.9 below.) 

(e) We observed in B4.3.5(iii) that any retract in BTop/S of an exponen- 
tiable S-topos is exponentiable; but we may alternatively derive this from 4.4.5 
and 4.2.8, since inverse image functors preserve filtered colimits. In particular, 
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we note that every quasi-injective topos (in the sense of 4.3.9) is exponentiable. 
So, if we can construct the exponential S[O]F, we can also construct S [als (015) 
and so on. In the terminology introduced in B4.3.2, this means that the ‘dual’ 
of any dualizable geometric theory is itself dualizable. 

(f) If a localic topos Shs(X) is exponentiable in BTop/S, then it follows 
from 4.4.2 and 4.2.11 that X must be a locally compact locale. (We could also 
have deduced this by noting that Subsn(x)(1) may be identified with the category 
of points of the exponential Sh(S)S#(*), where S is the Sierpiński space, and 
the latter is an injective topos since Sh(S) is injective.) Somewhat surprisingly, 
in view of the close formal similarity between 4.1.9 and 4.4.5, this necessary 
condition for exponentiability of a localic topos is not sufficient; we shall see 
a counterexample in 4.4.8 below. However, if X is stably locally compact, then 
Shgs(X) is exponentiable in BTop/S, by 4.1.13 and (e) above. In fact a stronger 
result is true: if X is locally stably compact, then Shs(X) is exponentiable. (This 
follows from the remark after 4.1.13, plus the (easy) fact that any locally coherent 
topos is an open subtopos of a coherent topos: given a distributive pre-lattice B, 
let Bt be the lattice obtained by adjoining a top element to B, and then IB is 
a principal ideal in [(Bt).) 


To determine exactly which locales give rise to exponentiable toposes, we 
need to introduce a strengthening of the way-below relation, which stands in 
much the same relation to it as the notion of strong compactness introduced in 
3.4.1 does to ordinary compactness. Given elements U and V of a frame O(X), 
we shall write V << U if, whenever we have a filtered diagram (A; | i € J) in 
Sh(X) with lim;A; = U, there exists i € I such that A; has a section over V. 
And we shall say that X is locally metastably compact if every U € O(X) satisfies 
U = U{V € O(X) | V << U}. (The reason for the term ‘metastably’ is that, in 
a locally stably compact locale, the relation << coincides with <<, and hence 
any such locale is locally metastably compact. We shall not prove this here, since 
we have already seen that locally stably compact locales are exponentiable.) 

If we already know that Sh(X) is a continuous category, then it clearly 
suffices in the definition of the relation << to consider the particular filtered 
diagram w(U); hence in this case we have V << U iff there exists a wavy arrow 
V ~ U in Sh(X). Thus we may conclude 


Lemma 4.4.7 If X is a locale such that Sh(X) is an exponentiable S-topos. 
then it is locally metastably compact. 


Proof Given U € O(X), we identify it with a subterminal object of Sh(X) 
and let w(U) = (A, | i € I). Then the supports oA; of the A; form a directed 
family of subterminal objects whose union is U, and each oA; can be covered by 
opens V over which A, admits a section. Thus U is the union of all such V. 


o 
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Example 4.4.8 We may now give our promised example of a locally compact 
locale which is not locally metastably compact, and hence does not give rise 
to an exponentiable topos of sheaves. The space X of Example 4.1.10, though 
not locally stably compact, is locally metastably compact, since although the 
relation X << X fails to hold (as we saw in 3.4.1(a)), we do have U << X and 
V << X, where U and V are the two copies of [0, 1] embedded in X. However, if 
we ‘iterate’ this construction as follows, we obtain our desired counterexample. 

Consider the product space [0,1] x K, where K is the Cantor space, i.e. the 
product of countably many copies of the discrete two-point space. (We think 
of points of K as infinite sequences of 0’s and 1’s.) We define an equivalence 
relation R on this space by setting (z,s) R(y,t) iff x = y, and either s = t or 
there exists a natural number n such that x > 1/2”*! and the first n terms of 
the sequences s and t agree; let Y be the quotient space ((0, 1] x K)/R. Thus, in 
terms of its evident projection onto [0,1], Y has a discrete 2”-point space as its 
fibre over each point of the interval (1/2"+1, 1/2"], and a copy of K as its fibre 
over 0. It is easy to see that the quotient map q: [0,1] x K — Y is open, and 
hence by 4.1.7(ii) Y is locally compact. But we shall show that no open U CY 
satisfying U << Y can meet the fibre over 0; hence Y is not locally metastably 
compact. 

Let U be an open set in Y containing the point q(0, s). Then, for some n, U 
also contains the points q(1/2”+!, s) and q(1/2"t!, s’) where s’ differs from s at 
the (n + 1)st term but not before. Hence U contains an open subset U’ which 
looks like the effect of identifying two copies of the open interval (t— €, t+€) along 
the subinterval (t,t + €), where t = 1/2"+! and 0 < e < 1/2"t?. For 0 < ô < €, 
let Xs be the space obtained from X by ‘unglueing’ these two intervals over 
(t,t +6]. Then the projection Xs — X is a local homeomorphism, and it is clear 
that we can regard (Xs | 6 > 0) as a directed diagram of sheaves on X with 
colimit X. But no Xs admits a section over U’, let alone over U. 


Remark 4.4.9 The fact that the space Y of 4.4.8 is an open surjective 
image of the compact Hausdorff (and therefore stably locally compact) space 
[0,1] x K shows us that exponentiability does not descend along open surjec- 
tions, in contrast to B4.3.7: that is, if we are given an open surjection F —> E 
in BTop/S and F is exponentiable, we cannot conclude that £ is exponen- 
tiable. At first sight, this is rather surprising, since the descent theorem for 
open surjections, to be proved in the next chapter (see 5.1.6), tells us that in 
these circumstances £ is the pullback-stable pseudo-colimit in BTop/S of a dia- 
gram involving F and the pullback F x¢ F; and we might expect to be able to 
form the exponential S[O]® as a corresponding pseudo-limit of exponentials. The 
reason why this will not work is that, in contrast to the special case considered 
in B4.3.7, the kernel-pair F xe F does not in general inherit exponentiabil- 
ity from F: in the particular example just described, the equivalence relation 
R is (not locally closed in ([0,1] x K) x ([0,1] x K), and hence) not locally 
compact. 
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To prove the converse of 4.4.7, let X be a locally metastably compact locale. 
It suffices by 4.2.10 to construct an initial ind-object in Sh(X) with colimit F, 
for every sheaf E on X. But it is easy to see that local metastable compactness 
is a local property, i.e. it is inherited by the domain of any local homeomorphism 
E — X; so in fact it suffices to do this for the particular case E = X. In what 
follows, we shall freely identify sheaves on X with local homeomorphisms over X. 

We say that a sheaf F — X is admissible if it is generated by a finite number 
of sections (i.e. there exists an epimorphism []7_, U: > F in Sh(X) for some 
n, where the U; are subterminal objects of Sh(X)) and, for every ind-object 
(G; | j € J) in Sh(X) with colimit X, there exists a morphism F — G, for 
some j. (Thus a subterminal object U is admissible iff it satisfies U << X.) We 
define a category T as follows: its objects are monomorphisms (F” — F) where 
F is admissible and F’ << F in the (continuous) lattice of subsheaves of F, and 
morphisms (F’ — F) — (G’ — G) are morphisms F’ — G’ for which there 
exists some morphism F — G making the obvious square commute. 


Lemma 4.4.10 The category T is filtered. 


Proof It is clearly nonempty. Given two admissible objects F and G, it is 
easy to verify that their coproduct F II G is admissible; also, if F’ << F and 
G’ << G, then we have F’ << F < FUG and G’ << G < FUG in Sub(F UG), 
and hence F” IG’ << FUG. So (F’ UG’ — FIG) is an object of T, and it 
clearly admits morphisms from (F’ — F) and (G’ — G). 

Now suppose given a parallel pair of morphisms f’, 9’: (F’ — F) 3 (G’ > G) 
in Z. Form the diagram 


F ! 
Poa G i H' 
g' 
f h 
F ——~ s G ———>H 
g 


in Sh(X), where the top row is a coequalizer and the right-hand square is a 
pushout. By A2.4.3, the morphism H’ — H is monic, and the square is also a 
pullback; and since h is epic (so that pullback along it is a conservative functor) 
it is easy to verify that we have H’ << H in Sub(H). It is also clear that 
H is finitely generated; to show that it is admissible, consider an ind-object 
(A, | J € J) with colimit X. For some i, we have a morphism k: G > A;. The 
composites kf and kg need not be equal; but since F’ << F in Sub(F) we can 
find j — 7’ in J such that the equalizer of F = G > A; — A, contains F’ — F. 
and hence the composite G’ — G — A; — A; factors through the coequalizer 
h’. So we obtain a factorization of G > A; — A; through the pushout h; hence 


C4.4 EXPONENTIABLE TOPOSES 753 


H is admissible, and h’: (G’ — G) — (H' — H) is a morphism of F coequalizing 
f' and g'. o 


The finite generation condition in the definition of admissible objects ensures 
that Z is at least equivalent to a small category. Thus we may regard the obvious 
forgetful functor Z — Sh(X) sending (F’ — F) to F’ as an ind-object in Sh(X). 


Lemma 4.4.11 The ind-object just defined is initial among ind-objects in 
Sh(X) with colimit X. 


Proof First we must verify that the ind-object does indeed have colimit X. But 
since X can be covered by opens U’ for which there exists U with U' << U <x X, 
it is clear that the colimit is well-supported. To show that it is subterminal, 
suppose U << X and we are given morphisms f,g: U = F’ for some object 
(F’ — F) of T. Then, for any U’ << U, the restrictions of f and g to U’ may 
be regarded as morphisms (U’ = U) 3 (F’ — F) in T; so by 4.4.10 they 
are coequalized by some (F’ =— F) — (G' = G) in T; ie. the equalizer of 
U 3 F’ —G' contains U’. Since U is covered by such opens U’, it follows that 
f and g represent the same morphism U — lim7zF’”; so the colimit has at most 
one section over U. But the opens U with U << X form a basis for O(X); so 
the colimit has at most one section over any open set. 

The proof of initiality is generally similar. Given any ind-object (A, | j € J) 
with colimit X, we have by definition a morphism from any admissible object 
F to some A,. But if we had two such morphisms F = Aj, and a sub- 
object F’ << F, then there must exist j — j’ such that the composites 
F' + F = A, > A; are equal; so for any such F” there is a unique equivalence 
class of morphisms F’ — A, which includes the restriction of some morphism 
defined on F. Taking this equivalence class for each object (F’ — F) of T, we 
obtain a morphism of ind-objects (F’ | (F’ — F) € T) — (A, | j € J). However, 
given any object (F’ — F) of Z, we can find F” with F’ << F” << F in Sub(F); 
then the inclusion F’ — F” is a morphism (F’ — F) — (F” — F) in T, and so 
if we are given any morphism of ind-objects with domain (F” | (F’ — F) € T), 
its (F’ — F)th component must contain some morphism which extends to 
F”. Hence by the argument already given there is exactly one morphism 
(F | (Fl! F) ED) (A, lJ € J). D 


Putting everything together, we now have 


Theorem 4.4.12 Let X be an internal locale in S. Then Shs(X) is exponen- 
tiable in BLop/S iff X is locally metastably compact. 


Proof One direction is 4.4.7. Conversely, if X is locally metastably com- 
pact, then by 4.4.11 Ind-Sh(X)x has an initial object; and since local 
metastable compactness is a local condition, the same is true for all the fibres 
of lim : Ind-Sh(X) —> Sh(X). So by 4.2.10 Sh(X) is a continuous category, and 
by 4.4.5 it is an exponentiable topos. o 
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We remark that, although we defined th 
of arbitrary (diagrams of) sheaves on X, 
purely lattice-theoretic condition on O(X), 
we reduced the condition (E) for strong com 
in 3.4.4, to a condition on a site of definition 
which may be found in [542]. 


Suggestions for further reading: Johnst 
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e relation U << V in O(X) in terms 
the definition can be reduced to a 
using ideas similar to those by which 
pactness of an S-topos £, introduced 
for € (cf. 3.4.5). We omit the details, 


one & Joyal [542], Niefield [892, 896]. 


C5 
TOPOSES AS GROUPOIDS 


C5.1 The descent theorems 


The title of this chapter perhaps needs a few words of explanation. Our general 
thesis throughout Part C has been that the notion of (Grothendieck) topos is 
a generalization of the notion of topological space (or, better, of the notion 
of locale). In particular, toposes (or geometric morphisms between them) can 
be ascribed topological properties such as (local) compactness or (local) con- 
nectedness, aS we saw in Chapter C3; and the theory of exponentiability for 
toposes, which we explored in Chapter C4, also has close parallels with that 
for spaces (or locales). Nevertheless, we know from the examples of toposes we 
have seen (in Section A2.1 and elsewhere) that, even if we restrict our attention 
to Grothendieck toposes, there are many which (in their origins, at least) do 
not appear at all ‘spatial’ in character. So one is tempted to ask whether this 
generalization is perhaps too large to be genuinely useful. 

Our aim in this chapter is to demonstrate that the generalization from locales 
to (Grothendieck) toposes is not, after all, as broad as it appears. Specifically, we 
shall show that, by uniting the notion of locale with that of a groupoid, we obtain 
representation theorems enabling us to describe arbitrary Grothendieck toposes 
in terms of these two concepts. So the original informal picture of a Grothendieck 
topos, dating from the earliest days of work on étale cohomology of schemes, as 
‘a space whose points have enough internal structure to allow them to possess 
nontrivial automorphisms’ is not as wide of the mark as all that. 

There are several different versions of these representation theorems, and 
in the interests of comprehensiveness we shall prove more versions than any 
one reader may perhaps wish to see. But (with the exception of the ‘Freyd 
representation’ studied in Section C5.4) they all have a family resemblance: in 
Particular, they depend on combining the idea that an arbitrary Grothendieck 
topos has a particular type of ‘covering’ by a localic topos, with theorems which 
say that the covering maps concerned are descent morphisms in BTop, in a 
Suitable sense. The covering theorems will be proved in Section C5.2; in the 
Present section we concern ourselves with the descent theorems. 

Since we shall be making extensive use of pullbacks of geometric morphisms, 
we shall assume until further notice that all geometric morphisms under con- 
sideration are bounded. We shall also assume whenever necessary that the base 
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topos S over which we are working has a natural number object; and we shall 
use the term ‘Grothendieck topos’ loosely to mean ‘bounded S-topos’, without 
explicitly specifying the base topos S. 

We begin by recalling the definition of descent morphisms from Sections B1.5 
and B3.4. Given a bounded geometric morphism f: F — E, we may form the 
diagram 


T12 TI 
—_— z 7 
: <—— 
FxefFxef B  Fxef A F 
—s —> 
T23 T2 


(recall that we called such a diagram a 2-truncated simplicial topos in Sec- 
tion B3.4 - we shall refer to this diagram as Fe), and we recall that descent data 
for an object B of F consists of a morphism 6: m] (B) — 13(B) in F xe F such 
that (modulo coherence isomorphisms) we have A*(0) = 1g and the diagram 


Tio(0 
ni) 2, ex) 
a) N e 
rä(B) 


commutes in F xg F xe F. (Such a morphism 6 is necessarily an isomorphism, 
as we saw in B1.5.1: the pullback of 6 along the ‘twist’ isomorphism F xe F ~> 
F xg F is an inverse for 0.) If B = f*(A) for some A € ob €, then the component 
at A of the canonical isomorphism af f* = 73 f* constitutes descent data for B. 
so we can lift f* to a functor E — Descys(F.), where Descs(F.) (or simply 
Desc(Ff,), if we do not need to specify the morphism f explicitly) denotes the 
category of objects of F equipped with descent data. In fact, as we saw in 
B3.4.12, the category Desc(F,) is always a bounded €-topos, provided £ has a 
natural number object, although for most of this and the next section we shall 
not need to assume this fact. (We shall need it in Section C5.3, however.) 

We say that f is a descent morphism (resp. a pre-descent morphism) if the 
canonical functor E — Desc(F,) is part of an equivalence (resp. full and faith- 
ful). (Given that this functor is the inverse image of a geometric morphism, the 
latter condition just says that Desc(F,) is a connected E-topos. Note that since 
F — Desc(F,) is always surjective, a pre-descent morphism is necessarily a 
surjection.) If f is a descent morphism, we shall also say that objects descend 
along f. (This is not quite the sense in which we used the phrase ‘f is a descent 
morphism’ in Section B1.5; in the terminology of that section, our present condi- 
tion is equivalent to saying that f is a descent morphism for the BTop-indexed 
category given by the forgetful functor BYop°? — Cat which sends a topos to 
its underlying category, and a geometric morphism to its inverse image functor. 
In fact I. Moerdijk [835] has shown that open surjections are descent morphisms. 
in an appropriate 2-categorical sense, in BTop.) 
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We begin with the simplest case of the descent theorem, that in which f 
is a stable (right or left) Beck-Chevalley morphism in the sense of 2.4.16. We 
note that the morphism 0: 7{(B) — 73(B) may be transposed to a morphism 
0: B — m473(B) in F, and if the pullback square of f against itself satisfies 
the Beck-Chevalley condition we may identify the codomain of ĝ with f* fą(B). 
Moreover, by the arguments in the proof of B1.5.5 (which use the Beck—Chevalley 
condition only for pullbacks of pullbacks of the particular morphism under con- 
sideration), saying that 6 constitutes descent data for f is equivalent to saying 
that ð is a coalgebra structure for the comonad induced by the adjunction 
(f* 4 fs). So we may immediately deduce the following result from A4.2.6, 
plus the results on stable Beck—Chevalley morphisms in 3.3.16 and 3.4.11: 


Theorem 5.1.1 Suppose f: F — E is bounded, surjective and either locally 
connected or tidy. Then objects descend along f. 0 


In fact, however, this result is not best possible: both open and proper sur- 
jections are descent morphisms. To prove this, it is convenient to go by way 
of descent for locales, and then to regard objects as a special (discrete) case 
of locales: of course, we say that locales descend along f: F — E if the func- 
tor Loc(£) — Desc(Loc(F).) induced by f*: Loc(£) — Loc(F) is part of an 
equivalence, where Desc(Loc(F),.) denotes the category whose objects are inter- 
nal locales Y in F equipped with morphisms 6: 77(Y) — 73(Y) in Loc(F xe F) 
satisfying the usual unit and cocycle conditions. 


Lemma 5.1.2 Locales descend along hyperconnected morphisms. 


Proof In 2.4.11 we saw that, for any pullback square in Bop, the 
Beck-Chevalley condition holds for internal locales in the toposes concerned. 
Thus, by (the dual of) the argument of B1.5.5, we may always identify 
Desc; (Loc(F).) with the category of algebras for the monad on Loc(F) induced 
by f*: Loc(€) — Loc(F) and its left adjoint fi. The problem is thus to show 
that f* is monadic at the level of locales. But for hyperconnected morphisms 
f this follows easily from 2.4.15, which says that the adjunction is a reflection 
in this case. o 


Lemma 5.1.3 Suppose given a commutative triangle of geometric morphisms 


Ge ag 


E 
where h is hyperconnected. Then locales descend along g iff they do so along f. 


Proof First suppose locales descend along g, and let Y be an internal locale 
m F equipped with descent data 6 for f. By applying the inverse image of 
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the geometric morphism h x h: G xeG—Fx¢eF to the descent data, one 
obtains descent data for h*Y relative to g, so by assumption there exists a 
locale X in £ satisfying g*(X) = h* f*(X) = h*(Y). Since h* is full and faith- 
ful on locales, we deduce f*X = Y. The construction is functorial in (Y, 9), 
and clearly defines an inverse (up to isomorphism) for the canonical functor 
Loc(€) — Desc;(Loc(F),). 

Conversely, suppose locales descend along f, and let Z be an internal locale in 
G equipped with descent data for g. Once again, we may pull the data back along 
G xr G — G xe G, to obtain descent data for Z relative to h, and hence by 5.1.2 
we obtain a locale Y in F such that h*Y = Z. Nowhxh:GxeGouF xcF 
is hyperconnected, since it is a composite of pullbacks of h, so the isomor- 
phism h*nj(Y) = aj(Z) — 13(Z) & h*73(Y) ‘descends’ to an isomorphism 
m™(Y) — 13(Y) in Loc(F xe F), which is readily verified to be descent data for 
Y relative to f. Thus we obtain a locale X in £ such that f*X = Y and hence 
g°X % h*Y & Z; it is again easy to verify that this construction yields an inverse 
(up to natural isomorphism) for the functor Loc(€) — Desc,(Loc(G).). o 


Proposition 5.1.4 For any topos S, open surjections and proper surjections 
of locales are descent morphisms (in the sense of B1.5.5) in Loc(S). 


Proof We use B1.5.7, which provided a criterion for a class of morphisms in 
a cartesian category to consist of descent morphisms. We already know that 
both open surjections and proper surjections satisfy the first three conditions of 
B1.5.7, by the results of 3.1.11 and 3.1.12 in the open case and by 3.2.6 in the 
proper case. So we need only verify condition (iv), which says that if we are given 
an equivalence relation a,b: R = X in Loc(S) with a and b open (respectively, 
proper), then the coequalizer q: X — Y of a and b is open (resp. proper) and 
pullback-stable. 

The first assertion is easy in the open case, since by 1.5.3(iii) we know that 
open maps are precisely those for which the corresponding frame maps are homo- 
morphisms of complete Heyting algebras; if a and b both have this property, then 
clearly the equalizer of a* and b* in Frm(S) is a sub-(complete Heyting algebra) 
of O(X). Alternatively, we could employ the dual of the following argument, 
which we need for the proper case. 

Suppose a and b are proper, and consider the composite a,b* : O(X) > O(X). 
We note first that, since (a,b) is reflexive, we have a,b* < axs,s*b* = lox) 
(where s: X — R is the diagonal map). Symmetry of (a,b) tells us that a.b* = 
b,a*; and transitivity tells us that it is idempotent, for if we form the pullback 


ee TEE 


R ——> X 
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and let t: R xx R— R be the map expressing transitivity, we have a,b*a,b* = 
a,c,d*b* = a,t,t*b* > a,b*, using the Beck—Chevalley condition for the pull- 
back square at the first step. Let O(Y) denote the set of fixed points of a,b"; 
then O(Y) is a subframe of O(X), since the inclusion g*: O(Y) — O(X) has a 
right adjoint g,, and it preserves finite meets since a,b* does. Also, the image 
of q* is precisely the equalizer of a* and 6*; for from a*(U) = b*(U) we deduce 
q*q.(U) = a,b*(U) = a,a*(U) > U, whence g.g*(U) = U, and conversely if U 
is fixed by g*q, then b*(U) > a*a,b*(U) = a*g*q.(U) = a*(U), and similarly 
a*(U) > b*(U). Thus q: X — Y is the coequalizer of a and b in Loc(S). 

To show that q is proper, we first note that q, preserves directed joins, since 
q* qx = a,b* does and q* reflects joins. For the O(Y)-linearity condition, suppose 
U € O(X) and V € O(Y): then 


q*q.(U Uq'(V)) = a,b" (U U g" (V) 
= a,(b* (U) U b*ą*(V)) since b* preserves finite joins 
=a,(b"(U) Ua*q*(V)) since ga = qb 
=a,b"(U)Uq"*(V)_ since a is proper 
= q*(q&(U)U V), 
but q* is injective, so qg.(U Uq*(V)) =q.(U) UV. 
To show that the coequalizer diagram R= X — Y is preserved under pull- 


back in Loc(S), note that the identities we have etablished, plus the fact that 
a and q are surjective, say that 


q* ERA 
OW) Z2 0(X) 2, O(R) 


is a split equalizer diagram of internal preframes in S, and the maps involved 
are all O(Y)-linear. Pulling back along a locale morphism Y’ — Y corresponds 
to applying the functor (—) Ooy) O(Y") from O(Y )-modules to O(Y")-modules 
in PFrm(S) (cf. 1.1.9); so it preserves this equalizer. And the forgetful functor 
from frames to preframes creates equalizers, so the diagram remains an equalizer 
in Frm(S), that is a coequalizer in Loc(S). The argument for this part of the 
proof in the ‘open’ case is similar, using complete join-semilattices in place of 
preframes. o 


Putting together the last three results, we have 


Theorem 5.1.5 Locales descend along arbitrary open or proper surjections 


in BTop. 


Proof Let f: F — E be a surjection which is either open or proper. By 5.1.3, 
locales descend along f iff they descend along its localic part, which we may 
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write as She(X) — € for an internal locale X in E which is positive and either 
open or compact as appropriate. But now internal locales in She (X ) correspond 
to objects of Loc(€)/X, by 1.6.3; so we are reduced to proving that X — lisa 
descent morphism in Loc(€). And this follows from 5.1.4. o 


To deduce that objects descend along an open or proper surjection f, we 
use the identification of objects with discrete locales, and the characterization 
of the latter as those locales X such that X — 1 and the diagonal X — X x X 
are open (3.1.15). Since f*: Loc(€) — Loc(F) preserves finite products (being 
a right adjoint), it therefore suffices to know that openness of locale morphisms 
descends along f, i.e. if g: X — Y is a morphism in Loc(€) such that f*(g) is 
open in Loc(F), then g itself is open. But we proved this in 3.2.23; so we may 
now deduce 


Theorem 5.1.6 Open surjections, and proper surjections, are descent mor- 
phisms in Bop. o 


From the fact that both objects and locales descend along a morphism, we 
may deduce many other descent properties. In particular, most of the properties 
of geometric morphisms which we studied in Chapter C3 descend along either 
open or proper surjections: 


Corollary 5.1.7 Suppose given a pullback square 


EE, 


Fae 


in BTop/S. where f is surjective and either open or proper. If h has any of the 
following properties: surjective, open, proper, hyperconnected, stably connected. 
locally connected, tidy, atomic, separated, local, totally connected, then g has the 
corresponding property. 


Proof ‘Surjective’ is obvious, and ‘open’ and ‘proper’ were dealt with in 3.2.23. 

For ‘hyperconnected’, let X be the internal locale in € corresponding to the 
localic part of the hyperconnected-localic factorization of g; then f*X © 1 in 
Loc( F). But this isomorphism must descend along f, so X S 1 in Loc(E); ic. 
g is hyperconnected. 

If h is stably connected (that is, all pullbacks of h are connected), it suf- 
fices to prove that g is connected, since any pullback of g can be pulled back 
along an open surjection to a pullback of h. But we already know that g is 
surjective, so we need to prove that g* is full. Suppose we have objects A 
and B of E and a morphism a: g*A — g*B in G; then k*a is a morphism 
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h* f*A = k*g* A — k*g* B = h* f*B in H, so it derives from a unique morphism 
b: f*A — f*B in F. Moreover, k*a commutes with the descent data which k* g* A 
and k*g* B possess relative to the open or proper surjection k; so, since the pull- 
back of h to the kernel-pair of f is still connected, we see that b must commute 
with the descent data on f*A and f*B relative to f. Hence it is of the form h*c 
for a morphism c: A — B in €; then since k*g*c S h* f*c = h*b = k*a and k* is 
faithful we deduce that g*c = a. 

For local connectedness, we use the criterion of 3.3.5(ii): we need to show that 
for any object B of G we can factor the composite G/B — G — € as a connected 
morphism followed by a local homeomorphism. In fact it suffices to consider the 
case B = 1, since local homeomorphisms pull back to local homeomorphisms. 
We know we have such a factorization H — F/C — F of h, where C = hi1; now 
consider the diagram 


Ty 
HxgH SH 


N2 


h 


Tl 
FxeF ESF 
T2 


Here both the squares are pullbacks; so since the factorization of 3.3.5(i) is 
stable under pullback, by the Beck-Chevalley condition for locally connected 
morphisms, we have an isomorphism r{C = 75C in F xe F. It is easy to see 
that this isomorphism satisfies the unit and cocycle conditions for descent data; 
so we deduce that C = f*A for an object A of E. Moreover, the factorization 
h': H — F/C of h through F/C — F corresponds by B3.2.8(b) to a morphism 
l — h*C & k*g*A in H; it is again easy to see that this morphism commutes 
with the descent data on k*g* A, and so derives from a morphism 1 —> g*A in G - 
equivalently, to a factorization of g through €/A — E. Now we have a pullback 
square 


H TEE G 
h! g' 
f/A 


F/C ——>E/A 


where h’ is stably connected (since it is also locally connected); so by the previous 
argument we deduce that g’ is connected. 
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For the tidy case, we use the Beck—Chevalley condition for the pullback 
squares 


Ty 

H xs H SH 
T2 

h 


TI 
F xe F ZZZ F 
N2 


to deduce that, for any object B of G, the object h.k*B of F can be equipped 
with descent data relative to f, and so can be written as f*qB for some object 
qB of E. It is clear that B + qB is functorial; also, the natural maps f*A — 
h,h* f*A = hyk*g*A = ftqg*A and k*g*qB & h*f*qB = h*h.k*B — k*B 
descend along f and k respectively, to yield natural transformations le — qg* 
and g*q — lg. And since f* and k* are faithful, these transformations satisfy the 
triangular identities for an adjunction (g* 4 q), i.e. we may identify q with g». 
Thus we have shown that the square satisfies the Beck-Chevalley condition. It 
now follows that if C is any filtered internal category in (a slice of) E, then f* 9. 
preserves colimits of shape C, since h,k* does; but f* preserves colimits and is 
conservative, so gą preserves colimits of shape C. 

For the atomic case, we use the characterization of atomic morphisms in 
3.5.14: it is easy to see that the diagonal A, is a pullback of A, along a morphism 
which is itself a pullback of f, and hence an open or proper surjection. A similar 
argument deals with the separated case. 

For the case when h is local, we first note that by cases already dealt with 
g must be connected and tidy; so the Beck—Chevalley condition f*g, S hsk* 
holds. But f* and k* are comonadic, so we may use (the dual of) the adjoint 
lifting theorem A1.1.3(i) to lift the right adjoint of h, to one for g,. By 3.6.1 (iv), 
this is sufficient. 

Finally, suppose h is totally connected. Then g is connected and locally 
connected, by cases already dealt with, so we need only show that the left adjoint 
gı of g* is cartesian. But f*g S hik*, by the Beck—Chevalley condition for locally 
connected morphisms; hk* is cartesian by assumption, and f* is cartesian and 
conservative, so the result follows. o 


Next, we mention a variant of the notion of descent morphism, namely that of 
a lax descent morphism. Suppose given a geometric morphism f: F — E between 
bounded S-toposes; instead of forming the pullback of f against itself, we could 


alternatively form the comma object 


dı 
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(f | f) ——~> 
do | 
f 
For 


F 
| 
E 
in BTop/S, and then the pullback 
htt) > HI 


do 


I= 


and so on. ((f | f | f) is the universal solution to the problem of finding an object 
equipped with three morphisms do,d1,d2 to F and two 2-cells fd2 —> fd, —> 
fdo.) It is easy to see that this gives rise to a simplicial object in BYop/S 
(in fact an internal category, though not an internal groupoid), and f* induces 
a functor from E to the category of ‘lax descent data’ for (the 2-truncation of) 
this simplicial object. The notion of lax descent data was defined in B3.4.14(a), 
and we observed there that the category of lax descent data for a 2-truncated 
simplicial object in BTop/S is always a bounded S-topos. Of course, we say that 
f is a lax descent morphism in BTop/S if this functor is part of an equivalence. 
(Note that this notion depends on the base topos over which f is considered 
as lying, since the comma object (f | f), unlike the pullback F xe F, changes 
if we vary S. In particular, if we take S to be € itself then the comma object 
coincides with the pullback, so f is a lax descent morphism over € iff it is a 
descent morphism as previously defined.) 

In 2.3.17 and 3.4.18, we noted that the Beck—Chevalley condition holds for 
comma, squares in BYop/S, one edge of which is either S-essential or relatively 
tidy. From this, and from the fact that the ‘opposite edges’ of such squares are 
actually locally connected (resp. tidy), we may deduce 


Proposition 5.1.8 S-essential surjections, and relatively tidy surjections, are 
lar descent morphisms in BIop/S. go 


The proof is exactly like that of 5.1.1. 


The reader will have observed that, in the proof of 5.1.4, we did not show 
that an equivalence relation a,b: R= X in Loc(S), for which a and b are either 
open or proper, is necessarily effective. Indeed, this is not true in general (and 
we deliberately omitted this condition, which would have simplified the proof of 
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B1.5.7, from the hypotheses of that scholium, in order that we could apply it in 
the situation of 5.1.4). However, it is useful to have a criterion for an equivalence 
relation whose ‘legs’ are open or proper to be effective in Loc(S). 


Lemma 5.1.9 Leta,b: R= X be an equivalence relation in Loc(S), and sup- 
pose either that a is open and (a,b): R= X x X is proper, or that a is proper 
and (a,b) is open. Then (a,b) is effective. 


Proof The two cases are, as usual, dual to each other; we shall prove the first. 
Let q: X — Y be the coequalizer of a and b, and c,d: S — X the kernel-pair of 
q; then O(S) may be identified with the tensor product O(X) Qoy) O(X) of 
O(Y )-modules in CjSLat(S), by 1.1.9. We claim that the frame homomorphism 
O(S) — O(R) corresponding to the factorization u: R— S of (a,b) through 
(c, d) reflects the bottom element; clearly, this homomorphism sends a generator 
U V of the tensor product (where U,V € O(X)) to the element a*(U)Nb*(V) 
of O(R). 
But if a*(U) b*(V) = @, then we have 


Ø =a,(a"(U)Mb*(V)) since a; preserves joins 
=Unab*(V) since a is open 


=UNnq*q(V)_ by the proof of 5.1.4. 


But we also have VN q*q(V) = V, soUNV =ONV = @ as an element of 
the tensor product O(X) oy) O(X). Since every element of O(S) is a join 
of elements of the form U & V, the claim is established. But if R — X x X 
is proper, then it is a closed inclusion by 3.2.6; hence R — S is also a closed 
inclusion. And if it is the closed sublocale CW for an open W € O(S), then the 
foregoing argument shows that W = 0. So R — S is an isomorphism. 

The dual case is similar, using the preframe tensor product O(X )@oiy) O(X ) 
and showing that the frame homomorphism from the latter to O(R) reflects the 
top element. 


Remark 5.1.10 The result of 5.1.9 is not best possible. By examining the 
proof given in the open case, we see that it actually shows that the frame homo- 
morphism u*: O(S) — O(R) has the property that if u*(W) < a*(U), then 
W < (U); for if W is a union of elements U;  V;, then for each i we obtain 
Ui Na*q(V;) = a(a*U; N b*V;) < aa*U < U, and so we can rewrite U, ® V 
as (U, q*qV,) © Vi, which lies below (U @ 1) = c*(U). So the argument in 
fact shows that the image of u is fibrewise dense over X, in the sense defined 
in 1.1.22; hence if we merely assume that R — X x X is a fibrewise closed 
inclusion over X, then we may conclude that R 3 X is effective. (Even this 
result is not best possible; we shall see in 5.3.4 that the fibrewise closedness is 
automatic if we simply assume that R — X x X is an inclusion, and not merely 
a monomorphism in Loc.) 
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There is a similar ‘fibrewise’ weakening of the second hypothesis in the proper 
case, which will be found in [1203]. 


Before leaving this section, we prove a long-promised result, which enables 
us to extend our results on cofiltered limits of toposes from the countable case 
to the general case. The key lemma could in fact have been proved at any time 
since 3.1.20 (indeed, the major part of it could have been proved at any time 
since 1.2.8), but since its applications all involve the descent theorem it seemed 
sensible to place it here. 


Lemma 5.1.11 Let S be a topos with a natural number object, and C a filtered 
internal category in S. Then there exists an open localic surjection p: S! — S and 
a final functor N > p*C in S', where N denotes the poset obtained by equipping 
the natural number object with the order-relation of A2.5.13. 


Proof As in 1.2.8, the basic idea is to take S’ to be the classifying topos for 
the propositional theory of final functors N — C. To define the syntactic site 
of this theory, we take the poset P in S whose elements are functors f: [p] — C 
from finite cardinals to C (where we identify a cardinal [p] with the strict down- 
segment {0,1,...,p — 1} of p in N, and order it as a sub-poset of N), ordered 
by f < g iff f extends g. We impose a coverage T on P by saying that, for 
each f: [p] — C in P and each c: Co, the set of g < f for which there exists a 
morphism c — g(q — 1) in C (where [q] is the domain of g) is a covering sieve. It 
is easy to check that the set of all such sieves satisfies the stability condition (C) 
in the definition of a coverage; so we have a topos S’ = Shs(P,T) ~ Shs(X), 
where X is the locale corresponding to the frame of T-ideals of P. Moreover, P 
has a top element and the T-covering sieves are all inhabited (the latter because 
C is filtered), so by 3.1.20 S’ — S is an open surjection. 

Now, for any S-topos q: F — S, geometric morphisms F — S’ over S corre- 
spond to flat cover-preserving functors P — F; but any such functor G defines a 
functor g: N — q*C in F (namely, the union in F of all those f: [p] — C which 
are mapped by G to 1), which has the property that 


(Ve: Co)(Sn: N)(sa: Cy)(a@: c > g(n)) 
is valid in F. Since C is filtered, this condition is sufficient for finality. O 


In view of B2.6.13, it would have been sufficient to prove 5.1.11 for directed 
posets C, but in fact it is no harder to prove the result for arbitrary filtered 
categories. Using 5.1.11, we may now extend all our results on limits of inverse 
sequences in BTop/S to arbitrary cofiltered S-indexed limits. For example, 
we have 


Corollary 5.1.12 The localic reflection functor BTop/S — LTop/S pre- 
serves arbitrary cofiltered S-indezed limits. 
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Proof Suppose C is a cofiltered internal category in S, and suppose we have 
a diagram of bounded S-toposes (Ee | c: Co) of shape C. Let F. denote the 
localic reflection of E., for each c. Applying 5.1.11 to C°P, we obtain an open 
surjection S’ — S and an initial functor g: N°? — C in S’. If we form the 
pullbacks €! = S’ xs E and F; = S' xs Fe, then by 2.4.12 we know that 
F! is the reflection of E! in LLop/S’; also, since pullback preserves limits, we 
have limc(€.) ~ S’ xs limc(€,), and similarly for the Fe. But we also have 
lime (EZ) ~ limn (Ejn), and hence by 2.5.12 we know that limc(F;) is the 
reflection of limc(E,) in LTop/S’. Now we have a pullback square 


lime(€,) —> lime(Ec) 


lime(F) —> lime(F.) 


of which the top and bottom edges are open surjections and the left vertical 
edge is hyperconnected; so the right vertical edge is hyperconnected by 5.1.7. 
But limc(F,) is localic over S, since LLop/S (being reflective in BTop/S) is 
closed under arbitrary limits; so the result follows. O 


Corollary 5.1.13 Suppose given a cofiltered S-indexed diagram in BYop/S, 
whose transition maps are all surjective (resp. open and surjective, proper, con- 
nected and locally connected, connected and atomic, local, totally connected). 
Then the legs of the colimiting cone satisfy the same condition. 


Proof In all cases except the proper one, these follow in the same way as 
5.1.12 from the corresponding results for limits of inverse sequences (2.5.14, 
3.1.22, 3.3.13, 3.5.11, 3.6.7(v), 3.6.18(iv)), plus the facts (proved in 5.1.7) 
that all the classes of geometric morphisms concerned descend along open 
surjections. 

Since we did not prove the result for inverse sequences of proper maps, we 
must argue rather differently in that case. First we observe that, without loss 
of generality, we may assume that the base topos S is one of the vertices of 
our diagram; if not, we choose a particular vertex £. and work in BTYop/Ec, 
replacing our original index category C by the slice category C/c. Now we form 
the hyperconnected-localic factorization Ee + Le — S of each E, — S; since 
hyperconnected maps are proper, it follows from 3.2.16 that the transition maps 
between the L. are proper, and hence that the maps from the inverse limit Loo 
to the Le are proper by 3.2.11. In particular, since S is one of the Le, Loo 18 
compact over S. By 5.1.12, it follows that the limit €,, of our original diagram is 
also compact over S. But we saw that we could replace S by any of the vertices 
of the diagram, so this in fact means that the maps E» — Ee are all proper. 
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Suggestions for further reading: Grothendieck [421], Joyal & Tierney [560], 
Kock [627], Moerdijk [827,835], Vermeulen [1203]. 


C5.2 Groupoid representations 


In this section we shall prove the fundamental representation theorem of A. Joyal 
and M. Tierney [560] to the effect that every Grothendieck topos is equivalent 
to the topos of continuous actions of a localic groupoid. Here, and throughout 
this section, ‘Grothendieck topos’ is once again used loosely to mean ‘bounded 
S-topos’, where S is any topos with a natural number object (the natural number 
object is essential: see 5.2.2 below); but we shall as usual treat S as if it were 
the classical category of sets, pausing only occasionally to point out any extra 
difficulties that may be encountered in ‘internalizing’ the arguments to a general 
topos. 

The key idea behind the representation theorem is to combine the descent 
theorem of the previous section with a theorem asserting that there are ‘enough 
localic S-toposes’ in the sense that every bounded S-topos is an open surjective 
image of a localic one. We shall give several different proofs of this theorem, since 
each of them throws slightly different light on the structure of the 2-category 
BTop/S. We begin with the first published proof of the theorem, which is 
essentially due to R. Diaconescu [282] (although the concept of open map did 
not appear explicitly in Diaconescu’s work). 


Theorem 5.2.1 Let S be a topos with a natural number object. For any 
bounded geometric morphism p: E — S, there exists an open surjection f: F — E 
such that the composite pf is localic. 


Proof We consider first the case when E has the form [C, S] for an internal 
category C in S; in this case we shall construct a functor d: P — C, where P 
is a poset, which is surjective on objects (so that the geometric morphism it 
induces is surjective) and satisfies the condition of 3.1.2 (so that the morphism 
it induces is open). This clearly suffices, since [P,S] is localic over S if P is a 
poset (A4.6.2(d)). We shall argue as if S were the topos of sets, but it should be 
clear that the argument can be internalized in any topos with a natural number 
object. 

We take the objects of P to be all finite composable strings of morphisms 


of C, ordered by setting @ = (a),...,@n) < b = (b,,..., 6%) iff @ is a terminal 
segment of b, i.e. n < k and a; = bi+k-n for all i. The functor d: P — C sends 
(@1,...,a,) to the codomain of a;, and an inequality @ < b to the morphism 


of C obtained by composing the morphisms which are in b but not in @. It is 
clear that d is surjective on objects, and that it satisfies the condition of 3.1.2 
(indeed, in the latter, we may take r and i to be identity morphisms: given a 
string g and b: d(@) — V, we define Z to be the string obtained by adjoining b 
to the beginning of @, and then we have g < č and d(@ < &) = b). 
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For the general case, we represent € as a subtopos of one of the form [C, S], 
and then form the pullback 


FS Se 


[P,S] ——~> [C,S] 


where P is constructed as above. Then 3.1.26 ensures that the top edge of this 
diagram is an open surjection; and the left-hand side is an inclusion, so F is 
localic over S. D 


Remark 5.2.2 The assumption that the base topos S has a natural number 
object cannot be omitted from the hypotheses of 5.2.1: the result fails when S is 
the topos Set s of finite sets. Indeed, let C be the finite category with two objects 
U and V and three non-identity morphisms e: U > U, r: U —> V andi: V — U 
satisfying ri = ly and ir = e = e? (equivalently, C is the Cauchy-completion 
of the two-element monoid which is not a group). Then we claim that there 1s 
no open surjection F — |C, Set] such that F is localic over Set p. For we know 
by 2.2.22 that every bounded Set s-topos is of the form [P, Set] where P is a 
finite category, and clearly the localic ones are those for which we can take P 
to be a finite poset. Moreover, since C is Cauchy-complete, we know by 2.2.23 
that every geometric morphism [P, Sets] — [C,Set,] is induced by a functor 
P — C. Thus finding a geometric morphism as above is equivalent to finding 
a finite poset P and a functor f: P — C whose image contains the object U 
(for otherwise the corresponding geometric morphism would factor through the 
inclusion Set; — [C,Set,] whose inverse image is given by evaluation at V), 
and which satisfies the condition of 3.1.2. Suppose given such an f: let Po be an 
element of P with f(Po) = U, and apply the condition of 3.1.2 to the morphism 
e: f(Py5) > U. Since U is not a retract of any other object of C, we deduce 
that there must exist P) > Po in P such that f(P,) = U and the inequality 
Py < Pı maps to the endomorphism e. Clearly, P, # Po, since e is not an 
identity morphism; so on repeating the argument we deduce that P contains 
an infinite ascending chain Po < P; < Pz < ---; in particular, P is infinite. 
contradicting the assumption that [P, Set p] is a bounded Set ;-topos. (A similar 
argument using condition (ii) of 3.2.14 shows that [C,Set;] does not admit any 
proper surjection from a localic Set s-topos.) 


Remark 5.2.3 The functor d: P— C of 5.2.1 has a natural factorization 
through the counit map FC — C, where FC is the free category on the underly- 
ing directed graph of C (that is, the category whose objects are those of C and 
whose morphisms are composable strings of morphisms of C). Specifically, the 
factorization d sends an object @ to the codomain of the first morphism in &. 
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and an inequality g < b to the string of morphisms in g but not in b. It is clear 
that d is a (surjective) discrete opfibration, so that we may identify [P, S] with 
a slice category of the form [FC’,S]/D where D is well-supported; moreover, the 
counit map FC — C is full and surjective on objects, so that by A4.6.9 the geo- 
metric morphism which it induces is hyperconnected. Since both hyperconnected 
morphisms and surjective local homeomorphisms are stable under pullback, we 
deduce that for every bounded S-topos E, there is a hyperconnected morphism 
F — E and a well-supported object D of F such that F/D is localic over S. 


The French term étendue is often used for a ‘locally localic’ topos, i.e. a topos 
E containing a well-supported object A such that €/A is localic (cf. [1051]). We 
note in passing: 


Lemma 5.2.4 Let C be a small category. Then the functor category |C, Set] is 
an étendue iff every morphism of C is epic. 


Proof Since any slice of [C, Set] is equivalent to a category of the same form 
(A1.1.7), it is easy to see that [C, Set] is an étendue iff there exists a surjective 
discrete opfibration P — C such that P is a poset. But every morphism in a poset 
is epic, and it is easy to see that this property ‘descends’ along surjective discrete 
opfibrations. Conversely, if every morphism of C is epic, then the domains of the 
discrete opfibrations correponding to the functors C (U,—) are all posets; and 
their coproduct maps surjectively to C. CJ 


Of course, 5.2.4 may be ‘internalized’ to an arbitrary topos S. Clearly, the 
free category on a directed graph has the property that all its morphisms are epic 
(and monic), so this provides an alternative proof of 5.2.3 (and hence of 5.2.1). 

For completeness, we also note: 


Lemma 5.2.5 A Grothendieck topos E is an étendue iff there exists a site of 
definition (C, J) for E such that every morphism of C is monic. 


Proof One direction follows easily from 5.2.4: if we are given such a site (C, J), 
then there exists a well-supported object A of [C°P, Set] such that [C°P, Set]/A 
is localic over Set, and hence Sh(C, J)/i*A is localic, where i is the inclusion 
Sh(C, J) — |C°?, Set]. 

Conversely, suppose € is an étendue; let A be a well-supported object of 
E such that €/A is localic over Set. Let C be the (non-full!) subcategory of 
objects of C which admit monomorphisms to A, and monomorphisms between 
them; we claim that the inclusion C — E satisfies the conditions of 2.2.1 (for 
the canonical coverage on €), so that by 2.2.16(i) C becomes a site of definition 
for € when equipped with the induced coverage. For the first condition, suppose 
given f,g: BC in E with f # g; then A*f # A*g in E/A, and since the 
latter topos is localic there exists A’ — A and a morphism h: A’ — B such that 
fh # gh. So, for any object B of £, the family of all morphisms from objects 
of C to B is epimorphic. For the second condition, suppose given f: B — A’ in 
E where A’ admits a monomorphism m: A’ — A; then, regarding the composite 
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mf as an object of €/A, we have an epimorphic family of morphisms to it 
from subterminal objects — which says that we have an epimorphic family of 
morphisms A” — B such that the composites A” — B — A’ are monic, and so 
lie in the subcategory C. 0O 


The following refinement of 5.2.1 is sometimes of use. 


Scholium 5.2.6 Let E be a bounded S-topos, and suppose E has a point 
s: S — E (that is, a section of the structure map p: E > S). Then it is possible 
to choose the open surjection f of 5.2.1 so that s factors through it. 


Proof As before, we first consider the case when € = |C, S] for some internal 
category C in S. Then we know that points of € correspond to C-torsors in S, by 
B3.2.7; equivalently, they are objects of the ind-completion of C (qua S-indexed 
category), as defined in Section C4.2. So we may think of s as being defined 
by an ind-object (U, | i € I) of C; and, by B2.6.13, we may assume that the 
indexing category I is actually a directed poset. Now we modify the definition of 
the poset P constructed in 5.2.1: in addition to the objects considered there, we 
also take all the objects of I together with all pairs (@,i), where 7 is an object 
of I and @ is a composable string of morphisms of C such that the domain of its 
last member is U;. The ordering is defined by saying that the ‘pure strings’ are 
ordered as they were before, we have (@,i) < (b, j) iff i = j and @ is a terminal 
segment of 6, and that i < (a,j) and i < j (where i,j are both objects of I) iff 
the relation i < j holds in I. The functor d: P > C sends a ‘pure string’ Ẹ or 
an object of the form (d,i) to the codomain of the first member of G@; it sends 
an object 7 of I to U;, and an inequality i < 7 to the appropriate morphism 
in the ind-object (U, | i € I). The verification that d is surjective on objects, 
and that it satisfies the condition of 3.1.2 for openness, is exactly as in 5.2.1; but 
the inclusion I — P now defines an ind-object of P, and hence a point of [P, S], 
whose image under the geometric morphism induced by d is clearly the given 
point of €. 

In general, we have an inclusion E€ — [C, S] for some C; we choose P as above 
for the point 5 of [C, S] which is the image of our given point of £, and form the 
pullback 

F ——>€E 


[P, S ——+ [C,S] 


Since the point § factors through both € and [P, S], we obtain a factorization of 
it through the pullback, as required. 


The argument of 5.2.6 may clearly be generalized from a single point to a 
whole S-indexed family of points (that is, a morphism S /I — € for some I ). 
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In this connection we should mention that C. Butz and I. Moerdijk [214] have 
shown that, if the topos € has enough points (cf. 2.2.12), then we may choose its 
open cover F to have enough points also; that is, we may take it to be Sh(X) 
for a space X, and not merely a locale. 

We remark also that if, in the first part of the proof of 5.2.6, the chosen point 
of [C,S] had happened to be defined by an ind-object (U, | n € N) indexed 
by the directed set N of natural numbers, then we should not have needed to 
enlarge our original poset P at all, since it would already contain a ‘copy’ of 
this ind-object; but more complicated directed sets cannot be embedded in our 
original P, and so we do have to enlarge it as indicated. 

The result of 5.2.1 is not best possible: we may require stronger conditions 
on the morphism f. 


Theorem 5.2.7 Let S be a topos with a natural number object. For any 
bounded geometric morphism p: E — S, there exists a connected and locally 
connected morphism f: F — E such that the composite pf is localic. 


Proof Recall that in 3.3.8 we gave conditions on a functor between small 
categories for it to induce a connected and/or locally connected morphism of 
Set-valued functor categories; the arguments used there were constructive, and 
so can be applied to functors between internal categories in a topos S. It is not 
possible, for a general internal category C, to find a functor f: P — C satisfying 
these conditions such that P is a poset; but we saw in 3.3.9 that we can do so 
in the case when C is the particular internal category S, of finite cardinals in S. 
And this is enough; for the functor category [S;, S] is the object classifier S[O] for 
Top/S, as we saw in B3.2.9, and in D3.2.5 we show that every bounded S-topos 
E€ admits a localic morphism to S[O] (in fact it suffices to take the classifying 
morphism of any bound for € over S). Thus we may form the pullback 


ae [P, S] 
g d 


£ > SJO] 


where e is localic and P is the poset of finite partial equivalence relations on N, 
as in 3.3.9; then g is connected and locally connected since such morphisms are 
stable under pullback by 3.3.15, and h is localic for a similar reason, so that F 
is localic over S since composites of localic morphisms are localic. CJ 


Remarks 5.2.8 (a) Theorem 5.2.1 could have been proved in the same style 
as 5.2.7, by first constructing a covering of the particular internal category S f 
satisfying the conditions of A4.2.7(b) and 3.1.2, and then pulling back along a 
localic morphism € — |S, S]. However, in that case it is no more difficult to 
construct the covering for an arbitrary internal category. 


772 TOPOSES AS GROUPOIDS C5.2 


(b) We note that 5.2.7 is ‘best possible’ in the sense that we cannot impose 
any stronger conditions on the covering map f: F — E, at least among those 
considered in Chapter C3. For if we require f to be atomic and connected, then 
it is hyperconnected by 3.5.4(i); but it is also localic since F is localic over S, and 
hence an equivalence, so € itself must be localic over S. Even if we were to weaken 
the condition of connectedness to surjectivity, we still could not require f to be 
atomic; for if it were, it would then be a local homeomorphism by 3.5.4(iii), and so 
E would be an étendue. (We saw in 5.2.3 that any Grothendieck topos € admits a 
hyperconnected covering by an étendue F, but once again we cannot require this 
covering to be atomic; for if it were, then the composite F/A — F — E (where 
A is a well-supported object such that F/A is localic over S) would be atomic 
and surjective. In Section C5.4 below we shall investigate the ‘best possible’ 
conditions we can impose on a class of toposes F such that every Grothendieck 
topos admits a connected atomic covering by a topos F in the class.) Nor can 
we require f to be totally connected in the sense of 3.6.16; for if it were, then its 
right adjoint d: E€ — F would be an inclusion, and this would again force E to be 
localic over S. We shall also see in 5.2.10 below that we cannot in general require 
the covering map f to be proper, even if we abandon the openness requirement 
altogether. 

(c) It is possible to give proofs of both 5.2.1 and 5.2.7 making more extensive 
use of the theory of classifying toposes. (Our reason for giving more prominence 
to the proofs above was mainly to make the results more accessible to any read- 
ers who have not (yet) familiarized themselves with the ideas of Part D; there is 
a sense in which the argument via geometric theories is the ‘right’ way to prove 
these results.) For 5.2.7, we note that if P is the poset of finite partial equivalence 
relations on N, then [P, S] is none other than the classifying topos for the propo- 
sitional theory of partial equivalence relations on N (since the latter is a cartesian 
propositional theory; over Set, it may be axiomatized by an (N x N)-indexed 
family of primitive propositions R(n,n’) with axioms (R(n,n') + R(n',n)) and 
((R(n,n') A R(n',n”)) F R(n,n”)) for all n, n’ n”). And the geometric mor- 
phism [P, S] — [Sj, S] induced by d is none other than the classifying map of 
the ‘generic subquotient of N’ which exists in this classifying topos. Thus, when 
viewed as a topos defined over [Sf, S], it becomes the classifying topos for the 
theory of partial enumerations of a particular object G (that is, the theory of 
partial maps N — G whose image is the whole of G), namely the generic object 
itself. We encountered this theory in 3.3.11(c), where we saw that it gave rise to 
a connected and locally connected site. (Note also that we may proceed directly 
to the construction of the topos F appearing as a pullback in the proof of 5.2.7, 
as the classifying topos over € for the theory of partial enumerations of some 
bound B for E over S.) 

The construction which we gave in 5.2.1 for an open surjection from a localic 
topos to a general bounded S-topos does not lend itself so easily to a description 
in terms of classifying toposes, but a closely related construction (which was in 
fact the one given by Joyal and Tierney [560]) may be obtained as follows. First 
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of all, we note a slight modification of the result of D3.2.5, which we quoted 
in the proof of 5.2.7 above: namely, that every Grothendieck topos € admits a 
localic morphism to the classifying topos S[Q,] for the theory of inhabited objects 
(that is, the theory with one sort, no primitive symbols except equality, and the 
single axiom (T + (3x)T)). We remark that Set/O,] may readily be identified 
as the functor category |C, Set] where C is the category of nonempty finite sets 
(and all functions between them); but we shall not need this information. To 
obtain a localic morphism € — S[Q,], it suffices to observe that any bound for 
€ over S is necessarily well-supported, and so its classifying map factors through 
SiO] — S[O}. 

Thus it suffices to construct an open surjection f: F — S[O,]| with F localic 
over S: for this, we take F to be the classifying topos for the propositional theory 
of (total) equivalence relations on the natural number object N, and f to be the 
classifying map of the quotient of N by the generic equivalence relation. Then, 
viewed as a topos over S[Q;], f classifies the theory of (total) enumerations of 
the generic inhabited object Gi; more generally, its pullback along a morphism 
E — S/Q,] classifies the theory of enumerations of the particular object of E 
classified by the latter. But we saw in 1.2.8 above how to construct a site for the 
theory of enumerations of a given inhabited object A; we observed there that 
the locale to which it gives rise is a dense sublocale of a nonempty spatial locale, 
but it is actually strongly dense, since all covers in the site are inhabited (cf. the 
remark after 3.1.18). So the fact that it gives rise to an open surjection follows 
from 1.2.14(ii) and 3.1.14(ii). 

We note that this construction, although rather simpler than the earlier one 
using partial enumerations, is actually substantially different from it: the inter- 
nal locale in € which it produces is very far from being locally connected. Indeed, 
if the inhabited object A whose enumerations are being classified is decidable, 
then the locale is actually zero-dimensional, since each sub-basic open set (cor- 
responding to a primitive proposition p(n, a) of the theory of 1.2.8) has an open 
complement, corresponding to the disjunction of the p(n, b) with b £ a. 


In the previous section, we proved the descent theorem for proper surjections 
as well as open ones. As we remarked above, not every Grothendieck topos 
admits a proper surjection from a localic topos; but 3.2.15 enables us to identify 
an important class of toposes which do. 


Lemma 5.2.9 Let E be a Grothendieck topos containing a pre-bound (in the 
sense of D3.2.3) which is Kuratowski-finite in the sense of D5.4.1. Then there 
is a proper surjection f: F — E such that F is localic over Set. 


Proof In D3.2.10, we show that the classifying topos for K-finite objects in 
Set-toposes may be taken to be the functor category [Set fe, Set], where Set fe is 
the category of finite sets and surjections between them. By D3.2.6(i), a K-finite 
pre-bound for € corresponds to a localic geometric morphism E£ — [Set Fes Set]. 
In 3.2.15, we constructed a proper surjection [C, Set] — [Set j., Set] where C is 
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a poset (it is surjective because the functor inducing it is surjective on objects); 
pulling this back along E — [Set s,.,Set], we obtain the required localic cover 
of E. g 


The condition that € should have a K-finite pre-bound is equivalent to say- 
ing that it is the classifying topos for a (single-sorted) geometric theory whose 
models are all K-finite (cf. D3.2.5). The converse of 5.2.9 is false; we shall see 
a counterexample in 5.2.12 below. Nevertheless, not every Grothendieck topos 
admits a proper surjection from a localic topos: 


Example 5.2.10 Let G be a (discrete) group. We claim that the topos [G, Set] 
admits a proper surjection from a localic topos iff G is finite. For we know that 
[G, Set] is a classifying topos for G-torsors (cf. B3.2.4(b)); hence a geometric 
morphism Sh(X) — [G, Set] corresponds to a local homeomorphism p: E —> X 
which is locally isomorphic to X x G. Now the slice category [G, Set]/G (where 
G, as an object of [G, Set], is the generic G-torsor) is equivalent to Set; so if 
Sh(X) — [G, Set] were proper, its pullback along [G, Set]/G — [G, Set] would 
be a proper map Sh(E) — Set, i.e. E would be compact. But E cannot be 
compact if G is infinite: for we can cover X by opens U; (indexed by some set 
I) such that each p~!(U;) is homeomorphic to U; x G, and then cover E by (the 
homeomorphic images of) the sets U; x {g}, g € G,i € I; and this cover clearly 
has no finite subcover. On the other hand, if G is finite, then every G-torsor is 
K-finite; so [G, Set] admits a proper surjection from a localic topos, by 5.2.9. 
(In fact the morphism Set ~ [G, Set]/G — [G, Set] is proper in this case.) 


Now suppose € is a bounded S-topos, and we have found a descent morphism 
f: F — E to E from a topos F which is localic over S. Then f is itself localic by 
A4.6.2(f), and hence so are the projections Fx ¢F = F, by B3.3.6. Hence Fx eF 
is itself localic over S, by A4.6.2(e). Similarly, the triple pullback F xe F xe F 
is localic over S; so we can regard the entire (truncated) simplicial topos 


F xe F xe F Z F xe fF EZF 


as living in the category Loc(S) of internal locales in S. Moreover, the inclu- 
sion Loc(S) — Yop/S preserves limits, by 1.4.8; hence the simplicial locale Ge 
defined by setting the (n+1)-fold pullback of F over € equal to Shs(G,,) inherits 
the same limit-preservation properties as the simplicial topos. In particular, the 
diagram 


do 
G2 ——> Gı 
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is a pullback in Loc(S); but this (plus preservation of the pullbacks involving 
G3, to ensure associativity of the composition defined by d?) means that G, is 
an internal category in Loc(S), as we saw in B2.3.1. And since we have a twist 
isomorphism G; — G; satisfying the appropriate equations, it is actually an 
internal groupoid in Loc(S). 

As we observed in B3.4.14(b), the functoriality of the assignment X > 
Shgs(X) is all we need to conclude that an internal groupoid G in Loc(S) gives 
rise to a topos Conts(G) of continuous actions of G: the objects of this topos 
are sheaves on Go (equivalently, local homeomorphisms E — Go) equipped with 
action maps dj(E) — dj(£) over G, satisfying the unit and cocycle conditions. 
However, in the case currently under consideration, we do not need the proof in 
B3.4.14 that this category is a topos, since the descent theorem tells us that we 
have Conts(G) > Desc(F,) > E. 

We shall say that a localic groupoid G is open (resp. proper, locally connected, 
tidy, ...) if its domain and codomain maps d;, do: G1 3 Go have the property in 
question. Note that this will automatically be the case (for any pullback-stable 
property of geometric morphisms) if the covering morphism F — E from which 
we start has the corresponding property; also that do has one of the properties 
iff dı does, since the inverse map G; — G, is an isomorphism dọ = d, in 
Loc(S)/Go. Thus the following results are all now immediate from the covering 
theorems proved earlier in this section, plus the descent theorems of Section C5.1. 


Theorem 5.2.11 Let S be a topos with a natural number object. 


(i) For any bounded S-topos E, there exists a connected and locally connected 
localic groupoid G in S such that E ~ Conts(G). 


(ii) For any étendue E over S, there exists an atomic localic groupoid G in 
S (that is, one whose domain and codomain maps are local homeomor- 
phisms) such that E ~ Conts(G). 


(iii) For any bounded S-topos E having a K -finite pre-bound over S, there exists 
a proper localic groupoid G in S such that E =~ Conts(G). 


Proof (i) is immediate from 5.2.7 and the fact (5.1.1) that locally connected 
Surjections are descent morphisms. (ii) similarly follows from the very definition 
of an étendue (5.2.4) and the fact that surjective local homeomorphisms are 
descent morphisms. (iii) follows from 5.2.9 and (the proper case of) 5.1.6. J 


Example 5.2.12 The converse of 5.2.11(iii) is false. Let G be any profinite 
group which is not discrete: for example, the profinite completion of the additive 
group Z (that is, the inverse limit of its finite quotients in the category of topo- 
logical groups). Then G is compact, and so (G = 1) is a proper localic groupoid. 
However, any K-finite object of Cont(G) has finite underlying set (by D5.4.12, 
Since the forgetful functor Cont(G) — Set is an inverse image functor), and 
hence lies in the subcategory Unif(G) defined in A2.1.7; from this it is easy to 
see that it cannot be a bound, or even a pre-bound, for Cont(G) over Set. We 
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remark also that the geometric morphism Set — Cont(G) is proper, by 5.3.6 
below; thus Cont(G) also provides a counterexample to the converse of 5.2.9, as 
promised. 


There is one further case of particular interest. If E is connected and atomic 
over S, and it has a point s$s: S — E, then the latter is automatically an open 
surjection by 3.5.6. In this case, we obtain a representation of E as Conts(G), 
where G is a localic groupoid in S satisfying Go = 1 ~ that is, a localic group. 
Thus we have 


Theorem 5.2.13 Any connected atomic S-topos with a point is equivalent to 
one of the form Conts(G), where G is an open localic group in S (i.e. one such 
that G — 1 is open). O 


Remarks 5.2.14 (a) The assumption that the topos has a point cannot be 
omitted from 5.2.13: we give examples of (bounded) connected atomic morphisms 
with no sections in 5.4.12 below, and also in D3.4.14. (There are also unbounded 
examples, as we noted in 3.5.7.) 

(b) The converse of 5.2.13 is also true: we already know this in the case when 
G is a topological group (i.e. its underlying locale is spatial), but it requires a 
little more proof if we merely assume G is an open locale. We shall provide the 
latter in 5.3.8 below. 

(c) If the topos € in 5.2.13 is the classifying topos (over S) for a geometric 
theory T, then we may describe the group G appearing in the theorem a little 
more explicitly. The point s of course corresponds to a particular T-model M 
in our base topos S; thus, as a topos defined over E, it may be thought of 
as Classifying the theory of isomorphisms between the generic T-model and our 
particular model M. Thus the pullback S x¢ S classifies the theory of a pair of 
such isomorphisms; but a pair of such ismorphisms must differ by some particular 
automorphism of M, and so we can identify this topos with Shs(G), where G is 
the ‘localic group of automorphisms’ of M. At least in the case when S = Set 
and the theory T is coherent, we may identify this with a topological group 
(that is. the locale G is spatial): it is the actual group of automorphisms of 
M, topologized with the ‘topology of pointwise convergence’ — i.e. the basic 
neighbourhoods of a given automorphism a are the sets 


{8 | 8 agrees with a on F} 


as F ranges over the finite subsets of M. We shall meet several particular 
examples of such groups in Section C5.4 below, and also in Section D3.4. 

(d) A topos satisfying the hypotheses of 5.2.13 may have several non- 
isomorphic points, which may in turn have non-isomorphic automorphism 
groups: for example, the topos Set[D,.] of D3.4.10 has one isomorphism class of 
points for each infinite cardinality in Set, and these clearly have non-isomorphic 
automorphism groups. However, 5.2.13 tells us that all these groups are ‘Morita- 
equivalent’ in the sense that they give rise to equivalent toposes of continuous 
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actions. We shall explore this notion of Morita equivalence further in the next 
section. 


Suggestions for further reading: Butz & Moerdijk [213, 214], John- 
stone [524], Joyal & Tierney [560], Kock & Moerdijk [637, 638], Rosenthal 
[1047, 1048, 1051]. 


C5.3 Morita equivalence for groupoids 


We have seen in the last section that every bounded S-topos may be represented, 
up to equivalence, in the form Cont s(G) for some localic groupoid G in S. We 
have also seen, in B3.4.14(b), that every localic groupoid in S does give rise 
to a bounded S-topos in this way. But there are clearly many different localic 
groupoids giving rise to any given S-topos; so, for this representation to be 
useful, we need to have some control of the relation between localic groupoids 
which says that they give rise to equivalent categories of continuous actions. 
Borrowing a term from model theory (cf. D1.4.9), we shall call this relation 
Morita equivalence. 

For most of this section, we shall be working with localic groups and groupoids 
in a fixed base topos S, whose name we shall suppress whenever possible in order 
to simplify the notation; that is, we shall write Cont(G) and Sh(X) rather 
than Conts(G) and Shs(X). However, apart from a few points where we refer 
explicitly to Set, all our arguments will be constructive, and so applicable in an 
arbitrary topos S — except that, as in the earlier sections of this chapter, our 
standing hypothesis that S has a natural number object will remain in force. 

It will be recalled that we made some remarks about the problem of Morita 
equivalence, for the ‘classical’ case of a spatial group G, immediately follow- 
ing A2.1.6. Incidentally, from now on we shall tend to use the terms ‘spatial 
group’ and ‘spatial groupoid’, rather than ‘topological group’ and ‘topological 
groupoid’ to emphasize cases where the locales of objects and morphisms of a 
localic groupoid G are actually spaces (i.e. have enough points). Actually, there 
is a subtlety here: because the inclusion Sob — Loc does not preserve finite 
limits, a spatial group G (that is, an internal group in Sp) may fail to be a 
localic group even if its underlying space is sober, because the multiplication 
defined on the product G x G in Sp may fail to extend to the locale product; 
and similarly for spatial groupoids. A counterexample is given by the additive 
group Q of rationals with the topology inherited from the Euclidean topology 
on R, as we shall shortly see. However, we shall in practice fee] free to ignore 
those spatial group(oid)s which are not localic group(oid)s — we shall not lose 
anything significant thereby. 

We shall mainly concern ourselves with open localic groupoids (that is, those 
for which the domain and codomain maps G, =3 Go are open); but many of our 
results have parallels for proper groupoids, and some of them apply to arbitrary 
groupoids in Loc. We begin our investigations with a result which strikingly 
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illustrates the difference between localic and spatial group(oid)s: every localic 
subgroup of a localic group (in Set) is closed. Of course, this immediately tells 
us that the spatial group Q cannot be a localic group, since it is a dense subgroup 
of the localic group R (and the latter is a localic group, since its local compact- 
ness ensures that the locale product R x R and the spatial product (R x R)p 
coincide; cf. 4.1.8). In fact we shall prove the result for localic groupoids, and 
then specialize to groups. 

In general, given a locale Y over X, the notion of ‘fibrewise closed sublocale 
of Y over X’ obviously depends on the particular morphism Y — X which 
we have chosen, and not just on Y. However, if G is a localic groupoid, we 
shall tend to speak of sublocales of G, being fibrewise closed over Go, without 
specifying explicitly whether our structure map G; — Go is do or dı: the point 
is that, provided the sublocale in question is mapped to itself by the inverse map 
i: Gı — G; (as is clearly the case if it is the locale of morphisms of a subgroupoid 
of G), then it doesn’t matter which one we use, since i is an isomorphism dg — dı 
in Loc/Go. 


Theorem 5.3.1 Let G be a localic groupoid, and let H be a localic subgroupord 
of G (that is, we have a morphism f: H — G of localic groupoids such that both 
fo and fı are inclusions) such that the domain and codomain maps of H are 
open. Then fı: Hı — G; is fibrewise closed over Go. 


Proof We note first that we may reduce to the case when fo is an identity 
morphism, by replacing G by the groupoid ER 1 3 Ho) where 


G, >———> G1 


(di, do) 
fo x fo 


Ho x Ho >——> Go x Go 


is a pullback. For if Hı — Ci is fibrewise closed over Hp, it is still fibrewise 
closed over Go since fo is an inclusion (cf. 1.2.14(i)); and the inclusion Gi — G1 
is fibrewise closed over Go, since it is the intersection of the pullbacks of Ho — Go 
along d; and do. 

Now let H, denote the fibrewise closure of H} in Gi over Go. We claim 
that (Hı = Go) can be made into a subgroupoid of G; in particular, that the 
composition map dı: Gi xg, Gi — G; restricts to a morphism Hy, XGo H, > H, 
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(the other conditions are easy). We note that we have pullbacks 


FXE =. dz 
Hı X Go A, >—> A, X Go A LLRA Ay 
do do 
j 2 dı 


A, >> Hi —— Go 


where the domain and codomain maps H, = Go are open by 3.1.14(ii), and the 
inclusion 7: Hı — H, is fibrewise dense over d); so, again by 3.1.14(ii), 7 x 1 is 
fibrewise dense over dz: Hy XGo Hı — Hj, and hence over Go via either of the 
maps dıdz and dod2. But by applying 3.1.14(i) to the left-hand square above, we 
may also deduce that j x 1 is fibrewise dense over Go via the third projection 
dodo. Similarly, the inclusion Hy, XG Hı — H, x Ga Hı is fibrewise dense over Go, 
via any of the three projections; so their intersection Hı xg, Hı — Ho X Go Hı 
is also fibrewise dense over Go, by 1.2.14(iii). But dı : G2 + Gı may be regarded 
as a morphism over Go, for either the first or the third projection Gz — Go, and 
it restricts to a morphism Hı xg, Hı — Hj; so, by the functoriality of fibrewise 
closure (1.2.14(vi)), it also restricts to a morphism H; xG, Hı — Hj, as required. 
Now that we know H defines a subgroupoid of G, it follows that Hz = 
al xc, Hı is invariant under the map (ido, d1): G2 > G2 (where i: Gi — G; is 
the inverse map of G), and so is the sublocale Hı xc, Hı. But the diagram 


(ido, dı) 
G2 ——> G2 


d 
dy i 


Gı 


commutes; so, from the fact that Hı xg, Hı — H3 is fibrewise dense over dz, 
we deduce that it is also fibrewise dense over dı. Similarly, Hı XG, Hı — Hy 
is fibrewise dense over dı, and hence so is their intersection H3 — Hə. But 
dı: Hy — H; is (split) epic, so its restriction to H3 is also epic, by 1.2.14(ii). 
And this restriction factors through H; — Hy, by assumption; so the latter 
must be an isomorphism, i.e. H is fibrewise closed in G, over Go. Oj 


Specializing to the case of groups, we immediately obtain 


Corollary 5.3.2 Let G be a localic group. Then any localic subgroup of G 
which is open as a locale is weakly closed in G. In particular, if G is a localic 
group in a Boolean topos, then every localic subgroup of G is closed. O 
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Remark 5.3.3 In particular, we note that for any localic group G the iden- 
tity element e: 1 — G is a weakly closed sublocale of G; hence by 1.2.14(v) the 
diagonal G >— G x G is weakly closed, i.e. G is ‘weakly Hausdorff’. However, 
we cannot strengthen this to ‘Hausdorff’ unless our underlying topos is Boolean. 
For a counterexample, let U be any non-complemented subterminal object in S, 
and let G be the discrete locale on the pushout 1 Hy 1 in S, equipped with a 
group structure as a quotient of the cyclic group of order two. 


As another application of 5.3.1, we may deduce 


Corollary 5.3.4 Jf R= X is an open equivalence relation in Loc such that 
R= X x X is an inclusion (and not merely monic), then it is effective. 


Proof We may regard R as a subgroupoid of the trivial groupoid 
(X x X 3 X). Theorem 5.3.1 then says that R — X x X is fibrewise closed 
over X; so the result follows from 5.1.10. T 


Remark 5.3.5 The reader may be concerned about the apparent contradiction 
between 5.3.4 and the example of a non-effective equivalence relation in LH, 
using irrational rotations of the circle S1, which we gave in A1.3.7. For this pair 
R = S!, the induced map R — S! x S! is merely monic in Sp, and not an 
inclusion; but if we replace R with its image R — S! x S! (that is, if we re- 
topologize R as a subspace of S! x S1), we obtain an equivalence relation in Sp 
such that the projections R = S! are still open, but R — S? x S! is fibrewise 
dense over S!. Moreover, although R = S! is effective in Sp, it is not effective 
in Loc (or in Sob): its coequalizer in Sp is (indiscrete, and hence) not sober. 
The answer to the conundrum is that, as a relation in Loc, R — S! x S} is 
not transitive (unlike R — S! x S!!), for exactly the same reason that Q fails 
to be a localic group: the localic pullback R xg: R is larger than the spatial 
pullback, and its image under the ‘composition’ map is the whole of S! x S?. 


We recall that, for an arbitrary localic groupoid G, the category Cont(G) of 
continuous actions of G is defined to be the full subcategory of the internal dia- 
gram category [G, Loc] whose objects are those internal diagrams ((A — Go), @) 
for which A — Go is a local homeomorphism (i.e. an object of Sh(Go)). We 
defined this category, and saw that it is a topos, in B3.4.14(b); also, there is 
a surjective geometric morphism d: Sh(Go) — Cont(G) whose inverse image is 
the forgetful functor. We note next that various properties of this morphism are 
inherited from the corresponding properties of the domain and codomain maps 


of G. 


Lemma 5.3.6 Let G be a localic groupoid such that the domain and codomain 
maps Gi = Go are open (resp. locally connected, connected, atomic, proper. 
tidy). Then the same is true of the morphism d: Sh(Go) — Cont(G). 


Proof First suppose dọ and dı are open. To show that d is open, we shall 
show that its inverse image d* commutes with universal quantification: let 
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f: (A,a) > (B, B) be a morphism of Cont(G), and (A’,a’) a subobject of 
(A, a). It suffices to show that the universal quantification B’ = V;(A’) — B, 
computed in Sh(Go), can be given the structure of a subobject of (B, 3) in 
Cont(G), i.e. that the isomorphism 8: dj(B) — dj(B) restricts to an isomor- 
phism dj(B’) — dj(B’). But this is obvious from the fact that dj and dj 
commute with universal quantification; for we have d}(B’) = Yazd} (A), and 
the isomorphism a identifies dj(A’) and dj(A’) as subobjects of dj(A) S d (A). 

The arguments for the locally connected case are very similar, using the 
fact that a morphism is locally connected iff its inverse image commutes with 
II-functors (3.3.1). For the connected case, let (A, œ) and (B, 8) be two objects 
of Cont(G); we have to show that every morphism f: A — B in Sh(Go) is 
actually a morphism (A,q@) — (B, 8), i.e. that it commutes with a and 8. But 
the composite 


as(A) > at(a) 2d ana) 2s aga) 


must equal dj(g) for a unique g: A — B; and since sj sends both a and 8 
(modulo coherence isomorphisms) to identity maps, we deduce on applying it 
to this composite that f = g. For the atomic case, we argue as in the locally 
connected case to show that d* commutes with the formation of power objects. 

In the proper case, we have to work rather harder. Let U be a subterminal 
object of Sh(Go), and consider the object U = d,,d%(U). Clearly, we have 
U < di„Soxsġdó(U) = U; but we also have 


didy.dgU = do,.djd3(U) © dz.djdj(U) = ddi. dö (U) , 


using the weak Beck—Chevalley conditions for the pullback squares 


d2 d2 
G2 ———> Gı and G2 ———> Gı 
dı dı do do 
dı dı 
G, ———> Go Gy SS Go 


and the identity dodo = dod; : Gz > Go. So U, equipped with this isomorphism, 
may be thought of as an object of Cont(G) (since it is subterminal, the coherence 
conditions on the isomorphism are automatic). And any subterminal object V 
Satisfying V < U and dj(V) = d}(V) also satisfies V < dı di (V) = dy,.d§(V) < 
d,,d3(U); so we may identify U + U with the effect of d, on subterminal objects. 
Now if (A, a) is any object of Cont(G), then on writing (Eo — Go) for the local 
homeomorphism corresponding to A, and (E, — G1) for its pullback along either 
dy or dı, we see that (E; 3 Eo) can be given the structure of a groupoid E in 
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Loc, and that Cont(G)/(A, œ) may be identified with Cont(E). So we may com- 
pute the effect of the direct image of d/(A,a): Sh(Go)/A — Cont(G)/(A, a) on 
subterminal objects in exactly the same way. Taking (A,q@) to be the object of 
objects of a weakly directed internal poset in Cont(G) (so that A is the object 
of objects of a weakly directed internal poset in Sh(Go)), and using the fact 
that dj and dı, both preserve directed Sh(Go)-indexed unions of subobjects, we 
deduce that d, preserves directed Cont(G)-indexed unions of subobjects. 

The argument in the tidy case is very similar, except that we work with 
arbitrary objects of the toposes concerned rather than subterminal objects. We 
leave the details to the reader. 0 


Let G be a localic groupoid in a topos S. The functor BYop/S — Loc(S) 
which sends a topos over S to its localic reflection is a left 2-adjoint (cf. 1.4.8), so 
it preserves colimits; hence, if we compute the (pseudo-)colimit of the simplicial 
locale G, in Loc(S), we obtain the localic reflection of Cont(G) — S. But, since 
Loc(S) is a locally ordered 2-category (in which, moreover, all 2-isomorphisms 
are identities), there is no difference between the ‘descent locale’ Desc(G,) and 
the ordinary colimit of Ge computed in the 1-dimensional category Loc(S), i.e. 
the coequalizer of G; 3 Go (which we shall denote by 7o(G), and call the locale 
of algebraic components or orbit locale of G). Thus we have 


Lemma 5.3.7 For any localic groupoid G, there is a hyperconnected morphism 
Cont(G) — Sh(70(G)). In particular, G is algebraically connected (that is, 
G, 3 Go — 1 is a coequalizer in Loc(S)) iff Cont(G) is hyperconnected over S. 

0 


We use the term ‘algebraically connected’ to distinguish this notion from 
the (topological) connectedness of G (i.e. of the domain and codomain maps 
Gi 3 Go). 

Using 5.3.6 and 5.3.7, we may now prove the converse of 5.2.13: 


Corollary 5.3.8 Let G be an open localic group in S. Then Cont(G) is 
connected and atomic over S (and has a point). 


Proof By 5.3.6, the morphism d: S ~ Sh(Go) — Cont(G) is an open surjec- 
tion, so the fact that Cont(G) — S is atomic follows from 3.5.1(ii). The fact 
that it is connected (in fact hyperconnected) follows from 5.3.7. Q 


Remark 5.3.9 However, we could also prove 5.3.8 without appealing to 5.3.6 
or 5.3.7, by constructing a connected atomic site of definition for Cont(G). We 
did this for a spatial group (in Set) in 2.2.4(c) (cf. also 3.5.9(b)); as we remarked 
there, the construction still works for a localic group, even though there may be 
no points in the cosets of its open subgroups. To see this, observe that if G is an 
open localic group, then for any open subgroup H of G the equivalence relation 
G x H 3 G (given by the first projection and the multiplication) is effective by 
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5.3.4, from which it follows that the square 


qu) 
G x H —m—=>G:H 
(mi m) A 
qxq 


G x G —>G:HxG:H 


is a pullback (where q is the quotient map), and hence by 3.2.23 that G : H (is 
open and) has open diagonal, i.e. it is discrete by 3.1.15. (So the left cosets of 
H exist as points of the discrete locale G : H, even though an individual coset. 
may not contain any points of G. Also, if K is a particular left coset of H, then 
it is straightforward to verify that the image of K x K under the composite 


1x2 m 
G x G —> G x G — > G 


is a subgroup isomorphic to H; thus we may speak of ‘the conjugate gHg`! 
corresponding to a left coset gH’ even if no such g exists.) 

G : H also clearly inherits a left G-action from G itself, so it defines an object 
of Cont(G). Moreover, the objects of this type form an S-indexed separating 
family for Cont(G): if E is any discrete locale equipped with a left action of G, 
then forming the equalizer of the projection G x E — E and the action map 
yields an open sublocale of G x E (open because it is a pullback of the diagonal 
E — E x E) which we may think of as an E-indexed family of open subgroups of 
G, and which is such that the corresponding E-indexed family of quotients maps 
epimorphically to Æ. So we obtain a site of definition for Cont(G) by taking 
the internal full subcategory on the objects G : H; but it is easily seen that 
any morphism between two such objects is an epimorphism in Cont(G), and 
hence that every inhabited sieve is covering. So we have constructed a connected 
atomic site for Cont(G) over S, in the sense of 3.5.8 (it is connected because it 
contains the terminal object 1 = G : G of Cont(G)). Finally, it is easy to see that 
Cont(G) has a point, whose inverse image is the forgetful functor Cont(G) > S. 


Remark 5.3.10 We may also use the ideas of this section to give an alternative 
proof (at least for toposes with natural number objects) of the result of 2.4.14 
that a bounded morphism whose diagonal is an inclusion must be localic. Let 
p: E -+ S be such a morphism, and let f: F — E be an open surjection with F 
localic over S. Then F xe F — F xs F is a pullback of E — E xs E, so it too 
is an inclusion. In terms of the localic groupoid G derived from this surjection, 
this says that (dı, do): G1 + Go x Go is an inclusion in Loc(S); in particular, 
Gi 3 Go is an equivalence relation in Loc(S), and by 5.3.4 it is effective. Thus, if 
we form the coequalizer q: Go — mo(G) of this pair in Loc(S), then q is an open 
surjection by the argument in the proof of 5.1.4, and hence Sh(Go) — Sh(70(G)) 
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is a descent morphism in Yop by 5.1.6. So we have equivalences Sh(79(G)) ~ 
Desc(Sh(G,)) = E. 


We note next that the construction Cont(—) is indeed functorial on the 
category Gpd(Loc) of localic groupoids: if f : G — H is a functor between localic 
groupoids, then it extends to a morphism fe: Ge — H, of simplicial locales in 
the obvious way, and hence induces a morphism of simplicial toposes Sh(G,) — 
Sh(H,). Since the operation of taking (pseudo-)colimits is functorial, this in turn 
induces a geometric morphism Cont(G) — Cont(H), which we shall denote by 
Cont(f). It is not hard to verify that the inverse image of this morphism is 
simply the functor which sends a continuous G-action (A, a) to (f§(A), ff (@)) 
(modulo the appropriate coherence isomorphisms, whose names we suppress as 
usual). The direct image functor is not so easy to describe explicitly, so we shall 
avoid using it whenever possible. 

Cont(—) is even 2-functorial. Here we have to be slightly careful, since we 
have two different sorts of 2-cells in our domain category Gpd(Loc), namely 
the ‘topological’ ones arising from the 2-dimensional structure of locales and the 
‘algebraic’ ones arising from the 2-dimensional structure of groupoids. First, if 
we have two internal functors f,g: G 3 H such that fo < go and fı < gı in 
the usual partial ordering on locale morphisms (cf. 1.1.6), then it is easy to see 
that fn < gn for all n. Since these inequalities induce natural transformations 
fj — gå and ff — gj, as we saw in 1.4.5, we obtain a natural transformation 
Cont(f)* — Cont(g)*. But we also have the notion of internal natural transfor- 
mation between internal functors f,g: G 3 H (which corresponds to the notion 
of simplicial homotopy, as we saw in B2.3.2). And these too induce geometric 
transformations: given 6: f — g, which we recall is a morphism 09: Go — Hı 
satisfying certain equations, and an object (A, œ) of Cont(H), we see that 95(@) 
is a morphism f*(A) = 63dj(A) — 63d5(A) © g*(A) in Sh(Go), and straight- 
forward verification shows that this is the component at (A, œ) of a (necessarily 
invertible) natural transformation Cont(f)* — Cont(g)*. 

We may thus immediately deduce 


Lemma 5.3.11 Suppose we have an algebraic equivalence of groupoids between 
G and H - that is, a pair of functors f:G—H and g: H — G together with 
natural transformations from both fg and gf to the respective identity functors. 
Then the toposes Cont(G) and Cont(H) are equivalent. 


But, as we emphasized back in Section Al.1, we have a weaker notion of 
equivalence of categories, which we there called ‘weak equivalence’: namely a 
functor which is full, faithful and essentially surjective on objects. What should 
this mean for a functor between localic groupoids? As usual, we call a functor 
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f:G—H full and faithful if 
f 


Gom sa 
(do, dı) (do, dı) 
fo x fo 


Go x Go ——~ Ho x Ho 


is a pullback. For ordinary groupoids in S, the statement that f is essentially sur- 
jective on objects would simply assert that the morphism dg: Hı xy, Go —> Ho 
is an epimorphism; but since we are working in Loc, where ‘mere’ epimorphisms 
are not even stable under pullback, we need to demand more than this. In fact it 
makes sense to demand that dg7, should be a pullback-stable descent morphism 
in Loc. To achieve this, we shall in practice need to know that doz, is either 
an open surjection or a proper surjection; we shall call f an open (resp. proper) 
weak equivalence in the two cases. 


Lemma 5.3.12 If f:G—H is either an open or a proper weak equivalence 
of localic groupoids in S, then the induced geometric morphism Cont(G) — 
Cont(H) is an equivalence. 


Proof Since H is a groupoid, the kernel-pair of dg: Hı — Ho may be iden- 
tified with do, dı: H2 3 Hy; and, using the fact that f is full and faithful, an 
elementary diagram-chase shows that the kernel-pair of do™,: Hy Xx Ho Go — Ho 
may be identified with 


1 x do 
hi <i: Gs He Gi 


(d\(1 x fi), dim) 


Now if (E,a: djE — djE) is an object of Cont(G), then the object 13(E) of 
Sh(H, x H, Go) may be equipped with descent data relative to this pair of maps 
by pulling back a along 72: Hı xy, Gi —> G1; so we obtain an object F of 
Sh( Ho) such that (do7)*(F) & rå (E). Further, since the diagram 


do 
G, ————> Go 


(fi, 41) fo 


dori 
Hı X Ho Go =—— > Ho 


commutes, we see that dj Jò (F) = di (E) in Sh(G1); applying sõ to this, we get 
an isomorphism fj (F) = E in Sh(Go). 
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To see that F can be equipped with an H-action, we think of E as the 
domain of a local homeomorphism E — Go; then we can write di (E) and d (E) 
respectively as G1 XG, E and E xg, Gi, and we have a diagram in Loc(S) 


1xa@ 
Hike ke. = — Ss eS 


(dy (71, fi2), 73) 


omi 
Hı XH, Gi Hı xX Ho Go ——~> Ho 
(di(1 x fi), di72) 


where the vertical morphisms are local homeomorphisms and the top row is a 
pullback-stable coequalizer, by the proof of Proposition 5.1.4. (Here @ denotes 
the G-action on E considered as a map G, xg, E — E, i.e. the composite 
of the isomorphism G1 xg, E — E xq, Gi with the first projection.) Clearly, 
the first and second terms in the top row admit left H-actions, and the morphisms 
between them are equivariant for this action; so it ‘descends’ to an action on F. 

The construction above is easily seen to be functorial; so we have defined 
a functor fi: Cont(G) — Cont(H). It is also clear that the composite f* fi is 
isomorphic to the identity. Finally, if (E,a) = f*(F, 8) for some object (F, 8) of 
Cont(H), then we have pullback squares 


p 
Hı xm, ==> Hı xm F — F 


d 
Hike E E 


from which we may deduce that fi(E, a) is isomorphic to (F, 3). So the functors 
f* and fi are inverse up to isomorphism. O 


We may now prove a further representation theorem like those of Sec- 
tion C5.2, which is ‘dual’ to the representation of connected atomic toposes with 
a point given in 5.2.13. Recall that we say an S-topos p: E > S is Hausdorff if 
p is a separated map, i.e. the diagonal E — E xs E is proper. 


Corollary 5.3.13 Let E be a hyperconnected Hausdorff S-topos with a point. 
Then there exists a compact localic group G in S such that E ~ Conts (G). 


We note that this is indeed dual to 5.2.13, since a geometric morphism is 
connected and atomic iff it is hyperconnected and has open diagonal, by 3.1.9(i), 
3.5.4(i) and 3.5.14. 
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Proof Let f: 7 —€ be an open surjection such that F is localic over S; 
as in 5.2.6, we may choose this surjection so that the given point of E lifts to 
a point of F. Applying 5.2.11, we obtain a representation of € as Conts(H), 
where H = (Hı 3 Ho) is a localic groupoid which is open (since its domain 
and codomain maps are pullbacks of f), algebraically connected (by 5.3.7, since 
E is hyperconnected) and such that Ho has a point p: 1 — Ho. Moreover, the 
map (do, d1): Hı — Ho x Ho is proper, since it is a pullback of the diagonal of 
E. Thus if we form the pullback of this morphism along (p, p): 1 — Ho x Ho, 
we obtain a compact locale G, which clearly has a group structure inherited 
from the groupoid structure on H. Also, we have a morphism of groupoids G = 
(G 3 1) — H defined by p and the inclusion G — Hy, which is full and 
faithful by definition. And the equivalence relation on Ho which is the image of 
(do, d,): Hı — Ho x Ho is closed in Ho x Ho, but has open projections since H 
is an open groupoid; so it is effective by 5.1.9, and hence must be the whole of 
Ho x Ho; in other words, Hı — Ho x Ho is surjective. But Go XH, Hı — Ho 
is the pullback of this morphism along p x 1: Hyp = 1 x Ho — Ho x Ho; so it is 
also a proper surjection. Hence G — H is a proper weak equivalence of localic 
groupoids, and so by 5.3.12 we have Conts(G) ~ Conts(H) ~ £. Q 


Remarks 5.3.14 (a) Like 5.2.13, 5.3.13 has a converse. For if G is a 
compact localic group, then it follows from 5.3.6 that the canonical point 
d: S ~ Sh(Go) — Cont(G) is proper; hence its kernel-pair S XCont(G) S is a 
compact S-topos, by the stability of proper maps under pullback. But we also 
have a pullback square 


S Xcont(G) S ———————> Cont (G) 


A 


(d, d) 
S —————_—_> Cont(G) xs Cont(G) 


whose bottom edge is a proper surjection; so by 3.2.23 the diagonal A is proper, 
ie. Cont(G) is a Hausdorff S-topos. But we already know that it is hypercon- 
nected and has a point, by 5.3.7. (Note also that the case of a discrete group G 
was covered in 3.2.24 above.) 

(b) In the particular case when our base topos S is Set, we can say rather 
more about the group G which we obtain from 5.3.13. Since Set is Boolean, 
(the underlying locale of) any localic group is Hausdorff by 5.3.3. But any com- 
pact Hausdorff locale classically has enough points: this is a well-known result 
from locale theory (cf. [520, III 1.10]), but we may also obtain it from 4.1.13, 
which said that every compact Hausdorff locale is a retract of a coherent locale, 
plus D3.3.13 which says that every coherent topos has enough points. Thus 
we may take G to be a compact spatial group. Further, we noted after A2.1.6 
that we may replace any spatial group G by its totally-disconnected reflection 
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G/Ge (where Ge is the connected component of the identity element) without 
changing the topos of continuous actions; and it is again a well-known result of 
general topology that a totally disconnected compact space is zero-dimensional 
(cf. [520, II 4.2]). Further, any zero-dimensional compact spatial group is pro- 
discrete, and indeed profinite, by [520, VI 2.9]. So we conclude that the pointed 
hyperconnected Hausdorff toposes over Set are exactly the toposes of continuous 
actions of profinite groups. 


We may now deduce a result promised in Section C3.4: 
Corollary 5.3.15 Any proper separated map of toposes is tidy. 


Proof If f is proper and separated then so are both halves of its 
hyperconnected-localic factorization, by 3.2.16 and 3.2.25, so it suffices to prove 
the result separately under the hypothesis that f is either localic or hypercon- 
nected. But the localic case was dealt with in 4.1.14. So suppose f: F — €E is 
hyperconnected (hence proper) and separated. In general, it need not have a 
section; but its pullback mı: F xe F — F has one, namely the diagonal, and it 
is still hyperconnected and separated. Hence by 5.3.13 we can represent it as 
Cont z(G) for some compact localic group G in F, and it is therefore tidy by 
3.4.1(b). But 7 is the pullback of f along a proper surjection (namely, f itself); 
so f is tidy by 5.1.7. oO 


It is not only the weak equivalences considered in 5.3.12 that give rise to 
equivalences between toposes of continuous G-actions. There is also a notion of 
‘completion’ for localic groupoids, which has been christened étale completion 
by I. Moerdijk [831], which more or less by definition does not affect the topos 
of continuous G-actions. To define it, let G = (G1 = Go) be a localic groupoid, 
and let d: Sh(Go) — Cont(G) be the geometric morphism whose inverse image 
is the forgetful functor. Then the square 


d 
Sh(G,) ——> Sh(Go) 
do d 


Sh(Go) > Cont(G) 


commutes up to isomorphism, since by definition each object (A, œ) of Cont(G) 
comes equipped with an isomorphism a: dj(A) — dj(A) in Sh(G,). We say that 
G is étale-complete if this square is a pseudo-pullback in BTop/S; and we define 
the étale completion G of G to be (Gi =3 Go), where Sh(G)) = Sh(Go) is the 
kernel-pair of d in BTop/S. Note that this kernel-pair automatically has the 
structure of a groupoid; and that its domain is indeed localic over S, since d is 
localic. 
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Lemma 5.3.16 Let G be an open or proper localic groupoid. Then the canon- 
ical functor G — G induces an equivalence Cont(G) ~ Cont(G). Moreover, G 
is open (resp. locally connected, connected and locally connected, atomic, proper, 
tidy, tidy and connected) if G is. 


Proof Since d: Sh(Go) — Cont(G) is an open or proper surjection by 5.3.6, 
it is a descent morphism in BYop/S; so Cont(G) is equivalent to the category 
of objects of Sh(Go) equipped with descent data relative to this morphism. 
But by definition the latter is precisely Cont(G). The second assertion again 
follows from 5.3.6, plus the stability of the indicated classes of morphisms under 


pullback. o 


Example 5.3.17 To gain some feeling for what étale completion means, we 
consider the particular case of a localic group G in Set. In this case, we saw in 
5.3.9 that we have a site of definition for Cont(G) given by the category whose 
objects are the open subgroups H of G, with morphisms H — K labelled by 
left cosets gK of K such that H C gKg~', and equipped with the coverage 
in which every inhabited sieve covers. If it happens to be the case that every 
open subgroup of G contains an open normal subgroup, then we may use the 
Comparison Lemma to cut down this site to the full subcategory consisting of 
those open subgroups H which are normal, so that the corresponding object 
G : H of Cont(G) admits a group structure. Of course, not every morphism in 
the site is a group homomorphism; but if we further cut down to those which are 
(i.e. those which are labelled by cosets containing the identity element of G), we 
obtain a codirected diagram of discrete groups and surjections, and if we take 
the inverse limit of this system in the category of localic groups, we obtain a 
group (the pro-discrete completion of G) whose category of discrete quotients is 
isomorphic to that of G (so that it gives rise to the same topos of continuous 
actions), and which may be shown to be the étale completion of G in this case. 

In general, we cannot impose a group structure on the inverse limit of the 
discrete quotients G : H; the best we can get is a monoid structure. To define 
this, we may think of a ‘point’ of the inverse limit as a system of left cosets 
(9u H | H € OS(G)) (where OS(G) denotes the poset of open subgroups of G), 
such that gu K = gg K whenever H C K. Of course, this is only a manner of 
speaking, since the inverse limit may not have enough points; but what follows 
can be turned into a rigorous argument — cf. [831]. Then we define the product 
of two such systems (gy H | H € OS(G)) and (ky H | H € OS(G)) to be 


(Sky Hk, ku H | He OS(G)) : 


It is readily verified that this is well-defined, and yields an associative multipli- 
cation on the inverse limit; however, not every ‘point’ of the limit has an inverse, 
in general. But if we cut down to the ‘sublocale of invertible elements’ of this 
monoid, we obtain a localic group which is the étale completion of G. We shall 
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not go into the details of the proof; the interested reader is referred to [831] 
and [833]. 


Finally in this section, we prove a theorem which says in effect that 
BTop/S may be regarded as a category of fractions of the full subcategory 
of Gpd(Loc(S)) whose objects are étale-complete open localic groupoids in S, 
obtained by inverting the open weak equivalences. In order to state this theorem 
formally, we should need to spend a good deal of time developing the theory of 
‘2-categories of fractions’, which seems hardly worth while given the amount of 
use we should make of it. So instead we shall merely state and prove the central 
result from which this representation may be deduced. (The reader who wishes 
to see a more precise formulation in terms of 2-categories of fractions is referred 
to [990].) We already know that every bounded S-topos may be represented 
as Cont(G) for some étale-complete open localic groupoid G, by 5.2.11(i) and 
5.3.16; moreover, every open weak equivalence between such groupoids gives rise 
to an equivalence of toposes, by 5.3.12. It is not hard to verify that open weak 
equivalences (between open groupoids) are stable under composition and pull- 
back, so that we should expect an arbitrary morphism G — H in the category 
of fractions to be representable by a span 


Gelek l 


where g is an open weak equivalence. In fact we shall prove that all geometric 
morphisms between toposes of continuous actions arise from spans of this type. 


Theorem 5.3.18 Let G and H be étale-complete open localic groupoids in S, 
and let f: Cont(G) — Cont(H) be a geometric morphism over S. Then there 
exist an étale-complete open groupoid K and morphisms of groupoids g and h 
such that g is an open weak equivalence, and such that the diagram 


Cont(K) 


Gono Cont(h) 
f 
Cont(G) ——> Cont(H) 
commutes up to isomorphism. 


Proof Given f, we form the pullback 


bio e 3 Sh(Ho) 


| | 


Sh(Go) BL Cont(G) tg Cont(H) 
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in BTop/S. Since the right vertical morphism is localic, so is the left one; so we 
can write E ~ Sh(Ko) for some locale Ko. Moreover, go: Sh( Ko) — Sh(Go) is 
an open surjection, since d: Sh(H )) — Cont(H) is an open surjection by 5.3.6. 
We define K, by the pullback diagram 


ae of 


(di, do) (dı, do) 


Ko x Ko w Go x Go 

in Loc; thus it is immediate that K = (Kı 3 Ko) inherits the structure of a 
groupoid from G, and that g: K — G is full and faithful. Further, the domain 
and codomain maps of K are open, since they are pullbacks of those of G, and 
Gi XG, Ko — Go is an open surjection, since we can factor it as the composite 
of Gi — Go and a pullback of Ko — Go; so g is an open weak equivalence. 

However, since G is an étale-complete groupoid, the geometric morphism 
Sh(G,) — Sh(Go x Go) ~ Sh(Go) xs Sh(Go) is a pullback of the diagonal 
Cont(G) — Cont(G) xs Cont(G); and similarly for H. So a straightforward 
diagram-chase shows that we could equivalently have defined K, by the pullback 


Sh(K;) eee o eee Sh(H;1) 


91 dd, 


dd, f 
Sh(G,) —> Cont(G) —> Cont(H) 


in BZop/S; and that the pair (ho, hı) defines a morphism of localic groupoids 
K — H. The fact that K is étale-complete follows easily from the étale- 
completeness of G and the fact that g is full and faithful. 

It thus remains only to verify that the diagram in the statement of the the- 
orem commutes. Let (B, 8) be an object of Cont(H), and write (A, aq) for the 
object f*(B, 8) of Cont(G). Then the (pseudo-)pullback square defining Ko 
yields a canonical isomorphism g§(A) = hš (B) in Sh( Ko); and, if we use gf (a) 
and h}(@) to equip these two objects with K-actions, it is easy to verify that the 
canonical isomorphism is in fact an isomorphism in Cont(K). So the result is 
established. O 


We remark that 5.3.18 remains true, with exactly the same proof, if we sub- 
Stitute the word ‘proper’ for ‘open’ throughout. However, as we have already 
observed, not every bounded S-topos is representable by a proper localic 
groupoid. 
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Suggestions for further reading: Isbell et al. [486], Johnstone [527, 528], 
Joyal & Moerdijk [554], Moerdijk [831-834, 837], Moerdijk & Vermeulen [858], 
Pronk [990]. 


C5.4 The Freyd representation 


The present section would, in some respects, be more at home in Part D than 
in Part C. It is placed here because it leads to a representation theorem for 
(a large class of) Grothendieck toposes which can be presented in terms of localic 
groupoids; but the methods which lead up to this result are logical rather than 
geometric in character, and the references on which we shall rely are mostly to 
results proved in Part D. (Another difference between this section and the earlier 
ones in this chapter is that, for at least some of the results here, we shall need 
to assume that our base topos S is Boolean; in practice we shall take it to be 
the classical topos of sets, so that ‘Grothendieck topos’ is to be understood in 
its original and more restrictive sense.) 

The representation theorem in question was first obtained by P. Freyd [374], 
and his motivation in proving it also came from logic: to obtain a better under- 
standing of the way in which independence proofs in topos theory, of the sort 
which we discuss in Chapter F4, relate to their set-theoretic counterparts. Freyd 
observed that set-theoretic independence results are commonly proved using a 
combination of three techniques: passing from a given model of set theory to 
a ‘forcing extension’ corresponding to a complete Boolean algebra, passing to a 
‘Fraenkel -Mostowski model’ corresponding to a topological group, and passing 
from a model of ‘non-well-founded set theory’ to its well-founded part, to recover 
the axiom of foundation. The first two of these, at least, have well-understood 
topos-theoretic analogues: they correspond to the passage from a topos S to the 
topos Shs(B) of (canonical) sheaves on a complete Boolean algebra B in S. 
and from S to the topos Conts(G) of continuous G-sets for a topological group 
G in S. In topos-theoretic terms, the third is a bit more nebulous, but it is 
at least an instance of the passage from a topos € to what Freyd called an 
exponential variety, that is a full subcategory closed under arbitrary products. 
coproducts, subobjects, quotients and power objects. (Such a subcategory is 
necessarily coreflective in €, by an argument like that of A4.2.4(d); in fact, the 
exponential varieties in E are (up to equivalence) exactly the toposes F equipped 
with connected atomic geometric morphisms € — F - recall from 3.5.4(i) that 
a connected atomic morphism is hyperconnected.) 

In considering topos-theoretic independence proofs, it obviously makes sense 
(since not all toposes are Boolean) to generalize the first of the three con- 
structions above from complete Boolean algebras to complete Heyting algebras: 
that is, to allow the passage from S to any topos localic over S (cf. 1.4.7). But. 
at least in theory. we have a much larger generalization available: that from com- 
plete Heyting algebras to arbitrary sites. Do we in fact gain anything thereby? In 
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other words, do there exist Grothendieck toposes which cannot be obtained from 
Set by a combination of the three constructions described in the last paragraph? 
Freyd showed that the answer was negative: in fact (and rather surprisingly) we 
do not have to use all possible instances of the second construction, but only one 
particular one, corresponding to the group of all permutations of a countably 
infinite set. On the other hand, the first two constructions alone do not suffice 
to reach every Grothendieck topos from Set (though they do suffice to reach all 
Boolean Grothendieck toposes); the first part of this section will be devoted to 
characterizing the ones that can be reached. For this purpose, we need to recall 
some facts about decidable objects and their quotients. 

We recall from A1.4.15 that an object A of a coherent category (in partic- 
ular, of a Grothendieck topos) is said to be decidable if the diagonal subobject 
Aa: Am Ax A has a complement. We recall also that the class of decidable 
objects in a Grothendieck topos is closed under finite products, arbitrary coprod- 
ucts and arbitrary subobjects; and it follows easily from this that the class of 
quotients of decidable objects (i.e. those A for which there exists a decidable B 
and an epimorphism B -» A) is closed under finite products, arbitrary coprod- 
ucts, arbitrary subobjects and arbitrary quotients. As we observed in A4.2.4(d), 
this implies that the full subcategory Ega of quotients of decidable objects in E 
is a topos, and that there is a geometric morphism (in fact a hyperconnected 
one) E — Eqa whose inverse image is the inclusion. Here we shall be concerned 
(initially, at least) with the case when Ega is the whole of E. 


Proposition 5.4.1 Fora Grothendieck topos €, the following are equivalent: 


(i) Every object of E is a quotient of a decidable object. 

(ii) E has a separating set of decidable objects. 
(iii) E has a bound over Set (in the sense of B3.1.7) which is decidable. 
(iv) E€ has a pre-bound over Set (in the sense of D3.2.3) which is decidable. 


Proof (i) => (iii): Given a separating set {G; | ¿ € I} for E, let G be the 
coproduct of the G;, and F —> G an epimorphism with F decidable. Then G is 
a bound for E, and hence so is F. 

(iii) > (iv) since any bound is a pre-bound; and (iv) > (ii) because an object 
G is a pre-bound iff the subobjects of its finite powers form a separating set, and 
the latter are all decidable if G is. 

Finally, (ii) = (i) since every object can be expressed as a quotient of a 
coproduct of members of the separating set, and coproducts of decidable objects 
are decidable. O 


If the equivalent conditions of 5.4.1 hold, we shall say that the topos € is 
locally decidable. 


Examples 5.4.2 (a) Trivially, any Boolean topos is locally decidable. 
(b) Any localic topos is locally decidable, since such a topos has 1 as a 
bound (or 0 as a pre-bound), and these are both decidable. More generally, if 
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E is locally decidable and f: F — € is a localic geometric morphism, then F is 
locally decidable, since f* maps (pre-)bounds for € to (pre-)bounds for F by 
D3.2.4, and it preserves decidability. 

(c) Let C be a small category. Then the functor category [C,Set] is locally 
decidable iff every morphism of C is monic. For the latter condition is equivalent, 
by A1.4.16, to saying that the representable functors C (A, —), A € ob C, are all 
decidable, and they form a separating set for |C, Set]. Conversely, if [C, Set] is 
locally decidable then the representable functors must be quotients of decidable 
objects; but, since they are projective, this forces them to be subobjects of 
decidable objects and hence decidable. 


Remark 5.4.3 In connection with Example 5.4.2(c), we note that local decid- 
ability is ‘dual’ to the property of being an étendue, introduced in 5.2.4: for a 
small category C, the functor category [C, Set] is locally decidable iff [C°?, Set] 
is an étendue. In particular, the classifying topos Set[D] = [Set;,,Set] for 
decidable objects (cf. D3.2.7) is locally decidable but not an étendue, and the 
classifying topos Set(K] = [Set je, Set] for Kuratowski-finite objects (cf. D3.2.10) 
is an étendue but not locally decidable. (Further evidence that local decidability 
is dual to being an étendue may be found by comparing condition (v) of 5.4.4 
below with Lemma 5.2.5.) 


If € is locally decidable, then it follows from 5.4.1(iv) and D3.2.6 that there 
exists a localic geometric morphism from E to the classifying topos Set[D] for 
decidable objects. The converse follows from 5.4.2(b) and 5.4.3. But in fact we 
can do better than this. If B is a (pre-)bound for E, then so is BUN, where N is 
the natural number object; and BHN is decidable if B is (recall that N is always 
decidable, by A2.5.11). But BUN is also infinite in the sense of D3.4.10, and so it 
can be classified by a (localic) geometric morphism from £ to the classifying topos 
Set(D,.| for infinite decidable objects. And in D3.4.10 we identified the latter 
topos as Cont(Go), where Go is the group of all permutations of a countably 
infinite set (with its ‘natural’ topology). Thus we have established most of the 
following result: 


Theorem 5.4.4 For a Grothendieck topos €, the following are equivalent: 


(i) E€ is locally decidable. 
(ii) There exists a localic geometric morphism from E to a Boolean topos. 
(iii) There exists a localic geometric morphism E — Cont(G), for some spatial 
group G. 
(iv) There exists a localic geometric morphism E — Cont(Go), where Go îs 
the (spatial) group of all permutations of N. 


(v) There exists a small site of definition (C, J) for E such that every morphism 
of C is epic. 
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Proof (i)= (iv) follows from the remarks above; (iv) => (iii) and (iii) (ii) are 
trivial; and (ii) = (i) follows from 5.4.2(a) and (b). 

For (iii) > (v), we recall that in 2.5.8(e) we constructed a site (C, J) for 
Shcont(c)(X), where X is an internal locale in Cont(G): the objects of C are 
pairs (H,U) where H is an open subgroup of G and U is an element of O(.X) fixed 
by H, and morphisms (H,U) — (K,V) correspond to left cosets gK of K such 
that H C gKg™! and U C gV. A sieve R on (K,V) is covering iff the set of those 
U € O(X) for which there exists H C K such that K: (H,U) —> (K,V) € R has 
join V. It is easy to verify that every morphism of C is epic: for the composite of 


gk hL 
(H,U) —> (K,V) —> (L,W) 
is given by ghL, and clearly ghL = gh’L implies hL = KWL. 
Finally, (v) = (i) follows from 5.4.2(b) and (c), since inclusions are localic. 
O 


Corollary 5.4.5 Any Boolean Grothendieck topos is equivalent to one of the 
form Shcont(c)(X ), where G is a spatial group and X is a Boolean internal locale 
in Cont(G) (i.e. one such that O(X) is a complete Boolean algebra). Moreover, 
we may take G to be the particular group Go mentioned above. 


Proof A Boolean topos € is locally decidable, so by 5.4.4 it is representable 
as Shcont(G,)(X) for some internal locale X in Cont(Go). And the Booleanness 
of E forces X to be Boolean. O 


The next remark provides a link with the ideas that we explored in the earlier 
Sections of this chapter. 


Remark 5.4.6 If G is a discrete group and X is a locale in [G, Set], then we 
may identify Shjg,set}(X) with the topos of continuous actions of the localic 
groupoid (G x Xo 3 Xo), where Xo is the ‘underlying locale in Set’ of X, i.e. 
the locale corresponding to the frame obtained by forgetting the G-action on 
O(X), and the domain and codomain maps G x Xo =3 Xo are respectively given 
by the second projection and the G-action. (Note that this is an atomic localic 
groupoid in the sense of 5.2.11(ii); that is, its domain and codomain maps are 
local homeomorphisms.) 

We may similarly describe Shcontig)(X), for an internal locale X in 
Cont(G), in terms of a localic groupoid, as follows. Since G acts on O(G) 
by translation, we may cut down to its ‘continuous part’ 


O(G) = {U € O(G) | H -U =U for some open subgroup H} 


and regard the latter as an internal frame in Cont(G). If we denote the cor- 
responding internal locale by G, then we have an action map G x X > X 
in Loc(Cont(G)), and hence an internal localic groupoid (Gx X 3 X) in 
Cont(G). Taking the MacNeille completions of the corresponding frames, and 
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forgetting the G-actions, yields a localic groupoid in Set; and it may be shown 
that Shcont(c)(X) is equivalent to the topos of continuous actions of this localic 
groupoid. 


If € is a Grothendieck topos which is not locally decidable, can we find a 
‘good approximation’ to it which is locally decidable? In one direction, this is 
easy: we may now re-interpret A4.2.4(d) as follows. 


Proposition 5.4.7 For any Grothendieck topos €, there is a locally decidable 
topos Eqa and a hyperconnected geometric morphism E — Ega which is universal 
amongst geometric morphisms from €E to locally decidable toposes. 


Proof The fact that €q is a topos, and that the inclusion Ega — E is the 
inverse image of a geometric morphism, was proved in A4.2.4(d). This morphism 
is hyperconnected (and not merely connected) because Ega is closed under arbi- 
trary subobjects and quotients in €. So it only remains to establish the universal 
property. But if f: E — F is a geometric morphism where F is locally decid- 
able, then since f* preserves decidability and epimorphisms it takes values in 
the subcategory Eqa, and hence we have a factorization (unique up to unique 
isomorphism) of f through E —> Ega- 


From ‘the other side’, the remark after 5.2.4 may be combined with 5.4.2(c) 
to yield a proof that every Grothendieck topos admits a hyperconnected mor- 
phism from a locally decidable one. But Freyd proved a stronger result: for every 
Grothendieck topos E, there is a locally decidable F and a connected atomic 
morphism F — E. Before proving this, however, we shall digress to consider 
the ‘ordered’ analogues of the results established so far in this section. For this 
purpose, we shall assume that our base topos Set satisfies the Linear Ordering 
Principle (the assertion that any set can be linearly ordered); this is well known 
to be strictly weaker than the full axiom of choice, but it implies the axiom of 
choice for families of finite sets. 

We say an object A of a topos E is orderable if it can be given the struc- 
ture of a model of the theory L of linearly ordered sets described in D3.4.11; 
and we say € is locally orderable if it has an orderable pre-bound. Clearly. 
an orderable object A is decidable, since the axioms of L imply that A x A 
is the disjoint union of the order-relation, its opposite and the diagonal; so 
any locally orderable topos is locally decidable. It is also clear that a coprod- 
uct A; I Ag of two orderable objects is orderable (with the ordering in which 
all of A; precedes all of Az), and so is their product A, x A (with the 
lexicographic ordering). Hence, if B is an orderable pre-bound for E, so is 
(B IL 1) x Q, where Q is the object of rational numbers in E (cf. D4.7.1): 
but the latter pre-bound is actually a model of the theory Læ of dense linear 
orders without endpoints, considered in D3.4.11. Thus we obtain an analogue 
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of 5.4.4: 


Theorem 5.4.8 The following conditions on a Grothendieck topos E are 
equivalent: 


(i) E is locally orderable. 
(ii) Every object of E is a quotient of an orderable object. 


(iii) There exists a localic geometric morphism from E to a topos in which every 
object is orderable. 


(iv) There exists a localic geometric morphism from E to the topos Cont(G,), 
where G, is the group of order-preserving permutations of Q (with its 
‘natural’ topology). 


Proof (iii) = (ii) follows from the fact that inverse image functors preserve 
orderability, and subobjects of orderable objects are orderable. (ii) = (i) fol- 
lows from the fact that any object mapping epimorphically to a pre-bound is 
a pre-bound. (i) = (iv) follows from the remarks above, plus the fact that we 
may identify the classifying topos Set{L..] with Cont(G,) (see D3.4.11). So, 
to complete the circle, it suffices to prove that every object of Cont(G,) is 
orderable. 

Since Cont(G) is atomic, we know that every object of it can be written as 
a set-indexed coproduct of atoms. Of course, in general the proof that an infinite 
coproduct of orderable objects is orderable requires the full axiom of choice in 
Set, in order to choose a linear ordering of each summand (and a linear ordering 
of the set indexing the coproduct); however, it turns out that in Cont(G,) each 
atom has a (nonzero) finite number of distinct linear orderings, so we need only 
the axiom of choice for families of finite sets - which, as we remarked above, 
follows from the Linear Ordering Principle. 

By D3.4.11, we may also regard Cont(G}) as the topos Sh(Ord*;,,. J), where 
Ord;,,, is the category of finite linearly ordered sets and order-preserving injec- 
tions between them, and J is the atomic coverage on Ord), (i.e. the coverage in 
which every inhabited sieve covers). In terms of this description, the atoms are 
the representable functors An = Ordym (n,—), where n denotes an n-element 
linearly ordered set. It is easy to see that A,, can be made into a subobject (in n! 
possible ways) of ( A1)”; and Aj is the underlying object of the generic model of 
Lx; hence every An is orderable. The fact that each A, has finitely many linear 
orderings follows from the fact that An x A, is a finite coproduct of atoms (since 
(A,)?” is): so it has only finitely many subobjects. (In fact there are exactly 
2"n! linear orderings of An; for a proof of this, and further information about 
the structure of linearly ordered objects in Cont(G1), see [529].) o 


Not every Boolean topos is locally orderable: for example, in Cont(Go) the 
only orderable atom is the terminal object 1, from which it follows that the only 
orderable objects are the constant objects p* I (where p: Cont(Go) — Set is the 
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unique geometric morphism), and these do not include a pre-bound. (More gen- 
erally, if G is any group containing non-identity elements of finite order, then 
[G, Set] is not locally orderable.) Perhaps more surprisingly, the conjunction of 
local orderability.and Booleanness does not imply that every object is order- 
able; the following counterexample is essentially due to G. P. Monro [860] (see 
also [524]). 


Example 5.4.9 Let © be the signature with one sort and three binary relation 
symbols <, = and # (which, as usual, we write in infix rather than prefix 
notation), and let T be the coherent theory over £ with the following axioms: 


(a) the axioms which say that < is a linear order without endpoints; 


(b) the axioms which say that = is an equivalence relation, and # is its 
complement; 


(c) the sequents 


>s 


(T Has en Gn) AWH) 


t 


i 


for all n > 0, and 
((x < y) Fs y, (Bw)((z < w) A (w < y) A (w = 2))). 


Thus a T-model is a linearly ordered object without endpoints, equipped with a 
complemented equivalence relation which has infinitely many equivalence classes, 
and such that each equivalence class is dense in the linear order (so that, in 
particular, the linear order is dense). Thus T may be regarded as a propositional 
extension of Læ, and so we have a localic morphism Set[T] — Set/L..] by 
B4.2.12; in particular, Set[T] is locally orderable. But it is easily seen that 
T satisfies the conditions of D3.4.6(ii), and so its classifying topos is Boolean; 
specifically, it is Cont(G2) where G2 is the automorphism group of the unique 
(up to isomorphism) countable T-model M in Set (cf. 5.2.14(c)), and the generic 
model is M itself with the canonical action of G2. Now consider the quotient 
N = M/{=]; of course, G2 also acts continuously on this quotient, and (as an 
object of Cont(G2)) it is the quotient of the generic T-model by =. It is easy to 
see that, given any open subgroup H of G2, we can find two elements z, y of N 
and an element of H which interchanges them; in fact we can take x and y to be 
any two =-classes which do not contain elements fixed by H. It follows that N 
does not admit any linear ordering which is invariant under an open subgroup 
of G2; hence, as an object of Cont(G2), N is not orderable. 


Remark 5.4.10 By analogy with 5.4.2(c), it is easy to see that a functor 
category [C, Set] is locally orderable iff each representable functor C (U, —) admits 
a total ordering, iff each morphism of C is monic and we can totally order each 
hom-set of C in such a way that all mappings of the form a+ ba (for a fixed 
morphism b) are order-preserving. We note that this holds if C is the free category 


C5.4 THE FREYD REPRESENTATION 799 


on a directed graph G, at least provided we have a total ordering of the arrows of 
G, since we may then use reverse lexicographic ordering to order the morphisms 
of C (that is, finite formally-composable strings of arrows of G) with a given 
domain. Thus in particular the étendue constructed in the proof of 5.2.3 is locally 
orderable; so that result may be strengthened to say that every Grothendieck 
topos admits a hyperconnected morphism from a locally orderable étendue. 

We may also extend 5.4.8, by adding to its equivalent conditions one which 
corresponds to (v) of 5.4.4: a Grothendieck topos is locally orderable iff it is defin- 
able by a site (C, J) where CP satisfies the condition mentioned in the previous 
paragraph. For if our locally orderable topos € is represented as Shcont(c,)(X), 
then we may use the Comparison Lemma 2.2.3 to cut down the site of 2.5.8(e) 
to its full subcategory on those objects (H,U) for which H is the stabilizer of a 
finite subset {qi,..-, qn} of Q. Now left cosets of such an H correspond to order- 
preserving injections g: {q1,..-, qn} — Q, and we may order these by imposing 
the lexicographic ordering on the n-tuples (g(q1),..-,9(@n)) (where for definite- 
ness we suppose that the elements q,...,@n are listed in increasing order). It is 
easily verified that this defines a total ordering on the hom-sets of C in such a 
way that composition with a fixed morphism is order-preserving. 


We now return to the proof of Freyd’s representation theorem. 


Theorem 5.4.11 Let € be a Grothendieck topos. Then, fori = 0,1, there exist 
spans 
fi Ji 
E <«—— F; —_—> Cont (G;) 

in BTop/Set such that f; is connected atomic and g; is localic, where Go (resp. 
Gı) is the group of all permutations of N (resp. the group of all order-preserving 
permutations of Q), with its natural topology. Equivalently, E may be repre- 
sented as an exponential variety in a locally decidable (resp. locally orderable) 
Grothendieck topos. 


Proof We shall give two proofs of the theorem: the first is (a reworking of) 
Freyd’s original proof, and the second is a more ‘conceptual’ proof, which is due 
to A. Joyal. (In fact the second proof is really no more than a translation of the 
first into the language of geometric theories: the relationship between the two is 
much like that between 5.2.1 and 5.2.8(c).) It is also worth noting that the case 
îi = 0 of the theorem follows from the case i = 1, since the topos Cont(G}) is 
Boolean and so admits a localic morphism to Cont(Go) by 5.4.5; however, in 
both cases we shall give the proof for i = 0 in detail and merely sketch the (very 
Similar) argument for i = 1. 

(i) Let B be a pre-bound for € over Set, that is, an object such that the 
subobjects of its finite powers form a generating set for E. For technical reasons, 
we need to assume that B is well-supported, so that every injection u: [m] — [n] 
between finite sets induces an epimorphism BY: B” —> B™ (here we are writing 
[n] for the n-element set {0,1,...,2—1}). We shall take our site (C, J) for E to 
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be the full subcategory of € on the subobjects of finite powers of B, equipped 
with the coverage induced by the canonical coverage on £. 
We now define a new site (D, K} as follows: objects of D are diagrams 


f g 


A —— A’ > B" 


in E, where A and A’ are objects of C and n is a natural number. We shall denote 
such an object by a quintuple (A, f, A’, g, n). Morphisms (A), f1, A1, g9, nı) > 
(Ag, fo, Ah, 92, n2) are pairs (h,u) where h: Ay > Ag in € and u: [ng] — [n] is 
an injection such that B”: B”! — B”? maps the subobject gı into g2 and the 
diagram 


fi g 


A, — A; >> Bt 


h B" 
fa 


Ag ——-> Ab >——> _ B™ 
92 


commutes. We have a functor P:D-—C which sends (A,f,A’,g,n) to A 
and (h,u) to h; it is easy to see that P is a fibration, since if we 
are given a morphism h: C — P(A, f,A’,g,n) in C we have a prone lifting 
(h, liny): (C, fh, A’, 9,n) — (A, f, A',g,n). Also, P has a full and faithful right 
adjoint T given by T(A) = (A — 1 > B®); so if we define the coverage K 
on D by saying that P creates covers (i.e. that a sieve on an object is cov- 
ering iff its image under P generates a J-covering sieve), then it follows from 
3.5.10 that the fibration of sites (P,T) induces a connected atomic morphism 
Sh(D, K) > SH(C, J) > E. 

Next, we observe that the full subcategory D’ of D consisting of those objects 
(A, f, A’, g, n) for which f is an identity morphism is K-dense in the sense of 2.2.1: 
for, given any object (A, f, A’,g,n), we can choose a monomorphism h: A — G™ 
for some m, and then we have a morphism 


h, 
had j (h, gf) ere 


1 BY 


A ——> A’ eee B” 
in D (where u: [n] > [m + n] is the map 7 +» m + j), which clearly generates a 
K-covering sieve. Thus by 2.2.3 Sh(D, K) is equivalent to Sh(D’, K’), where K’ 
is the coverage induced on D”. 
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Now let Ln, for each n, denote the frame of subobjects of B” in £, and 
consider the sequence of embeddings 


Lo >——> L; >—> L >——> ::: 


induced by pullback along the projections B"+! — B” induced by the inclusions 
in] — [n + 1} in Sets. Let L be the colimit of this sequence (in Set, or in the 
category of Heyting algebras). Given an element g: A — B” of Ln for some n 
and a permutation u of N, we may choose m such that u maps [n] into [m] and 
define u- f to be the subobject of B™ obtained by pulling back f along B”; 
this clearly defines a continuous action of the group Go on L, in which Ln is 
exactly the subset of elements fixed by the subgroup Go,, of all permutations 
fixing {0,1,...,n — 1} pointwise. (The fact that Ln is exactly this subset, rather 
than merely contained in it, is a consequence of the fact that the morphisms 
B” are epic in E: if an object of Sube(B™), for some m > n, has the same 
pullback along B” and B” whenever u and v agree on [n], then we may equip 
it with descent data relative to the epimorphism Bt, where i: [n] — [m] is the 
inclusion. ) 

In fact it is not hard to see that L is an internal frame in Cont(Go), i.e. 
(cf. 2.5.8(e)) it is the ‘continuous part’ of its MacNeille completion, regarded as 
a frame in [Go, Set]. Moreover, if we construct the site described in 2.5.8(e) for 
the topos Shgont(g,)(L) (but restricting its objects to those (H,U) for which 
H is one of the particular open subgroups Gon — these clearly define a dense 
subcategory), we obtain a site equivalent to (D’, K’). Thus we have Sh(D, K) > 
Sh(D’, K’) ~ Shcont(Go)(L), as required. 

For the case i = 1, we proceed in exactly the same way, except that in the 
definition of the morphisms of D we restrict ourselves to pairs (h, u) for which 
u is an order-preserving injection, and we replace the sequential colimit in the 
definition of L by a directed colimit indexed by the poset of finite subsets of Q. 

(ii) For the second proof, we first note that, if E is a topos satisfying the 
conclusion of the theorem and h: E’ — € is a localic geometric morphism, then 
E’ also satisfies the conclusion; for we may form the pullbacks 


1 hi 
F| —> F, 


bi fr 


E' = E 
in which f/ is connected and atomic by 3.3.15 and 3.5.12, and h; (and therefore 
the composite g;h;) is localic by B3.3.6. So, by D3.2.6, it suffices to prove the 
result for some particular topos Set[T], where T is a single-sorted geometric 
theory such that every Grothendieck topos has a pre-bound which can be given 
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the structure of a T-model. In fact, as in the second part of 5.2.8(c), we shall 
take T to be the theory O; of inhabited objects. 

We now proceed in a manner which is reminiscent of 5.4.9 above. For the 
case i = 0, we consider the theory To over the signature with two binary rela- 
tion symbols # and =, having axioms which say that # is the complement of 
the equality relation, that = is an equivalence relation, that the object under 
consideration is inhabited and, for each n > 1, the sequent 


(7 Foruan (Y) (w =a) \w#2))) 


i=l 
which says that each =-class is infinite. Let M denote the generic To-model in 
Set[To]; then M is clearly decidable and infinite, so it is classified by a geometric 
morphism Set[To] — Set{D,,] ~ Cont(Go), which is localic by D3.2.6. On the 
other hand, the quotient of M by the equivalence relation [=] is clearly an 
inhabited object of Set|To], so it is classified by a geometric morphism Set[To] — 
Set{O,]; we shall show that this morphism is connected and atomic. 

Viewed as a topos over Set[O;], Set[To] may be regarded as the classifying 
topos for the theory of ‘G-indexed families of infinite decidable objects’ where 
G is a particular object of the base topos, namely the generic inhabited object 
itself. (In fact, the pullback of Set[T 9] — Set{O] along an arbitrary localic 
morphism f: E — Set{O;] may similarly be regarded as classifying the theory of 
B-indexed families of decidable objects, where B is the pre-bound for € classified 
by f; so the argument which follows could be applied directly to this pullback. 
We note also that Set[To] — Set[O,] is a (covariant) partial product in the sense 
of B4.4.14, namely the partial product of the opfibration Set{O,|/G — Set[O,} 
and the topos Set[D,.]; but that is not relevant to our present concerns.) It is 
easy to see, generalizing the arguments of D3.2.7 and D3.4.10, that a site for 
this topos may be obtained by taking the internal category C whose objects 
(indexed by the list object LG in Set[O;]) are all morphisms [p] — G from finite 
cardinals to G, and whose morphisms are monomorphisms (over G) between 
their domains, and imposing the coverage T on C°P in which every inhabited 
sieve covers. Now it is also easy to see that (C°P, T) is a connected atomic site 
in the sense of 3.5.8; so Set[T | — Set[O,] is connected and atomic. 

For the case 7 = 1, we proceed similarly, using the theory Tı whose signature 
has two binary relation symbols < and =, and axioms which say that < is a lin- 
ear order without endpoints, that = is an equivalence relation, that the object 
under discussion is inhabited, and that each =-class is dense in the linear order 
(cf. 5.4.9). Once again, the generic model M of T, is classified by a localic geo- 
metric morphism Set[T,] — Set[L,.] ~ Cont(G), and the quotient M/[=] 
is classified by a connected atomic morphism Set[T,] — Set[O,}. We omit the 
details, which are similar to those already given for i = 0. z 


Remark 5.4.12 If € is any Grothendieck topos which is not locally decidable. 
then the covering maps f;: Fi — E constructed in 5.4.11 provide examples of 
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(bounded) connected atomic morphisms with no sections (cf. 3.5.7 and 5.2.14(a)). 
We can see this directly, by observing that the geometric theories (relative to 
E£) occurring in the second proof of 5.4.11 cannot have models in E; but it is 
also clear that a section of f;, if it existed, would necessarily be localic (since 
its direct image functor would be faithful, cf. A4.6.2(a)), and so it would force 
E to be locally decidable. In the same way, taking i = 1 and € = Cont(Go), we 
get an example of such a morphism whose codomain is Boolean. In D3.4.14 we 
provide a (more complicated) example whose codomain is Set. 


Suggestions for further reading: Freyd [374], Johnstone (521, 524, 529}. 
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FIRST-ORDER CATEGORICAL LOGIC 


D1.1 First-order languages 


In this chapter we shall examine the relationship between topos theory and 
first-order predicate logic. The languages underlying this kind of logic contain 
formal expressions called terms, which denote individuals, and formal expres- 
sions called formulae, which denote predicates that make assertions about the 
individuals. Compound terms and formulae are built up using various logical 
operators. The expressions of such a language can be assigned meanings in a 
category: the terms are interpreted as morphisms and the formulae as subob- 
jects, in a way that respects the logical structure of compound expressions. This 
categorical semantics requires various kinds of categorical structure to be present 
in the category, mirroring the logical operations which are allowed in the lan- 
guage, and will be described in the next section. In this section we shall describe 
the formal syntax of first-order languages. 

Since our concern in this book is with toposes, we shall be considering those 
logical operations which have a meaningful interpretation using the categorical 
structure of toposes (both elementary and Grothendieck toposes): this is a very 
rich collection of logical operations. The formulae are built up using logical con- 
stants (true and false), connectives (conjunction, disjunction, implication and 
negation) and quantifiers (existential and universal), starting from some atomic 
relations (which will always include the distinguished relation of equality). The 
languages will be possibly infinitary, that is, allow infinite conjunctions or dis- 
Junctions; but they will be first-order rather than higher-order, meaning that 
quantification is over variable individuals rather than over subsets of individu- 
als, functions of individuals, or iterates of such power and function constructions. 
(Higher-order predicate logic will be considered in Chapter D4.) However, we will 
allow individuals to be grouped into different sorts. Thus the languages consid- 
ered in this chapter will contain expressions in infinitary, first-order, many-sorted 
predicate logic with equality. Each language may involve certain non-logical sym- 
bols denoting basic sorts and basic functions and relations of various kinds. This 
information constitutes the signature of the language. 
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Definition 1.1.1 A (first-order) signature X consists of the following data. 


(a) A set &-Sort of sorts. 


(b) A set =-Fun of function symbols, together with a map assigning to each 
f € ¥-Fun its type, which consists of a finite non-empty list of sorts (with 
the last sort in the list enjoying a distinguished status): we write 


f:Ai-:-An > B 


to indicate that f has type Aı,..., An, B. (The number n is called the arity 
of f; in the case n = 0, f is more usually called a constant of sort B.) 


(c) A set D-Rel of relation symbols, together with a map assigning to each 
R € -Rel its type, which consists of a finite list of sorts: we write 


Resa sk 


to indicate that R has type A;,...,A,. (The number n is called the arity 
of R; in the case n = 0, R is more usually called an atomic proposition.) 


In some contexts, it is convenient to assume that the set /-Sort is equipped 
with finite products (that is, that we are given a set of ‘ground sorts’, and 
further sorts can be built up from these as (formal) finite products — including 
the empty product); this has the advantage that we can take all our function 
symbols and relation symbols to be unary. For most purposes, it is merely a 
matter of taste which approach one adopts in the first-order context; but when 
we come to higher-order logic in Chapter D4, it will be natural to adopt the 
second approach. 

For each sort A of a signature © we assume given a supply of variables of 
sort A. (The variables of sort A are usually assumed to form a (fixed) countably 
infinite set V4, but this doesn’t really matter: the only important thing is that, 
in any context, we are free to introduce a new variable of sort A, i.e. one which 
hasn't already appeared in any of the terms or formulae under consideration.) 
Then we have 


Definition 1.1.2 The collection of terms over X is defined recursively by the 
clauses below; simultaneously, we define the sort of each term and write t: A to 
denote that t is a term of sort A. 


(a) x: A, if x is a variable of sort A; 


(b) f(ti,...,tn): B, if f: Ay--- An — B is a function symbol and tı : A1,- 
tn: Ån. 


If f is a constant (that is, a function symbol of arity 0), we commonly write f 
rather than f() for the term obtained by applying f (as in (b) above) to the 
empty string of terms. 
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Next, we turn to the formulae, that is the logical expressions denoting 
predicates; the following definition is complicated by the fact that we are simulta- 
neously defining several different fragments of logic (that is, classes of formulae) 
with which we wish to work, but each individual clause in the definition is 
straightforward enough. (We should perhaps mention the long-standing tradi- 
tion in logic whereby formulae are called ‘well-formed formulae’ or even ‘wffs’; 
but since there is no (useful) notion of an ‘ill-formed formula’, this terminology 
seems redundant.) 


Definition 1.1.3 Consider the following clauses for recursively defining a 
class F of formulae over X, together with, for each formula ¢, the (finite) set 
FV (¢) of free variables of ¢. 


(i) Relations: R(t1,...,tn) is in F, if R — A,---A, is a relation symbol 
and tı: Aj,...,t,: An are terms; the free variables of this formula are all 
the variables occurring in some t;. (Once again, if R has arity 0 we write 
simply R rather than R().) 

Equality: (s = t) is in F if s and t are terms of the same sort; FV(s = t) 
is the set of variables occurring in s or t (or both). (Sometimes it is 
convenient to distinguish between the equality predicates associated with 
different sorts; if we wish to do this, we shall write (s =4 t), where A is 
the sort of s and t.) 

(iii) Truth: T is in F; FV(T) = 0. 

(iv) Binary conjunction: (@ A Y) is in F, if @ and y are in F; FV(¢ A Y) = 
FV($) U FV (4). 

(v) Falsity: L isin F; FV(L) = @. 

(vi) Binary disjunction: (ġ V w) is in F, if ọ and w are in F; FV(@V y) = 
FV(¢) UFV(w). 

(vii) Implication: (6=> yp) is in F, if ọ and y are in F; FV(ġ> 4) = FV(¢) U 
FV (4). 

(viii) Negation: -¢ is in F, if @ is in F; FV(-¢) = FV(¢). 

(ix) Existential quantification: (Ar)¢ is in F, if ¢ is in F and z is a variable; 
FV((A4r)¢) = FV(¢) \ {x}. (Again, it is sometimes convenient to specify 
the sort of the variable being quantified; if we wish to do this, we write 
(3x: Ajg.) 

(x) Universal quantification: (Vr) (or (Vx: A)¢, if it is necessary to specify 
the sort of z) is in F, if ¢ is in F and z is a variable; FV((Vr)¢) = 
FV(¢) \ {x}. 

(xi) Infinitary disjunction: Vier ¢ isin F, if I is a set, ġ, isin F for eachi € I 
and U;<; FV(¢i) is finite; in which case the latter set is FV (Vez ¢i)- 

(xii) Infinitary conjunction: \,-,¢: is in F, if I is a set, ġ, is in F for each 
i € I and U,<; FV(¢,) is finite; in which case the latter set is FV(A <7 ¢i)- 


(ii 


x 
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Using these clauses, we make the following definitions. 


(a) The set of atomic formulae over © is the smallest set closed under clauses 
(i) and (ii). 
(b) The set of Horn formulae over © is the smallest set closed under clauses 
(i)-(iv). 
(c) The set of regular formulae over © is the smallest set closed under clauses 
(i)-(iv) and (ix). 
(d) The set of coherent formulae over © is the smallest set closed under clauses 
(i)-(vi) and (ix). 
(e) The set of first-order formulae over & is the smallest set closed under clauses 
(i)-(x). 
The final two definitions involve clauses (xi) and (xii), which are infinitary in 
nature and result in proper classes (rather than sets) of formulae. 


(f) The class of geometric formulae over ¥ is the smallest class closed under 
clauses (i)-(vi), (ix) and (xi). 

(g) The class of infinitary first-order formulae over © is the smallest class closed 
under clauses (i)—(xii). 


We recall that the names ‘regular’, ‘coherent’ and ‘geometric’ were all intro- 
duced in Chapter Al as names for particular classes of categories with structure. 
The re-use of these names here is not accidental: as we shall see in the next 
section, regular (resp. coherent, geometric) formulae are precisely those which 
can be interpreted in regular (resp. coherent, geometric) categories. (Similarly, 
first-order formulae are those which can be interpreted in Heyting categories; we 
should perhaps have called the latter ‘first-order categories’, but that name is 
potentially confusing since every category is a model of a first-order theory - see 
1.1.7(e) below.) There is one important class of formulae missing from the list 
above, namely cartesian formulae, which correspond in a similar way to carte- 
sian categories: this class is in fact intermediate between those of Horn formulae 
and regular formulae (we allow certain instances of existential quantification, but 
not all of them), and before we can explain which existential quantifications are 
permitted we have to develop the notion of provability appropriate to cartesian 
logic. So the definition of this class is postponed until 1.3.4 below. 

The variables x which occur in a formula ¢, but which are within the scope 
of some quantifier (3x) or (Yx), are called its bound variables. In practice we do 
not distinguish between two formulae if they are a-equivalent, that is, if they 
differ only in the names of bound variables. We note that the same variable may 
appear both bound and free in different parts of a compound formula; but for 
any ¢ there is an a-equivalent ¢’ in which this does not happen, and we normally 
suppose that the names of our bound variables have been chosen in such a way 
that these coincidences are avoided. A closed formula (also called a sentence) is 
one for which FV(¢) is empty. A closed term is one which contains no variables. 
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It is quite possible, for a particular language, that there are sorts which 
possess no closed terms. Consequently, the use of variables of these sorts carries 
an implicit existential hypothesis. In classical set-based semantics, this implicit 
hypothesis is normally ‘swept under the carpet’ by imposing the requirement 
that the interpretation of any sort must be a nonempty set; but since we wish 
to interpret our languages in categories more general than Set, where we do 
not always have such a sharp distinction between ‘emptiness’ and ‘possession of 
elements’, we cannot afford to make any such general assumption. Therefore, we 
shall always keep a tally of which of these implicit hypotheses are present in any 
particular situation, via the notion of a ‘context’. 


Definition 1.1.4 A contezt is a finite list 4 = 71,..., £n of distinct variables. 
The case n = 0 is allowed, being the empty context |]. If ¢ is a context and 
y is a variable different from those occuring in 7, then x,y will denote the 
context obtained by appending y to the list 7. Similarly 7, y denotes the result 
of concatenating contexts 7 and y when they are disjoint. The type of a context 
7 is the string of (not necessarily distinct!) sorts of the variables appearing in it. 


We shall hardly ever consider terms and formulae by themselves, but rather 
in some context. We say a context 7 is suitable for a formula ¢ if all the free 
variables of ¢ occur in £; a formula-in-context is an expression of the form 2.4, 
where ¢ is a formula and 7 is a suitable context for it. The canonical context for 
@ is the context consisting precisely of the distinct free variables of ¢, listed in 
the order of their first appearance. Similarly, a term-in-contert %.t is a term t 
together with a context 7 containing all the variables mentioned in t. 

We shall frequently make use of the operation of substituting terms for vari- 


ables in a formula. If ï= s1,...,Sn is a list of (not necessarily distinct) terms, 
of the same length and type as the context x, then 
o|5/z] 


will denote the formula (well-defined up to a-equivalence) resulting from simul- 
taneously substituting s; for each free occurrence of zx; in @¢, for all i < n, after 
first changing the names of bound variables in @ if necessary to avoid capture of 
variables in 3° by any quantifiers in ¢. (In practice, when making substitutions 
one normally takes care to ensure that the variables occurring in the terms to be 
substituted are disjoint from those which occur bound in the formula.) Similarly, 
t|$/Z] will denote the term resulting from simultaneously substituting each s; for 
Ti in t£. 

The reader should be warned that the operation of simultaneous substitution, 
as defined above, is not the same as sequential substitution. In the formula 
ġ[sı/zı][s2/£2], the term sz has been substituted for all occurrences of x2 in sı 
(that is, in those copies of sı that have been substituted into ¢), as well as the 
free occurrences of x2 in ġ; but in ¢[s1, s2/2), £2] it has not. 

Next, we introduce the formal expressions which will serve as axioms for the 
logical theories we wish to consider. 
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Definition 1.1.5 By a sequent over a signature © we mean a formal expression 
of the form (¢@ Fz Y), where ¢ and wy are formulae over © and 7 is a context 
suitable for both of them. The intended interpretation of this expression is that 
wv is a logical consequence of ¢ in the context 7, i.e. that any assignment of 
individual values to the variables in ¥ which makes ¢ true will also make yw true. 
We say a sequent (¢F z Y) is regular (coherent, ...) if both @ and y are regular 
(coherent, ...) formulae. 


In full first-order logic, the general notion of sequent is not really needed, since 
the sequent (¢ Fz Y) expresses the same idea as (T Fy) (V1)... (V2n)(=> y)), 
which we may simply identify with the formula on the right of the turnstile. 
However, the use of implication and universal quantification in the latter formula 
takes us out of any of the smaller fragments of logic that we wish to consider; 
so in these fragments it will be useful to have the notion of sequent available. In 
the opposite direction, it is sometimes helpful to consider sequents of the more 
general form ( Fz Y), having a finite string ¢ = (¢,...,@n) of hypotheses and 
a single conclusion; but since all the fragments of logic we wish to consider in 
this chapter will include finite conjunction, we shall have no real need of this 
extra generality. 


Definition 1.1.6 By a theory over a signature ©, we mean a set T of sequents 
over ©, whose elements are called the (non-logical) axioms of T. We say T is a 
regular (coherent, ...) theory if all the sequents in T are regular (coherent, ...). 


In fact the above definition is a provisional one: we shall wish to identify 
two theories if they have the same logical consequences, and so what we have 
just defined is really the notion of a presentation or ariomatization of a theory. 
However, it is clearly impossible to give a more precise definition of what we 
mean by a theory until we have developed the notion of logical deduction in 
Section D1.3. The reason for introducing Definition 1.1.6 at this stage is to 
enable us to give the list of examples of theories with which we now conclude 
this section, so as to give the reader some idea of the expressive power of the 
various fragments of first-order logic that we have introduced. 


Examples 1.1.7 (a) By an algebraic theory, we mean one whose signature 
X has a single sort and no relation symbols (apart from equality), and whose 
axioms are all of the form (T Fz ¢) where ¢ is an atomic formula (s = t) and 7 is 
its canonical context. Such theories can be used to express the notion of a set (or, 
more generally, an object of a category) A equipped with ‘algebraic structure’ 
in the form of finitary operations A” — A, satisfying ‘universal’ equations: 
examples include monoids, groups, abelian groups, rings, modules (over a given 
ring), lattices, Heyting algebras (cf. A1.5.11), and so on. 

(b) Slightly more generally, we may consider many-sorted algebraic theories, 
which are subject to the same restrictions as in (a) except that their signatures 
may have more than one sort. A well-known example is the theory of ring-module 
pairs: this has two sorts R and M, whose individuals are to be interpreted as 
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elements of the ring and of the module respectively, and (for example) scalar 
multiplication becomes a function symbol of type R, M — M. Another example 
is the theory of directed graphs: this has two sorts V and A (for vertices and 
arrows), two function symbols of type A — V (for source and target), and no 
axioms. (More generally, for any small category C the theory of diagrams of 
shape C can be expressed as a many-sorted algebraic theory.) 

(c) A familiar example of a theory whose signature involves relation symbols 
rather than function symbols is the theory of partially ordered sets. The signature 
for this theory has one sort A, one relation symbol < — A, A (which we write, 
as usual, in infix rather than prefix notation: (s < t) denotes the atomic formula 
< (s,t)), and no function symbols. The axioms are 


(T Fz (x <2), 
(2 < y) A (y < 2)) Fz, (£ = y)), and 
(((£ < y) A (y < z)) Fz,y,z (£ < 2))- 


It is clear that this theory is a Horn theory. More generally, we can axiomatize 
ordered algebraic structures such as ordered groups and ordered rings, by tak- 
ing a signature which includes both the relation symbol < and the appropriate 
function symbols, and writing down the appropriate Horn sequents as axioms. 
On the other hand, the theory of totally ordered sets, which is obtained from 
that of partially ordered sets by adding the sequent 


(T Fay ((£ < y) V (y < z))), 


is (coherent but) not a Horn theory. 

(d) Again within the realm of Horn theories, we can axiomatize the theories 
of various classes of algebraic structures such as torsion-free abelian groups. For 
this particular example, we take the algebraic theory of abelian groups (written 
additively), and add the Horn sequents 


((nz = 0) Fz (x = 0)) 
for every integer n > 1, where nz of course denotes the term 
((-(£+r)+:)+zr) 


in which the variable x appears n times. 

(e) The theory of categories, unlike that of posets, is not a Horn theory 
(of course, we are not in a position to prove at present that it cannot be axiom- 
atized by Horn sequents, but we shall see how to do so eventually - cf. 2.4.7 
below), but it is a regular theory (and we shall see in Section D1.3 that it is 
actually a cartesian theory). It can be expressed over a signature with a sin- 
gle sort, but it is more convenient to take one with two sorts O and M (for 
objects and morphisms), three function symbols id: O — M, dom: M — O and 
cod: M — O, and one relation symbol T — M,M,M (the intended meaning 
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of T(x, y,z) being that z is the composite of x and y). The axioms involving T 
include 


(T(z, y,Z) Fa,y,z ((dom(x) = cod(y)) A (cod(x) = cod(z)) A (dom(y) = dom(z))), 
((T(z,y,z) AT (z,y,w)) Fz y,z,w (2 = w)), and 
((dom(x) = cod(y)) Fz,y (Az)T(z, y, z)); 


we leave it to the reader to work out what the other axioms should say. 
(f) Another example of a regular theory is the theory of divisible abelian 
groups: this is obtained from the theory of abelian groups by adding the axioms 


(T Fe (Ay)(ny = z)) 


for all n > 1. Unlike the theory of categories, this is not a cartesian theory (but 
if we add the torsion-freeness axioms from example (d), we do get a cartesian 
theory); again, the proof of this fact will come later. 

(g) A local ring is commonly defined by ring-theorists as a nontrivial com- 
mutative ring (with 1) having a unique maximal ideal; this description appears 
to lie outside the realm of first-order logic since it contains an implicit quantifi- 
cation over ideals, but it is well known to be equivalent to the condition that the 
non-invertible elements form an additive subgroup (in fact an ideal), which is 
clearly first-order. If we ‘turn round’ this condition to get rid of the negations, it 
becomes a coherent theory: to the algebraic theory of commutative rings with 1, 
we add the axioms ((0 = 1) Fy; L) (to ensure nontriviality), and 


((dz)((@ + y)z = 1) Fay ((32)(xz = 1) v (3z)(yz = 1))). 


(Here we have used the customary ring-theoretic notation for terms built up 
from variables using the function symbols for addition and multiplication.) 

(h) Similarly, though a field is customarily described as a nontrivial ring in 
which every nonzero element is invertible (which would suggest the non-coherent 
sequent (-(x = 0) Fz (Ay)(ry = 1))), the theory of fields can be presented 
coherently: to the axioms for commutative rings with 1, plus the nontriviality 
axiom mentioned in (g), we add the coherent sequent 


(T Fa ((x = 0) v (3y)(xy = 1))) - 


The reader should be warned that this sequent, though classically equivalent to 
the non-coherent one mentioned earlier, is constructively stronger; and since we 
wish to interpret our logic in categories whose subobject lattices may not be 
Boolean, we shall have to employ constructive rather than classical logic. Thus 
we have two constructively inequivalent theories of fields, whose models in Set 
(or, more generally, in any Boolean coherent category) happen to coincide, but 
which ‘look different’ in a general Heyting category. (In fact, these two theories 
by no means exhaust the possible ways of defining a field; see [505].) However, 
it is a consequence of the completeness theorem for geometric theories, which 
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we shall prove in 3.1.16 below, that if two geometric theories (over the same 
signature) are classically equivalent (i.e. have the same models in all Boolean 
toposes), then they have the same models in all toposes. 

(i) It is remarkable how few of the first-order theories encountered in the prac- 
tice of mathematics fail to be (at least classically equivalent to) coherent theories. 
(There is a reason for this — see 1.5.13 below.) One that does so is the theory 
of posets in which each element lies below some maximal element: to the theory 
described in (c) above, we add the axiom 


(T Fe (3y)((x < y) A (Vz)((y < 2) > (y = 2)))) 


which is clearly first-order but not coherent. (Once again, it is far from clear 
at present that this theory cannot be axiomatized by coherent sequents, but we 
shall eventually be able to prove that this is so — cf. 2.4.10.) 

(j) As an example of a theory which is geometric but not (finitary) first- 
order, we mention the theory of torsion abelian groups: this is obtained from the 
algebraic theory of abelian groups by adding the sequent 


(i = Vine = 0)) 


In a similar vein, the theory of fields of a given finite characteristic p is first-order 
and even coherent (just add the sequent (T Hp (p1 = 0)) to the theory in (h)), 
but the theory of fields of finite characteristic is (geometric but) not first-order. 

(k) Another instructive example of a geometric theory is the theory of finite 
sets. To axiomatize this, we take one sort A and one n-ary relation symbol Rn 
for each n > 0. Then, for each n, we take the axiom 


TAC eeta) a ee og Vy = z:)) 
i=1 


which says that if an n-tuple of individuals satisfies the relation Rn then it 
exhausts the members of (the set interpreting) the sort. (The case n = 0 of this 
axiom is (Rọ Fy L), which says that if Rọ holds then the interpretation of the 
sort must be empty.) Then we add the axiom 


(T Fy \V (Ber) +++ (Ben) Rn(21,---52n)} 


Finally, to ensure that the interpretations of the Rn are uniquely determined by 
that of the sort (i.e. that Rn holds for all n-tuples which exhaust the elements 
of the sort, and not just for some of them), we must add the axioms 


(Rn(£1,---, En) Fai, oR Opa ethan) 
whenever f: {1,2,...,m}— {1,2,...,n} is a surjection, and 
((Rn(z1, EIR Tn) A (x, = z;)) Ke, ee N Rn-1(2ı, B -, 23-1, 0741, ee 9) 


whenever 1 <i<j <n. 
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(I) An example of an infinitary first-order theory which is not geometric is 
given by the theory of metric spaces. This may be axiomatized over a signature 
with one sort A and a family of binary relation symbols Re — A, A indexed 
by positive real numbers <€; the intended interpretation of R.(x,y) is that the 
distance between x and y is (strictly) less than e. Among the axioms, we have 
things like 


(R.(z,y) Fry Rs(z,y)) whenever e < 6, 
(Rdz, Yy) Fry Vise: Ro(z,y)) for all €, and 
(Re (x,y) A Ro(y, 2)) Fey, Re+o(2, 2) 


(the last of these expresses the triangle inequality), all of which are geometric; 
but the key axiom 


(A Rela.) Few (© =9)), 


e>0 


which distinguishes metric spaces from pseudometric spaces, requires an infini- 
tary conjunction. 

(m) We conclude by briefly mentioning the class of propositional theories. 
A propositional signature is one which has no sorts; it follows that it cannot 
have any function symbols either, and the only relation symbols are atomic 
propositions (i.e. have arity 0). Of course, in a propositional signature we have 
no variables, and so we can forget about contexts — the only context is the 
empty one. Since there can be no quantifiers either, the distinction between Horn 
formulae/sequents/theories and regular formulae/sequents/theories disappears, 
and we call them cartesian formulae/sequents/theories. Propositional theories 
are useful for describing subsets of a given structure with particular properties. 
For example, given a meet-semilattice L, we have a propositional theory of filters 
in L, presented in a signature with one atomic proposition F, (to be thought of 
as the assertion that a is in the filter) for each a € L; the axioms are (T F F) 
(where 1 is the top element of L), and all sequents of the form (Fa + Fp) with 
a< bin L, or 


((Fa A Fo) F Fano) 


for arbitrary a,b € L. This is clearly a cartesian theory; if L is a lattice, we may 
also obtain a coherent propositional theory of prime filters of L, by adding the 
axiom (Fo F L) and all instances of 


(Favo F (Fa V F)). 


And if L is a complete lattice, we may axiomatize completely prime filters of L 
(which play an important rôle in the theory of locales — see C1.2.2) as a geometric 
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propositional theory, by adding the infinitary analogue of the last axiom-scheme: 


(Fa V Fa, ) 


iEl 


whenever a is the join of the family (a; | i € T) in L. (These three examples are 
‘typical’ in a very strong sense, as we shall see in 1.4.14 below.) 


Suggestions for further reading: Kock & Reyes [640], Makkai & Reyes [790], 
Reyes [1004]. 


D1.2 Categorical semantics 


This section describes the interpretation in categories of the expressions of a 
first-order language. This material lies at the heart of the relationship between 
category theory and predicate logic, and is a direct generalization of the tra- 
ditional definition (due to A. Tarski) of satisfaction of first-order formulae in 
ordinary set-valued structures. 


Definition 1.2.1 (a) Let C be a category with finite products and © a 
signature. A X-structure M in C is specified by the following data: 


(i) A function assigning to each sort A in ©-Sort, an object MA of C. This 
function is extended to finite strings of sorts by defining M(Aj),..., An) = 
MA, x--- x MAn (and setting M ([]), where [] denotes the empty string, 
equal to the terminal object 1 of C). 


(ii) A function assigning to each function symbol f: A,---A, — B in ©-Fun 
a morphism Mf: M(A),...,An) ~ MB inC. 

(iii) A function assigning to each relation symbol R — A,--- A, in -Rel a 
subobject MR — M(Aj,...,An) in C. 


(b) The %-structures in C are the objects of a category U-Str(C) whose mor- 
phisms are the /-structure homomorphisms. Such a homomorphism h: M — N 
is specified by a collection of morphisms ha: MA— NA in C indexed by the 
sorts of © and satisfying the following two conditions: 


(iv) For each function symbol f: A;--- A, — B in D-Fun, the diagram 
Mf 
MAK 2A) ==> MB 


ha, X+- x ha, |he 


Nf 
N(Ai ss, A) NB 


commutes. 


818 FIRST-ORDER CATEGORICAL LOGIC D1.2 


(v) For each relation symbol R — A,--- A, in U-Rel, there is a commutative 
diagram in C of the form 


MR >—> MAGA) 


NR >——> N(Aj,..., An) 


Identities and composition in X-Str(C) are defined componentwise from those 


in C. 


We note in passing that if D is another category with finite products, then any 
functor T: C — D which preserves finite products and monomorphisms induces 
a functor ©-Str(T): ©-Str(C) — X-Str(D) in the obvious way; and any nat- 
ural transformation a: T) — Tz between two such functors induces a natural 
transformation Y-Str(T,) — S-Str(T2). Thus the construction Y-Str(—) is 
2-functorial. 


Remark 1.2.2 There is an alternative ‘looser’ way of interpreting first-order 
languages in a category, which is sometimes known as the Brouwer—Heyting- 
Kolmogorov interpretation (or BHK interpretation, for short). For the BHK 
interpretation, we interpret sorts and function symbols exactly as in 1.2.1, but 
a relation symbol R — A,---A, is interpreted as an equivalence class of (not 
necessarily monic) morphisms MR — M(Aj,...,An), where two morphisms 
f: X + M(Aj,...,An) and g: Y — M(Aj,...,An) are considered equivalent if 
they become isomorphic in the preorder reflection of C/M(A,...,An), ie. if 
there exist morphisms f — g and g —> f in C/M(Aj,..., An). It is possible to 
develop all the ideas of this section for the BHK interpretation, in parallel with 
the development which we shall give for the ‘Tarski interpretation’, but we don’t 
need to do this, since it is easy to see using A1.3.10(d) that a ‘BHK structure’ for 
a signature © in a category C is really the same thing as a Tarski structure for & 
in the regularization Reg(C) of C, in which the sorts of © happen to have been 
interpreted by objects in the image of the canonical embedding I: C — Reg(C). 
Therefore, we shall not pursue this interpretation further. 


In a &-structure in a category with finite products, the function symbols are 
interpreted as morphisms of an appropriate kind in the category. This inter- 
pretation extends to the terms over E (or rather, to the terms-in-context) in a 
natural way. 


Definition 1.2.3 Let M be a ¥-structure in a category C with finite products. 
If %.t is a term-in-context over È (with = 2,...,¢p,, zi: Aj (i= 1,.--,7) and 
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t: B, say), then a morphism 
[Z.t]a: M(Aj,...,An) > MB 


in C is defined recursively by the following clauses: 


(a) If ¢ is a variable, it is necessarily x; for some unique i < n, and then 
[Z.t]m = Ti, the ith product projection. 


(b) Ift is f(ti,...,tm) (where t;: Ci, say), then [Z.t]m is the composite 


M(Aj,..., An) AG ea Eel MC... Ca) = MB. 


When it is clear which particular Ł-structure is being referred to, we shall 
omit the subscript M from [Z.t]. 


Implicit in the above definition is the use of composition in the category to 
interpret the operation of substituting a term for a variable in a term. That is, 
we have the following Substitution Property: 


Lemma 1.2.4 Let y be a suitable context for a term t: C (where yi: Bi, say), 
and let & be a string of terms of the same length and type as 7. Further, let % be 
a suitable context for each s;. Then [Z.t|5/y]] is the composite 


([Z.sı],..., Z. Sm] y.t] 
M(Aj,...,An) Wabes sel M(Bı,..., Bm) va MC. 
Proof Straightforward induction over the structure of t. O 


We remark that 1.2.4 covers the case where we wish to make substitutions for 
only some of the variables in y, since we can take the remaining s; to be y;. In 
particular, we may consider the case where Z contains y and each s; is y;; then 
(IZ. s1ı],...,[Z.Sm]) is just a product projection morphism, 7, and we have the 
so-called Weakening Property: |Z .t] = [y.t]oa whenever y C Z. We see from the 
Weakening Property that knowledge of [7.t], where y is the canonical context 
for t, determines [Z.t] for any other context 7 suitable for t. Occasionally, we 
shall simplify our notation further by writing [¢] for the interpretation of t in its 
canonical context. 

Another important consequence of Definition 1.2.3 is that the ‘naturality’ 
with respect to the interpretations of function symbols, which was built into the 
definition of a homomorphism of Y-structures, extends to the interpretations of 
terms-in-context: 


Lemma 1.2.5 Let h: M — N be a homomorphism of X-structures in a cate- 
gory C with finite products, and let £.t be a term-in-context over © (where t: B, 
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E=21,...,Ln and x;: Aj, say). Then the diagram 
[z.t]u 
M(A\,...,An) ———> MB 
ha, X+: x ha, hg 
[Z.t] 
N(A1,...; An) ——> NB 
commutes. 
Proof Again, this is an easy induction over the structure of t. O 


We turn next to the interpretation of formulae in a /-structure. Once again, 
we interpret formulae-in-context rather than abstract formulae; also, the range of 
logical operators we are able to interpret will depend on the amount of categorical 
structure possessed by the category C in which the structure lives. 


Definition 1.2.6 Let M be a &-structure in a category C with (at least) finite 


limits. A formula-in-context £.¢ over © (where = z1,..., £n and x;: Aj, say) 
will be interpreted as a subobject 


[Z.¢]m >> M(Aj,...,An) 


according to the following recursive clauses (where, as before, we drop the 
subscript M if the structure under consideration is obvious): 


(i) If ọ is R(ti,...,tm) where R is a relation symbol (of type B,,..., Bm, 
say), then [z.@] is the pullback 


(2.6 MR 


M(Aı,..., An) e irat a, M(Bi,..., Bm) 


(ii) If @ is (s = t), where s and ¢ are terms of sort B, then [Z.@] is the 
equalizer of 


[z.s] 
MAiA) aeaee M 
[z.t] 
(Equivalently, [z.¢] is the pullback of the diagonal MB — MB x MB 


along ([z.s], [Z.¢]). Thus we may think of equality between terms of 
sort B as a special relation symbol of type B, B, which differs from 
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other relation symbols in that its interpretation in a /-structure is not 
an arbitrary subobject, but is required to be the diagonal.) 


(iii) If @ is T, then [Z.¢] is the top element of Sub(M(Aj,..., An)). 
(iv) If ¢ is (Yy A x), then [Z.¢] is the intersection (= pullback) 


[z.¢] >——> [z. 4] 


[Z.x] >—> M(A),..., An) 


(v) If pis L, and C is a coherent category, then [Z.¢] is the bottom element 
of Sub(M(Aj,..., An)). 
(vi) Similarly, if ¢ is (Y V x), and C is a coherent category, then [z.¢] is the 
union of the subobjects [z.w] and [z. x]. 
(vii) If @ is (Y >x), and C is a Heyting category, then [Z.¢] is the implication 
[z.~] => [Z.x] in the Heyting algebra Sub(M(Aj,...,A,)) (cf. A1.4.13). 
(viii) Similarly, if @ is ~~, and C is a Heyting category, then [Z.¢] is the 
Heyting negation —~[Z.~]. 
(ix) If œ is (Sy) where y is of sort B, and C is a regular category, then |Z.¢] 
is the image of the composite 


[Z y.y] >> M(Aj,..., An, B) —-> M(Aj,..., An), 


where ~ is the projection on the first n factors. 
(x) If ¢ is (Vy) where y is of sort B, and C is a Heyting category, then [z. 4] 
is Vx([Z,y.w]), where a is the same projection as in (ix). 
(xi) If @ is Vier Vi, and C is a geometric category, then [z.¢] is the union of 
the [Z%.w;] in Sub(M(Aj,...,An)). 
(xii) If ¢ is Nier Vi, and C has arbitrary intersections of subobjects, then [z. 6] 
is the intersection of the [Z.y;]. 


Examining the clauses of the above definition, we see that if M is a 
-structure in a cartesian category C, we can assign interpretations in M to 
all Horn formulae-in-context over © (though, as we shall see in the next section, 
this is not best possible); and if C is a regular (resp. coherent, Heyting, geometric) 
Category then we can assign interpretations to all regular (resp. coherent, first- 
order, geometric) formulae-in-context over ©. (In fact, in a geometric category, 
we can assign interpretations to all infinitary first-order formulae, since we know 
from A1.4.18 that a geometric category is a Heyting category and that its sub- 
object lattices have arbitrary intersections. However, geometric functors (that 
is, regular functors preserving arbitrary unions of subobjects — note that the 
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inverse image functors of geometric morphisms are examples of such functors) 
do not in general preserve this extra structure, and so they will not preserve 
the interpretations of non-geometric formulae.) To avoid tedious repetition in 
what follows, we shall use the phrase ‘¢ is a formula interpretable in C’ to mean 
that C belongs to one of the classes of categories mentioned above, and ¢ is in 
the corresponding class of formulae. (We shall similarly speak of sequents and 
theories being interpretable in C.) 

When we introduced the notions of regular, coherent and Heyting cate- 
gories in Chapter Al, we were at pains to ensure that the appropriate structure 
(images, unions, universal quantification) should not only exist in the categories 
in question, but should be stable under pullback. (In the first two cases this 
was specifically included as part of the definition, but for universal quantifi- 
cation it was a consequence of the other assumptions made by that stage - 
see Al.4.11.) Thus the Substitution Property has an immediate extension from 
terms to formulae, which is proved by a similarly straightforward induction: 


Lemma 1.2.7 Let 7.6 be a formula-in-context over © interpretable in C; let 
S be a string of terms of the same length and type as y, and let # be a contezt 
suitable for all the terms in 5. Then, for any X-structure M in C, there is a 
pullback square 


[z. o[5/a]]m ————_———_————_> [. 4] m 


((Z. siJu,---,]Z-sm]ar) 
M(Aj,..., An) —— M(B, ..., Bm) 
where the A; and B, are the sorts of the variables x; and y,. 0 


In particular, it follows that knowledge of the interpretation of @ in its 
canonical context determines its interpretation in any other suitable context, 
by pullback along an appropriate product projection. As with terms, we some- 
times abbreviate [z.¢] to [d] if Z is the canonical context for ¢. Another easy 
consequence of 1.2.7 gives a sufficient condition for discarding quantifiers over 
‘non-existent’ variables: 


Corollary 1.2.8 
(i) Let C be (at least) a regular category; let £.¢ be a formula-in-context over 
E interpretable in C, and let y: B be a variable not appearing in £ (and 
hence not free in o). Then, for any S-structure M in C such that the 
interpretation MB of B is well-supported (i.e. such that MB — 1 is a 
cover), the interpretations [Z.¢]u and [Z.(Sy)d].u are equal. 


(ii) IfC is a Heyting category and the hypotheses of (i) are otherwise unaltered, 
then the same conclusion holds for [Z.ġ]m and [Z.(Vy)¢] m- 
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Proof (i) If MB — 1 is a cover, then so are the bottom and top edges of the 
pullback square 


M(A,-.., An, B) — M (Aly Ad) 


(where z is projection on the first n factors), and so the top and right edges form 
the image factorization of the diagonal. 

(ii) In (i), we have shown that the composite 4, o z* is the identity on 
Sub(M(Aj1,...,An,)). Hence so is its right adjoint Yp o 7*. o 


We note that if there exists a closed term of sort B, then the interpretation 
of B in any &-structure will be well-supported, since the interpretation (in the 
empty context) of any such term will be a splitting for M B — 1. 

When we turn to the analogue for formulae of 1.2.5, life becomes more com- 
plicated: not all formulae have interpretations which are ‘natural’ with respect 
to arbitrary homomorphisms of }-structures. The best we can do is 


Lemma 1.2.9 Let C be (at least) a cartesian category, let h:ıM >N be a 
homomorphism of S-structures in C, and let .¢ be a formula-in-contezt which 
is both geometric and interpretable in C. Then there is a commutative square 


[z.d]a >—> M(Aj,..., An) 


[z.¢]n >——> N(Aj,.--.,; An) 


Proof This is once again an induction over the structure of ¢, using the fact 
that the diagram commutes iff [%.¢] is contained in the pullback of [Z.¢] 
along the right vertical map. B 


One reason why 1.2.9 does not extend to more general first-order formulae 
is that implication is not order-preserving in its first variable: if X,; < Y) and 
X2 < Yə in some subobject lattice in a Heyting category, we do not necessarily 
have (X,; > X2) < (Yı > Y2). (Similar problems arise with negation.) These 
problems can be avoided by requiring that the ‘naturality square’ of 1.2.9 should 
not only commute but also be a pullback. But we also have problems with 
universal quantification: for example, the centre of a group (the interpretation of 
the formula-in-context x.(Vy)(zy = yx)) is not ‘natural’ with respect to group 
homomorphisms. When dealing with first-order theories, we therefore have to 
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build the naturality result for arbitrary formulae into our notion of morphism of 
structures: 


Definition 1.2.10 (a) We call a homomorphism h: M — N of &-structures in 
a Heyting category an elementary morphism if, for each first-order formula-in- 
context Z.q@ over ©, there is a commutative square 


[Z.¢] >—> M(Aj,...,An) 


[Z.¢]y >—> N(Aj,..-, An) 


(Note that, thanks to 1.2.7, it is sufficient to demand this condition only for 
formulae in their canonical contexts.) 

(b) We call h an elementary embedding if the square above not only commutes 
but is a pullback for all first-order formulae-in-context. 

(c) We call h an embedding (or we say that M is a substructure of N) if the 
square above is a pullback for all atomic formulae (including equality). (Note 
that this requirement forces each h4 to be monic, since the naturality square 


MA>Ss MAx MA 


ha ha x ha 


NASER ANA 


for the equality relation of sort A is a pullback. Also, a straightforward inductive 
argument ensures that the naturality square is then a pullback for all quantifier- 
free formulae-in-context.) 


For structures in a Boolean coherent category, the notions of ‘elementary 
morphism’ and ‘elementary embedding’ coincide, since the commutativity of the 
squares of 1.2.10(a) for two complementary formulae ¢ and ~ọ forces both of 
them to be pullbacks (cf. the proof of A1.6.5). However, in the non-Boolean 
case an elementary morphism does not even have to be monic, as the following 
example shows: 


Example 1.2.11 Let C be the functor category {2, Set] (where 2 is the two- 
element ordered set {0,1}), and let © be the signature with one sort and no 
primitive symbols except equality (so that D-structures are simply objects). Let 
M be any object of C such that M(1) is a singleton, and let N be the terminal 
object of C. We claim that the unique morphism h: M — N is elementary. To 
see this, note first that, by an easy inductive argument, the interpretation in N 
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of any first-order formula-in-context 7,...,2,.@ is either the whole of N” = 1 
or its zero subobject. Thus, to verify that we have a commutative square as in 
1.2.10(a) for all Z.¢, it suffices to verify that [Z.ġ]m = 0 whenever [Z. 4] = 0. 
But the functor ‘evaluate at 1’ from C to Set is a Heyting functor (because it is 
the inverse image of an open inclusion, cf. A4.5.1), and it maps both M and N 
to singleton sets, so [z.¢] ~ = 0 forces [Z%.¢] (1) = [Z.¢]n(1) = 0. And this in 
turn forces [Z.¢]4(0) to be empty as well. 


We now come at last to the formal definition of the validity of an axiom in a 
structure. 


Definition 1.2.12 Let M be a ¥-structure in a category C. 


(a) Ifo = (ġ Fz Y) is a sequent over È interpretable in C, we say ø is satisfied 
in M (and write M € øo) if [Z.¢]m < [Z.¥] mu in Sub(M(A),...,An))- 

(b) If T is a theory over X interpretable in C, we say M is a model of T (and 
write M — T) if all the axioms of T are satisfied in M. 


(c) We write T-Mod(C) for the full subcategory of &-Str(C) whose objects are 
models of T, and T-Mod(C). for the non-full subcategory with the same 
objects but only elementary morphisms. 


Lemma 1.2.13 Let T:C — D be a cartesian (resp. regular, coherent, Heyting, 
geometric) functor between categories of the appropriate kind; let M be a 
-structure in C, and let o be a sequent over È interpretable in C. If M Eo 
in C, then X-Str(T)(M) | o in D. The converse implication holds if T is 
conservative. 


Proof An easy induction shows that T preserves the interpretations of all 
formulae-in-context interpretable in the appropriate class of categories; so the 
first assertion is immediate. For the second, note that a sequent (¢ Fz w) is 
satisfied in M iff the inclusion [¢.6A ~]m — [Z.¢] is an isomorphism. O 


In particular, if T is a regular (coherent, ...) theory over ©, then for any 
regular (coherent, ...) functor T: C — D the functor U-Str(T) which we defined 
after 1.2.1 restricts to a functor T-Mod(T): T-Mod(C) — T-Mod(D). (In prac- 
tice, we usually denote this functor simply by T.) Note also that any such T 
preserves pullbacks, and so further restricts to a functor between the non-full 
subcategories of T-models and (elementary) embeddings. 

The importance of the second assertion of 1.2.13 is that, in suitable cases, it 
allows us to reduce the problem of determining which /-structures are T-models 
in an unfamiliar category C to the corresponding problem in a more familiar 
category D, by constructing a suitable structure-preserving embedding of C in D. 
We give two examples of this process: 


Corollary 1.2.14 Let T be a geometric theory over a signature ©. Then 


(i) For any small category C, a -structure M in |C, Set] is a T-model iff each 
ev.(M), c € ob C, is a T-model in Set, where ev. denotes the functor 
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‘evaluate at c’. In fact we have an isomorphism T-Mod(|C,Set]) = 
[C, T-Mod(Set)]. 

(ii) For any topological space X, a S-structure M in Sh(X) is a T-model iff 
each z*(M), x E€ X, is a T-model in Set, where x*: Sh(X) — Set is 
the stalk functor associated with x (i.e. the inverse image functor along x 
regarded as a continuous map 1 > X, cf. C1.3.2). 


Proof In each case, we have merely to observe that the functors ev, (resp. 2*) 
are geometric (because they are inverse image functors), and that they are jointly 
conservative. For the second assertion of (i), we also need to observe the (obvious) 
fact that a homomorphism of ¥-structures in [C, Set] is essentially the same thing 
as a natural transformation between functors C —> L-Str(Set). o 


Note that 1.2.14(ii) is valid only for spaces X, and not for more general locales 
(cf. C1.2.3): indeed the assertion, for a locale X, that the functors x* are jointly 
conservative (as x ranges over all points of X) is equivalent to the assertion that 
X is spatial, as we saw in C1.4.9. 


Examples 1.2.15 We conclude this section by revisiting some of the exam- 
ples of theories considered in 1.1.7, and describing their models in appropriate 
categories. 

(a) If T is an algebraic theory, then T-models in a cartesian category C are 
just what we usually mean by objects equipped with algebraic structure (group 
objects, R-module objects, ...) in C. The key point in proving this is that a 
sequent (T Fz (s = t)) is satisfied in a structure M iff the equalizer of [Z.s]m 
and [Z.t]m is the whole of their domain, iff the interpretations of ¥.s and Z.t in 
M are equal. Similar remarks apply to the many-sorted algebraic case. (When 
dealing with algebraic theories, we tend to speak of algebras rather than models 
of a theory.) 

(c) Again, a model of the theory of posets, as described in 1.1.7(c), in a 
cartesian category C is just an internal poset in C, as usually defined. For exam- 
ple, if we write (a,b): Mı — M for the interpretation in a /-structure M of the 
relation symbol < (and we have suppressed the name of the unique sort of X, 
to simplify the notation), then the third (transitivity) axiom of 1.1.7(c) becomes 
the assertion that 

™2(Mi) N m93(M1) < m713(M1) 


in Sub(M x M x M), where 7;;: M x M x M — M x M is projection on the 
ith and jth factors. But this is equivalent to saying that, if we form the pullback 


q 
Mz ———> M, 


Mı ———> M 
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the morphism (ap, bq): Mz —> M x M factors through (a,b): M, —~ Mx M - 
which is how we defined transitivity for relations in a cartesian category in 
A1.3.6(d). 

(e) If C is a regular category, then a model of the theory of categories in C 
is, yet again, equivalent to an internal category in C as defined in B2.3.1. But 
the latter definition is interpretable in any cartesian category, whether or not 
it is regular. The reason is as follows: if (adopting the notation of B2.3.1) we 
write Co and Ci for the objects interpreting the sorts O and M respectively, 
and C2 > Ci x Ci x C for the interpretation of the ternary relation symbol T, 
then the existential quantifier in the third displayed sequent of 1.1.7(e) tells us 
to form the image of the composite 


T 
EE CEEE E A e E E A 


but the validity of the second displayed sequent tells us precisely that this com- 
posite is monic, and so it has an image factorization (which is, moreover, stable 
under pullback) even if arbitrary morphisms of C do not. The idea underlying 
this, that we are allowed to form images of morphisms which are already known 
to be monic, will be the key ingredient in the definition of a cartesian theory in 
the next section. 

(f) If A is an abelian group object in a regular category C, the divis- 
ibility axioms of 1.1.7(f) are satisfied in A precisely if the nth power map 
[z.nz]: A — A is a cover, for all n > 1. Once again, we note that the torsion- 
freeness axioms of 1.1.7(d) (plus a little abelian group theory) assert that these 
maps are monic; so the conjunction of the two says that they are isomorphisms, 
which makes sense in any cartesian category. 

(g) We shall not attempt to describe what it means to be a model of the 
theory of local rings in an arbitrary coherent category; but we note that for the 
category Sh(X) of sheaves on a topological space X, it reduces by 1.2.14(ii) to 
the notion of a sheaf of rings A on X (that is, a ring object in Sh(X)) whose 
stalks are all local rings — and this is what algebraic geometers usually mean 
when they talk about a ‘sheaf of local rings’. Note that this condition does not 
imply that the ring A(U) of sections of such a ring object over an open set U 
is local: the functor ‘evaluate at U’ is (cartesian but) not coherent. Even in the 
simple case when X is a discrete two-point space {x,y}, and U is the whole of 
X, we can observe this failure: A(X) is then the cartesian product of the stalks 
z*(A) and y*(A), and a cartesian product of two local rings is not local. Similar 
remarks apply to the (coherent) theory of fields, described in 1.1.7(h), and to 
the geometric theory of torsion groups described in 1.1.7(j). 

(k) Again, we shall not describe the models of the theory (K, say) of 1.1.7(k) 
in a general geometric category, but it is instructive to note that its models 
in a Grothendieck topos € are exactly the Kuratowski-finite objects of £, as 
defined in Section D5.4 below. (The proof of this fact is essentially contained in 
5.4.13.) Note, however, that morphisms of K-models are epimorphisms rather 
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than arbitrary morphisms between K-finite objects, since they are required to 
preserve the validity of the relations Ry. 

(m) Finally, we discuss the propositional theory P of completely prime filters 
in a frame L, introduced in 1.1.7(m). Clearly, a structure for the appropriate 
signature in a geometric category C is simply a function assigning to each atomic 
proposition F, a subterminal object Ua — 1 in C; and the assertion that such 
a structure is a model of P is equivalent to saying that a +> Ua preserves finite 
meets and arbitrary joins, i.e. that it is a frame homomorphism. In particular, 
if C = Sh(X) is the topos of sheaves on a locale X, then the lattice Sube(1) 
is isomorphic to O(X), by C1.3.15; so P-models in Sh(X) correspond to locale 
maps X — Y, where Y is the locale defined by O(Y) = L. And by C1.4.5 these 
in turn correspond to (isomorphism classes of) geometric morphisms Sh( X) —> 
Sh(Y). Moreover, a homomorphism of models in this case is just the same thing 
as an inequality between morphisms in Loc, as defined in C1.1.6; so we actually 
have an equivalence of categories 


P-Mod(Sh(X)) ~ Top (Sh(X), Sh(Y)) . 


This is a part of the assertion that Sh(Y) is a classifying topos for the theory P; 
see 3.1.14 below. 


Suggestions for further reading: Kock & Reyes [640], Makkai & Reyes [790], 
Reyes [1004]. 


D1.3 First-order logic 


The definitions of the last two sections provide an extremely useful tool for con- 
structing objects and morphisms with prescribed properties in a given category C. 
Starting with (a structure for) a signature which names some ‘basic’ objects and 
morphisms of C, we can use the terms and formulae over this signature to name 
more complicated objects and morphisms via the categorical semantics developed 
in the last section. 

But first-order logic is more than a convenient shorthand for describing par- 
ticular objects and morphisms of a category; it is also a tool for proving things 
about them via a suitable deduction-system. Our first concern in this section is 
to develop such deduction-systems for the various fragments of logic which we 
have considered, and to prove that they are sound for the categorical semantics; 
i.e. that anything which is formally derivable in the deduction-system will be 
valid in any structure for the given signature in any category of the appropriate 
type. Of course, we should also like the deduction-systems to be complete in 
the sense that the converse of the last assertion holds: anything which is valid 
in all structures in all suitable categories should be formally derivable in the 
logic. The completeness theorem is indeed true for the deduction-systems we 
shall construct, but the proof of this result will be deferred to Section D1.4. 
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Our deduction-systems will be formulated as sequent calculi; that is, they will 
provide rules for inferring the validity of certain sequents (as defined in 1.1.5) 
from certain other sequents. (There are other possible ways of presenting them, 
but experience suggests that a sequent calculus is the most convenient for our 
purposes.) We shall write the rules in the form 


T 


oO 


where T is a (possibly empty) list of sequents and ø is a sequent; the interpreta- 
tion of the rule being that if we have established the validity of all the sequents 
in I’ we may infer the validity of ø. In the particular case when T is empty, we 
shall say that ø is a (logical) axiom, and omit the line above it. We shall also 


use the shorthand 
o 


T 


to indicate that both 2 and are rules of inference, i.e. that either of ø and T 
may be inferred from the other. A derivation in the deduction-system will then 
have the form of a tree such as 


onl 02 
03 O4 
05 


where each sequent without a line above it is an axiom, and each sequent below 
a line follows by a rule of inference from those above the line. (If our language 
includes either of the infinitary connectives V and N, the tree may be infinite; 
but we still require it to be well-founded in the sense that it has no infinite 
ascending chains.) A derivation relative to a theory T (defined as in 1.1.6) will 
be a similar tree, except that the sequents without lines above them are allowed 
to include (non-logical) axioms of T as well as the logical axioms to be listed 
below. 

Apart from the ‘structural’ rules for manipulating sequents, and the rules 
for handling the equality predicate, the rules mostly have the form of ‘intro- 
duction’ or ‘elimination’ rules for the various logical connectives and quantifiers. 
Of course, in each fragment of logic which we consider, we adopt the introduc- 
tion and elimination rules for those connectives and quantifiers which are present 
in the language (as well as the structural and equality rules, which are present 
in all fragments); thus, for example, coherent logic is specified by the rules in 
parts (a), (b), (c), (d), (f) and (i) of the following definition, and full (finitary) 
first-order logic is obtained by adding the rules in (e) and (g) (and deleting (7)). 
Throughout the definition, it is assumed that all the sequents which appear are 
well-formed, i.e. that the contexts which appear in them are suitable for the for- 
mulae on either side; but, apart from this restriction, the formulae ¢, ~ and x are 
allowed to be anything in the appropriate fragment of the first-order language. 
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Definition 1.3.1 (a) The structural rules consist of the identity ariom 


(dz 4), 


the substitution rule 
(dbz Y) 
(9[8/z] Fg Y[8/z)) 
where ¥ is any string of variables including all the variables occurring in the 
string of terms 5, and the cut rule 


(dbz Y) (Y Fz x) 
(Fz x) l 


Note that the substitution rule allows the possibility of making the trivial substi- 
tution [z/z]; thus it includes the weakening rule which says that we may derive 
(pty Y) from (dF z Y) provided ¥ contains all the variables in 7. Note also that 
the converse of the weakening rule is not a permitted rule of inference; this is 
particularly important when applying the cut rule, as it means that we cannot 
‘eliminate’ any variables which may occur in w but not in @ or x from the con- 
text of the conclusion of this rule. (However, we can eliminate a non-occurring 
variable from a context if it contains another variable of the same sort, by sub- 
stituting the latter for the former; and similarly if there is a closed term of the 
appropriate sort.) 
(b) The equality rules consist of the axioms 


(T Fz (x = 2)) 


and 
(((£ = F) Ab) Fz lg/2]) 
where 7 and # are contexts of the same length and type, (Z = Į) is a shorthand 
for ((£xı1 = yi) A--:A (an = Yn)), and Ž is any context containing 7, Į and 
the free variables of ¢. From these (plus the structural and conjunction rules), 
it is a standard exercise to derive the sequents ((£x = y) Fz (y = x)) and 
(((x = y) A (y = z)) Fzy,z (£ = z)), which say that equality is symmetric and 
transitive; so we could have assumed these as additional axioms if we wished. 
(c) The rules for (finite) conjunction are the axioms 


(dFzT), ((PAp) Fed), ((PAY) Fz Y) 
and the rule 
(dbz Y) (Fz x) 
(pre (PAX)) | 
(d) The rules for (finite) disjunction are the ‘duals’ of those in (c): the axioms 


(Lz), (bFz (PV Y)) and (Wz (PV y)), and the rule 


(dz x) (Y Fz x) 
(PV Y) Fz x) 
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(e) The rules for implication consist of the double rule 


((% AY) Fz x) 
(Y Fz (¢=>x)) 


The rules for negation are the special case of this double rule obtained by setting 
x = 1, and identifying ~ọ with (¢> 1). 
(f) The rules for existential quantification consist of the double rule 


($ Fz y Y) 
((dy)¢ Fz Y) 


(here our standing hypothesis that all sequents are well-formed includes the 
information that y is not free in 4). 
(g) The rules for universal quantification consist of the double rule 


(od Fey w) 
(bre (Yyy) 


(h) The rules for infinitary conjunction and disjunction are the obvious 
infinitary analogues of those in (c) and (d). 

(i) Finally, we mention two ‘mixed axioms’ which are required for coherent 
logic: the distributive axiom which links A and V: 


(CPA (HV x)) Fe (OA) V (OA X))) 


and the Frobenius axiom which links A and 2: 


(DA (Ay)¥) Fe (Ay)(bA ¥)) 


where y is a variable not in the context 7 (and hence not free in ¢). Both of 
these sequents are provable in full first-order logic using the implication rules; 
for example, here is a derivation of the second one: 


(Gy A Y) Fz (Gy) ^A W)) 
(PAY) Fz y (Gyo A y)) 
(Y Fz y (> (dy) A Y))) 

((dy)y Fz (@> (Ay)(@AY))) 

(pA (3y) Fz (Gy) A ¥)) - 


However, when we are working in fragments of first-order logic which do not 
include implication, these sequents are not derivable — for the distributive axiom, 
this may be seen from the fact that the other axioms involving A and V are sound 
for interpretations in arbitrary (not necessarily distributive) lattices. So we must 
add them both as axioms in coherent logic, and the Frobenius axiom alone in 
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regular logic. In geometric logic, we similarly add the appropriate infinitary 
generalization of the distributive axiom, together with the Frobenius axiom. 
(Incidentally, the converses of both axioms are derivable in coherent logic: for 
example, 


((Ay)p ke (Ay)w) 
(dAv) bey Y) (Whey (dyo) 
(PAW) Fzy $) (PAW) Fey (sy) 
((Ay)(@ Aw) Fz $) ((Sy)(oA Y) Fz (dyly) 
((Ay)(@A Y) Fz (PA (Ay)v)) 


is a derivation of the converse of Frobenius using only rules in (a), (c) and (f).) 


We say a sequent ø is derivable or provable in a (Horn, regular, coherent, ... ) 
theory T if there exists a derivation relative to T, in the appropriate fragment 
of first-order logic, with ø as its ‘bottom line’. The soundness theorem may now 
be proved by an easy induction over the ‘height’ of a derivation: 


Proposition 1.3.2 (Soundness Theorem) Let T be a Horn (respectively 
regular, coherent, first-order, geometric) theory over a signature %, and let M be 
a model of T in a cartesian (resp. regular, coherent, Heyting, geometric) category 
C. Ifo is a sequent (in the appropriate fragment of the first-order language over 
E) which is provable in T, then M Eo. 


Proof We have merely to show, for each of the rules in 1.3.1, that if M (lives 
in a category of the appropriate kind and) satisfies the sequent(s) appearing 
above the line in the rule, then it also satisfies the sequent below the line. But 
this is trivial in almost every case, from the way in which the interpretation of 
formulae was defined in the last section. For example, the soundness of the sub- 
stitution rule follows from Lemma 1.2.7 and the fact that pullback of subobjects 
along a fixed morphism is order-preserving; and the second equality axiom is 
the assertion that, for any subobject A’ — A in a cartesian category, we have 
TA) OA = 73(A'!)N A < 23(A’) in Sub(A x A). The soundness of the double 
rule for existential quantification is simply the assertion that existential quantifi- 
cation in a regular category is left adjoint to pullback (cf. A1.3.1(iii); note that. 
since w does not contain y as a free variable, its interpretation [Z,y.7] is the 
pullback of [F.Y] along a product projection, by 1.2.7). Similarly for universal 
quantification in a Heyting category. Finally, the validity of the Frobenius (resp. 
distributive) axiom in arbitrary regular (resp. coherent) categories was verified 
in A1.3.3 (resp. A1.4.2). 0 


Remark 1.3.3 The reader will have observed that our deduction-system is 
essentially constructive (or intuitionistic, if you prefer), in that the Law of 
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Excluded Middle 
(T kz (@V 7¢)) 


does not appear as an axiom (and indeed it is not derivable, since it is satisfied 
in a structure M iff the interpretation [Z.ġ]m is a complemented subobject of 
MA, x --- x MAn, and we know that our logic is sound for interpretations in 
categories where not all subobjects are complemented). Of course, we obtain clas- 
sical first-order logic by adding this axiom to the rules of 1.3.1(a~g); it follows 
from 1.3.2 that this logic is sound for arbitrary structures in Boolean coher- 
ent categories (recall that all such categories are in fact Heyting categories, by 
A1.4.10). 


We next fulfil a promise made earlier in this chapter, by introducing the 
notions of cartesian formula and cartesian theory. 


Definition 1.3.4 (a) Let T be a (regular) theory. The class of cartesian 
formulae-in-context relative to T is defined as follows: atomic formulae are carte- 
sian (in any context), finite conjunctions of cartesian formulae are cartesian, and 
£.(Ay)¢ is cartesian provided Z, y.¢ is cartesian and the sequent 


(DA dlz/y]) Fay,2 (y = 2)) 


is provable in T. (Here z is a variable of the same sort as y, not occurring in Z.) 
A sequent (@ Fz w) is said to be cartesian relative to T if both ¢ and wp are 
cartesian relative to T in the context 7. 

(b) A regular theory T is said to be cartesian if there is a well-founded partial 
ordering of its axioms, such that each axiom is cartesian relative to the subtheory 
formed by the axioms which precede it in the ordering. 


For example, the theory of categories is cartesian, because its only axiom 
involving existential quantification is the third displayed sequent of 1.1.7(e), and 
this is cartesian relative to the theory whose only axiom is the second displayed 
sequent. 

If M is a T-model (where T is, for the moment, a Horn theory) in a cartesian 
category C, then any cartesian formula-in-context relative to T has an interpre- 
tation in M: we simply redefine the interpretation |Z. (3y: B)\y]m, as compared 
with the definition in 1.2.6(ix), to be the composite 


IZ, yY] >> M(Ay,..-,An,B) —> M(Ai,..., An); 


which we know to be monic, by the soundness theorem for Horn logic in cartesian 
categories. It is clear that the Substitution Lemma 1.2.7 remains valid for this 
interpretation, and that the rules of 1.3.1(f) and the Frobenius axiom are sound 
for it; hence we can recursively define what it means for a 4-structure in C to be 
a model for a cartesian theory T, and we obtain 
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Proposition 1.3.5 Let T be a cartesian theory, and o a cartesian sequent 
relative to T. If o is provable in T (using the rules of regular logic), then it is 
satisfied in every T-model in a cartesian category. o 


Remark 1.3.6 Another fragment of first-order logic which is of some interest, 
although we shall not discuss it in detail here, is disjunctive logic: this is the 
sub-fragment of geometric logic (or, in the finitary case, coherent logic) in which 
formulae are subject to the same restriction as in 1.3.4 whenever an existential 
quantifier occurs, and additionally to the restriction that disjunctions must be 
‘provably disjoint’, i.e., before we can write down a disjunction V;er $i, we need 
to know that the sequents ((¢; A ġ;) + L) are provable for all i # j. An example 
of a disjunctive theory which is not cartesian is the theory of fields, as described 
in 1.1.7(h): in the formula 


((x = 0) v (Ay) (xy = 1) 


which occurs in the sequent displayed there, the existential quantifier is justified 
by the fact that inverses in a (commutative) ring are unique when they exist, and 
the disjunction is justified by the nontriviality axiom ((0 = 1) F L). On the other 
hand, the theory of local rings is not disjunctive, since the disjunction occurring 
in the displayed axiom of 1.1.7(g) is not provably disjoint. (Finitary) disjunctive 
logic may be interpreted in any category with finite limits, a strict initial object 
and disjoint (finite) coproducts which are stable under pullback; such categories 
are naturally called (finitary) disjunctive categories (in the finitary case, they 
are also sometimes known as lextensive categories). For more details about this 
fragment of logic, see [508]. 


We may also now introduce the notions of provable equivalence of formulae 
and of of equivalence of theories; the latter was promised after Definition 1.1.6. 


Definition 1.3.7 (a) Let ¢ and @’ be two formulae, and ¢ a context suitable 
for both of them. We say that ¢ and @’ are provably equivalent (relative to a 
theory T) in the context £ if the sequents (¢@ Hz ¢’) and (¢’ tz ¢) are both 
derivable (relative to T). (We sometimes write (p 4} z ¢’) to indicate that both 
these sequents hold.) 

(b) Let T and T’ be two theories over the same signature ©. We say T and 
T’ are equivalent if every axiom of T’ is provable (in the appropriate fragment 
of logic) in T, and conversely. 


It is clear that if T and T’ are equivalent, then exactly the same sequents 
are provable in each of them; for if we have a derivation of ø relative to T’, we 
may convert it into a derivation relative to T by inserting, above each axiom of 
T’ which appears, a derivation of that sequent relative to T. Hence also equiv- 
alence of theories is an equivalence relation. From now on, when we talk of a 
theory, we shall tend to think of it as an equivalence class rather than as a 
particular set of axioms. Note, however, that our notion of equivalence is still 
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signature-dependent: we cannot say, for example, that the theory of Boolean 
algebras (that is, complemented distributive lattices) is equivalent to the the- 
ory of Boolean rings (that is, rings in which every element z satisfies x? = <), 
because they are written over different signatures. Later on (in 1.4.9), we shall 
introduce the weaker notion of ‘Morita equivalence’ which frees our theories from 
this dependence on a particular signature. 

Using the notions introduced in 1.3.7, we may simplify the presentations of 
theories in a way which is often useful. 


Lemma 1.3.8 
(i) Any regular formula-in-contezt is provably equivalent (in the empty theory) 
to one of the form (Az)@ where ¢ is a Horn formula (and (AZ) is shorthand 
for (Ar,)--+(Ar,)). Moreover, if the original formula is cartesian relative 
to some theory T, so is the equivalent formula of the above form. 


(ii) Any coherent formula-in-context is provably equivalent to one of the form 
Vi Gi, where the ġ; are regular formulae (in the form described in (i)). 
The same holds for geometric formulae, if the disjunction is allowed to be 
infinitary. 


Proof (i) We observe first that a formula of the form (¢ A (Sxr)q) having an 
existential quantifier ‘inside’ a conjunction may be replaced by (Ar)(@Aw) using 
the Frobenius axiom and its converse, provided x does not occur in @ — and we 
may always assume this is so, since we are free to rename the bound variables of 
(Jx)y. By repeated use of this technique, we may reduce any regular formula to 
the required form. It is also clear that, if (Ar)y is cartesian relative to T, then 
so is (Jx)(¢ A Y), so we do not destroy cartesianness in this process. 

(ii) This time, we first use the distributive axiom and its converse, plus the 
(easy) fact that (Ar)(@V y) is provably equivalent to ((Sxr) dV (3x)y), to ‘pull all 
disjunctions to the front’, i.e. to reduce our formula to a disjunction of regular 
formulae. Then we apply the technique of part (i) to each of the latter. The 
geometric case is similar. o 


It will be noted that, in the above proof, we have made use of the fact that 
provable equivalence of formulae is ‘stable under compounding’, i.e. that if ¢ is 
provably equivalent to ¢’, then (@ A w) is provably equivalent to (¢’ A y), and 
Similarly for the other connectives and quantifiers. All such results are easily 
derived from the rules of inference in 1.3.1. 


Remark 1.3.9 We do not have any standard form, similar to those of 1.3.8, 
for first-order formulae-in-context. Indeed, in the constructive logic in which we 
are working, we cannot even pull all the quantifiers to the front of a first-order 
formula (as we are accustomed to do in classical logic): the ‘dual’ of the Frobenius 
axiom, i.e. the sequent 


(Vy) V y) Fz ($ V (Vy)v)) 
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where y is not free in ¢, is not constructively valid, and so the formula on the 
right is not in general provably equivalent to one with all its quantifiers at the 
front. (But cf. 5.4.6 below.) 


Proposition 1.3.10 
(i) Any Horn theory T is equivalent to one whose axioms have the form 
(bbz Y), where ¢ is a finite conjunction of atomic formulae (possibly the 
empty conjunction T) and w is atomic. 


(ii) Any regular theory (in particular, any cartesian theory) T is equivalent to 
one whose axioms have the form 


($ Fz (Ay)y) 


where @ and w are Horn formulae, and we may additionally assume that 
the sequent (Y Fz g $) is provable (in the empty theory). 


(iii) Any coherent theory T is equivalent to one whose axioms have the form 
(dbz \Y (Avi) vi) 
i=1 


where @ and the p; are Horn formulae, and the sequents (pi Fz g, Q) are 
provable. 


(iv) Any geometric theory T is equivalent to one whose arioms have the same 
form as in (iii), except that the disjunction on the right may be infinite. 


Proof (i) If we have a sequent of the form (¢+ T) in the axioms of T, we can 
leave it out, since such sequents are logical axioms. And if we have a sequent 
(ġ Fz A;_, Yi) with a nontrivial conjunction on the right, we may replace it by 
the family of sequents (¢ Hz Yi), 1 < i < n. It is straightforward to verify from 
the rules for finite conjunction (and, in one direction, the cut rule) that this 
family of sequents is equivalent to the single sequent with which we started. 

(ii) We first use Lemma 1.3.8(i) to replace any sequent (¢ + w) with an equiv- 
alent one in which the formulae on each side have all their existential quantifiers 
at the front. Then we observe that the rules for existential quantification allow 
us to remove such quantifiers from the left-hand side of a sequent, at the cost 
of adding more variables to the context. Finally, having got our axioms into the 
form (¢ tz (Ay) . Y) where ¢ and y are finite conjunctions of atomic formulae, 
we may replace the formula y on the right by (¢ A y) without changing (the 
equivalence class of) our theory. 

(iii) Similarly, we first reduce our axioms to sequents in which the formulae on 
either side are in the standard form described in 1.3.8(ii). Then we may eliminate 
any disjunctions on the left-hand side at the cost of increasing the number of 
axioms (as we did for conjunctions on the right in (i)), and eliminate existential 
quantifiers on the left as we did in (ii). The final assertion is established by the 
same trick as in (ii). 

(iv) is similar to (iii). o 
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Given a small category C with (at least) finite limits, we may define a canon- 
ical signature Xc for reasoning about it, as follows. (We shall also make informal 
use of this notation even when C is not small, even though we demanded in 
Definition 1.1.1 that our languages should be ‘set-based’; it clearly doesn’t 
make any essential difference if we allow a proper class of sorts.) The sig- 
nature Lc has one sort "A7 for each object A of C, one function symbol 
rf: TAT TAn — "B? for each morphism f: A; x --- x A, —> B in C, and 
one relation symbol "R'—" A;'---"A,' for each subobject R = A, xX: X Ay 
of a finite product in C. The first-order language over this signature is called the 
internal language of C. 

There is an evident structure for Xc in C, which assigns A to "A7, f to 
rf and R to TR”. If this structure satisfies a particular (cartesian, regular, 
coherent, ...) sequent o over Yc, we shall say that C itself satisfies the sequent, 
and write C H ø. The advantage of this (somewhat tautological) definition is 
that it allows us to argue ‘in the formal language’ to prove things about C: for 
example, if C is a regular category, then the set of regular sequents over Xe 
which are satisfied in C is closed under the rules of inference in 1.3.1(a—c) and 
(f), and so anything which we can deduce by these rules from sequents which 
we know to be valid will also express a true assertion about C. This allows us to 
reason about C ‘as if its objects were sets and its morphisms were functions’ (at 
least provided we restrict ourselves to regular logic, and argue constructively). 
Before giving some examples of this process, we first list some instances of how 
categorical information about C may be encoded in the formal language. 


Lemma 1.3.11 Let C be a regular category. 


(i) f: A— A is the identity morphism iff C = (T Fy (f(r) = 2)). 
(ii) f: A—C is the composite of g: A> B andh: B —C iffC — (T F; 
(f(x) =" h(g (z)))). 


(iii) f: A — B is monic iff C $ (f(x) =" f (2’)) ke (£ = 2’)) 
(iv) f: A— B is a cover iff C $ (T Fy (Ax) f(x) = y)). 
(v) A is a terminal object iff C H (T H (Ar)T) and C H (T Fz (£ = 2')) 


(here x and x’ are of sort" A‘). 


(vi) A diagram 
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is a pullback square iff C satisfies the three sequents 


(The TAU F(@)) =k (g (2), 
(fe) =" F) A (9 (2) =" 9 (2'))) Fra (£ = 2’)), and 
(R (y) = TR) Fy,z (Ax) F (a) = y) A ("9 (2) = 2))) - 


Proof We note that C H (T Fz (s = t)) iff the equalizer of [x.s] and [x.t] 
is the whole of A (where A” is the sort of x), iff [x.s] and [x.t] are the same 
morphism. Statements (i) and (ii) follow immediately from this. For (iii), we 
note that by Lemma 1.2.7 the interpretation [v,2’.(" f(z) =" f (x'))] is the 
pullback along f x f: A x A— B x B of the diagonal subobject of B x B. This 
pullback always contains the diagonal subobject of A x A; so the validity of the 
given sequent is equivalent to saying that 


A ————_> B 


A A 
fxf 


Ax A——>BxB 


is a pullback, ie. that f is monic. For (iv), we observe that the interpretation 
fy. (Ix) f(x) = y)] is the image of the graph of f under the projection Ax B —> 
B; but this is the same as the image of f itself. Now the two sequents in (v) say 
respectively that the unique morphism A — 1 is a cover and that it is monic; 
similarly, in (vi), the first sequent says that the given square commutes and the 
other two say that the canonical morphism from A to the pullback of h and k is 
an isomorphism. o 


From now on, we shall tend to drop the notation "—7, and simply identify 
the sorts, function symbols and relation symbols of the internal language of C 
with their interpretations in C. The following ‘functional completeness’ result is 
often useful: 


Proposition 1.3.12 Let C be (at least) a regular category, let A and B be 
objects of C and let @ be a formula of the internal language of C having free 
variables x and y of sorts A and B respectively. Suppose that the sequents 
(PA o[y’/y]) Fz y.y (y= y’)) and (T Fz (Ay)@) are satisfied in C. Then there 
is a unique morphism f: A — B in C such that the sequents (Otay (f(z) = y)) 
and ((f(z) = y) Fz, $) are valid. 


Proof If we write (a,b): R> A x B for the interpretation [z, y.@], then the 
two given sequents say respectively that a is monic and that it is a cover. So it 
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is an isomorphism, and we may take f to be the composite 


ese pe Sp: 
It is straightforward to verify that this is the unique morphism with the required 
property. 0O 


We remark that, although we assumed for simplicity that C was regular in 
1.3.12, we could have obtained the same result for cartesian categories: the second 
sequent in the statement is cartesian relative to the theory whose only axiom 
is the first one. The same remark applies to the sequents characterizing finite 
limits in 1.3.11. 

As an application of Proposition 1.3.12, we show how the proofs of (the hard 
parts of) Propositions A1.3.4 and A1.4.3 can be recast in terms of the internal 
language. Although these proofs are not different in any essential way from the 
ones we gave in Chapter A1, they look much more familiar, because they are 
much more closely modelled on what we should do in order to prove the results 
in Set (that is, we should construct a binary relation and then prove that it is 
the graph of a function with the required properties). 


Lemma 1.3.13 In a regular category, every cover is a coequalizer. 


Proof Let f: A — B bea cover in a regular category C, and let (a,b): R 3 A 
be its kernel-pair. We have to show that f is a coequalizer of a and b. So let 
c: Å — C be any morphism with ca = cb, and consider the formula 


b = (Sx)((f(z) = y) A (e(z) = 2)) 


having free variables y, z of sorts B,C respectively. We shall show that ¢ satisfies 
the sequents of 1.3.12, and so defines a morphism B — C which is the required 
factorization of c through f. First we observe that 


(PA ol2'/z2]) F Gx, 2’)((F(@) = f(x’)) A (e(z) = 2) A (e(2’) = 2’) 
F (dw)((e(a(w)) = z) A (c(b(w)) = 


F (z= 2"), 


where w is of sort R, and the second step uses the characterization of pullbacks 
in 1.3.11(vi). For the second sequent, we have 


T F (ae)(f(x) = y) 

F (Sr) (f(x) = y) A (Az)(e(x) = z)) 
F (32)(3x)((f (£) = y) A (e(z) = 2)) 
= (dz)¢, 


where we have made use of the Frobenius axiom at the third step. 0O 
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Lemma 1.3.14 Letm,: A; — A and mg: A2 — A be subobjects in a coherent 
category C. Then the square 


nı 
Aı Ag >—> Ay 
n2 Pı 


p2 
Ag >——> A, U Ag 
is a pushout in C. 


Proof As in the original proof of A1.4.3, we shall simplify notation by writing 
B,C for A; N Ag and A, U Ag respectively. Suppose we are given morphisms 
fi: A; > D (i = 1,2) with fin; = feng; so we need to construct a morphism 
k: C — D with kp; = fi (i = 1,2). As before, we do so by constructing a suitable 
coherent formula ¢ in the internal language and applying 1.3.12. Specifically, 
we take 


$ = ((3xı)((pı (£1) = 2) A (fi (z1) = w)) V (Are) ((po(r2) = z) A (f2(x2) = w))). 


Then by the distributive axiom (¢ A ¢[w’/w]) entails the disjunction of four 
formulae, namely 


(Ax, )(Sx5)((pi(wi) = py (a5) A (Filzi) = w) A (F)(@5) = w’) 


for 2,7 = 1,2. For the two cases where i = j we use the fact that p; is a monomor- 
phism (and 1.3.11 (iii)) to conclude that (x; = x}) and hence that (w = w’). In 
the other two cases, we use the fact that the square in the statement is a pull- 
back (and 1.3.11(vi)) to conclude (4y)((ni(y) = vi) A (n,(y) = x)) and hence 
(w = w’). 

To verify that ¢ satisfies the second sequent of 1.3.12 is straightforward, since 
C is the union of A; and A» and so we have 


(T Fz ((Ati)(pi(@1) = 2) V (Ax2)(p2(r2) = 2))) 


from which (T H, (Sw)@) follows easily. 
Hence we have a morphism k: C — D with [z,w.¢é] = [z,w.(k(z) = w)]. 
Since each p, is monic, it is easy to deduce that 


[z;, w. olp: (21)/2]] = [zri w. (fila) = w), 


and hence that kp; = f; (i = 1,2). The fact that k is the unique morphism 
with this property follows from the fact that C is the union of its subobjects A1 
and Ag. O 
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Suggestions for further reading: Bénabou [100], Coste [258], Johnstone [508], 
Makkai & Reyes [790]. 


D1.4 Syntactic categories 


In 1.3.2 we proved a Soundness Theorem, asserting that ‘anything provable is 
true’; i.e. if a sequent o is derivable from the axioms of a theory T (in any of 
the fragments of logic we have considered), then it is satisfied in all T-models in 
categories of the appropriate kind. Naturally, we wish to have a converse result 
asserting that ‘anything true is provable’; such a result is known to logicians as 
a Completeness Theorem. In this section we shall prove such a theorem. 

One advantage of considering models in general categories, rather than just 
in Set, is that the theorem has a particularly simple form: given a theory T, 
we shall construct a category Cr of the appropriate kind (the syntactic category 
of T) containing a particular T-model Mr (the generic model of T) with the 
property that the sequents valid in Mr are precisely those provable in T. (We 
shall see in the next section how a more conventional completeness theorem in 
terms of T-models in Set may be extracted from this one.) 

The basic idea behind the construction of Cr is that its objects are precisely 
those things which have to be present in any category (of the appropriate kind) 
containing a model of T. More specifically, every object of Cr will be the interpre- 
tation in Mr of some formula-in-context of the language of T, and the morphisms 
of Cr will be exactly those morphisms that T ‘tells us’ must exist between these 
objects. For a more formal definition, we shall deal first with the case of a carte- 
sian theory T, and indicate afterwards how things must be modified for more 
complex theories. 

So, for the moment, let T be a cartesian theory (over a signature X, say). We 
define the objects of Cr to be a-equivalence classes of formulae-in-context over 
& which are cartesian relative to T; we shall write {z.} for the a-equivalence 
class of 7T.. Here ‘a-equivalence’ means something a little broader than it did 
in Section D1.1: in addition to the bound variables of ¢, we wish to be free to 
rename the variables in the context 7, i.e. to regard #.¢ as equivalent to 7. 6|y/Z} 
if y is a context of the same length and type as z. (In effect, the context itself 
is regarded as binding the variables which appear in it. Note, incidentally, that 
if M is any T-model in a cartesian category, then the rules of 1.2.6 ensure that 
the objects [.@]ar and [7.¢[¥/Z]] 1 are equal (and not merely isomorphic).) 

In defining what are the morphisms of Cr from {z.@} to {F.Y}, we may thus 
assume (and it is convenient to do so) that the contexts 7 and y¥ are disjoint. 
Suppose now that we have a cartesian formula 6, whose free variables are all in 
the context Z, 7, such that the sequents 
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are provable in T. (Here, as before, (J = Z) is shorthand for the conjunction 
((y. = 21) A ++: A (Ym = 2m)), and (4y) similarly means (Ayı) - -+ (Ayn). Note, 
by the way, that the provability of the second sequent ensures that the third 
is cartesian relative to T.) Then in any T-model M in a cartesian category C, 
the interpretation [Z, 7.6], will (by soundness) be the graph of a morphism 
from [z.d]x to [y-Y]m. In particular, this must be so in the model Mr, so 
the formula-in-context 7, 7.6 must represent a morphism {Z.¢} — {F.Y} in Cr. 
However, it is clear that two formulae 0 and 6’ must represent the same morphism 
if they are provably equivalent in T, as defined in 1.3.7(a). (We remark in passing 
that, if 0 and 0’ are ‘functional’ in the sense that the sequents displayed above 
are provable, then either of the sequents (0 + 6’) and (@’ | 6) implies the other. 
This is the logical version of an assertion whose categorical formulation may be 
found at A3.2.3(ii).) 

Thus we take the morphisms of Cr to be T-provable-equivalence classes 
of formulae-in-context which are T-provably functional, i.e. satisfy the three 
sequents displayed above. We shall write [6] for the provable-equivalence class 
of 0; we may safely omit its context, since the latter can be inferred from the 
objects which appear as the domain and codomain of [6]. 

The composite of two morphisms 


[0] h] 


{7.9} — {7.4} —> {7.x} 


is defined to be the T-provable-equivalence class of the formula (3y)(y A 0); it 
follows from the remark after 1.3.8 that this does not depend on the choice of the 
representatives 0 and y, and it is straightforward to verify that it is (cartesian 
and) T-provably functional from {z.¢} to {Z.x}. It is similarly easy to verify 
that this composition is associative (up to provable equivalence), and that the 
provable-equivalence class of 


pf 


(g^ (z= 2’))| 


{z . 6} —— > {v . el /z]} 


serves as the identity morphism on {z.¢} (note the use we have made of 
a@-equivalence; here x’ is assumed to be a context of the same length and type 
as Z, but disjoint from it); so we have 


Lemma 1.4.1 Cr is a category. CO 


Next, we prove 
Lemma 1.4.2 Cr has finite limits. 


Proof The object {[].T} is a terminal object: a formula 0 is provably func- 
tional from {z.} to {[]. T} iff @ is provably equivalent to ¢, so there is a unique 
morphism {Z.¢} — {[].T} in Cr. The product of two objects {Z.¢} and {9.4} 
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(where 7 and 7 are assumed to be disjoint) is {%,7.(¢@Aw)}, and the product 
projections are the provable-equivalence classes 


a a  [(GAMAE=8))) | KAya 4 
{x’.p|x'/z]} c {#, 9. (6 nh ———> y vly’/a}} . 
Given morphisms [6]: {7.x} — {%.¢} and fy]: {7.x} — {F.Y}, the induced 
morphism into the product is the provable-equivalence class of (0 A y); it is 
again straightforward to verify that this is the unique morphism whose com- 
posites with the projections are [0] and [y]. To form the equalizer of a parallel 

pair 
[6] 


{2.0} Z3 {7.4}, 


ly] 


we take the object {x . (39) (0la/Z] A y[x’/Z])} and the morphism from this to 
{Z.¢} given by the provable-equivalence class of (49)(9 A y A(x’ = #)). If 
[ô]: {2.x} — {Z.¢} is a morphism having equal composites with [6] and [y], 
then its factorization through the equalizer is simply [5[z’/z]]. As before, the 
verification that all this works is tedious but straightforward. 0 


Remark 1.4.3 Note that the construction of equalizers in the above proof 
requires a choice of representatives for the equivalence classes [0] and fy]; if we 
make a different choice, we will get a different — though isomorphic — object as 
the domain of our equalizer. Thus, unless we assume that we have some means of 
making such a choice — for example, that we have a well-ordering of the formulae 
in our language, which will be the case if our signature is countable — we cannot 
assert that Cr has equalizers in the constructive sense on which we insisted in 
Section Al.2. The problem could be avoided by factoring the objects of Cr by 
provable equivalence as well as a-equivalence; but this tends to become very 
messy in practice, and it simply means that the arbitrary choices have to be 
made elsewhere, namely in the proof of 1.4.7 below. 


Much as in 1.3.11, properties of objects and morphisms in Cr are reflected in 
the T-provability of the appropriate sequents: 


Lemma 1.4.4 
(i) A morphism [6]: {Z.¢} — {J.Y} is an isomorphism in Cr iff is also 
T-provably functional from {y.} to {Z.o}. 


(ii) Any object of Cr is isomorphic to one of the form {Z.ġ} where ¢ is 
quantifier-free (that is, a finite conjunction of atomic formulae). 


(iii) [6] is a monomorphism in Cr iff the sequent 
((8 A 6[2'/z]) + (Z = 2")) 


is provable in T. 
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(iv) Any subobject of {Z.ġ} in Cr is isomorphic to one of the form 


Ib A (Z = x’))] 


{x' ylh e A.A) 


where w is a formula such that the sequent (y tz ġ) is provable in T. 
Moreover, for two such subobjects (corresponding to formulae y and x), 
we have {T.y} < {%.x} in Sub({Z.¢}) iff the sequent (y tz x) ts provable 
in T. 


Proof (i) If ð is provably functional in both directions, then 
10]: {¥.~} — {Z.¢} is easily seen to be inverse to [6]: {¢.¢} — {F.Y} in Cr. Con- 
versely, if [0] has an inverse |y], then one may verify that y must be T-provably 
equivalent to 0; so @ is functional in both directions. 

(ii) It is clear that if @ and @’ are provably equivalent formulae in some 
context F, then {Z.@} and {Z.¢’} are isomorphic objects of Cr; so by 1.3.8(i) we 
may reduce to the case where ¢ has all its quantifiers at the front. It thus suffices 
to prove that if ọ = (Sy)y is a cartesian formula (relative to T) in a context Z, 
then the objects {%.¢} and {Z,y.y} are isomorphic in Cr. But it follows easily 
from (i) and the fact that ¢ is cartesian that 


(yA (Z = 2’) 


{Z,y-} Lo {2 . piz /z]} 


is an isomorphism. 

(iii) From the proof of 1.4.2, we may construct the kernel-pair of [0] as the 
object {7, z. (OA Oa’ /z|)}. Then the provability of the given sequent is equiva- 
lent, by (i), to the assertion that the diagonal map from {%.¢} to this object is 
an isomorphism. 

(iv) By (iii), any morphism of the displayed form is clearly monic. But if 
we are given an arbitrary monomorphism [6]: {7.w} — {z.¢}, then (iii) ensures 
that the formula-in-context Z.(4y)@ is cartesian relative to T, and (i) ensures 
that [6] is an isomorphism (over {%.¢}) from {y.y} to {%.(4y)6}. So every 
subobject is isomorphic to one of the displayed form. For the final assertion, we 
have merely to note that [(w A (Z = 2’))]: {x . yla /z]} — {#.x} is a morphism 
of Cr iff (Y F x) is provable in T; and it is the only possible morphism over {z.¢} 
between these two objects. CO 


We have a canonical &-structure Mr in Cr: it assigns to a sort A the object 
{x.T} where z is a variable of sort A, to a function symbol f: A,--- A, — B 
the morphism 


[(f(21,---,%n) = y)| 
{r1,...,2n.1} — [y.T} 
and to a relation symbol R — A,--- A, the subobject of {7),...,2,.T} whose 


domain is {x1,...,2n-R(x1,...,2n)} (as in 1.4.4(iv)). It is now easy to establish 
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Lemma 1.4.5 

(i) For any term-in-context T.t over X, the interpretation [T.t] in Mr is the 

morphism 
[(t(@) = y)] 
{z.T } —————} {y.T}. 

(ii) For any cartesian formula-in-context Z. over X, the interpretation [z. d] 
in Mr is the subobject {Z.p} — {Z.T} (as in 1.4.4(iii) ). 

(iii) A cartesian sequent (ġ Fz Y) is satisfied in Mr iff it is provable in T. 


Proof (i) and (ii) are straightforward inductions over the structure of t and 
@. (iii) now follows immediately from (ii) and 1.4.4(iv). o 


Theorem 1.4.6 (Completeness Theorem for cartesian logic) Let T be 
a cartesian theory. If a T-cartesian sequent o is satisfied in all models of T in 
cartesian categories, then it is provable in T. 


Proof From 1.4.5(iii), we deduce in particular that Mr is a model of T. So a 
sequent satisfied in all T-models must in particular be satisfied in Mr, and hence 
must be provable in T. o 


The generic model Mr is more than a convenient tool for proving the com- 
pleteness theorem, however: it also has an important universal property, namely 
that all T-models are images of it under cartesian functors. More formally, we 
have the following result (where we write Cart (C, D) for the category of cartesian 
functors from C to D, and all natural transformations between them): 


Theorem 1.4.7 LetT be a cartesian theory. Then, for any cartesian category 
D, the functor 
Cart (Cr, D) —> T-Mod(D) 


which sends F: Cr > D to F(Mr) is (part of) an equivalence of categories. 


Proof We saw in 1.2.13 that the image of a T-model under a cartesian functor 
is a T-model, so the indicated functor does take values in T-Mod(D). Conversely, 
given a T-model M in D, we define Fm: Cr > D by setting 


Fu ({z.0}) = [Z-¢] ar 


for each formula-in-context Z.@, and taking Fm ([0]) to be the morphism whose 
graph is the interpretation [z7, 7.6] ,. (By soundness, this does not depend on 
the choice of the representative 0.) It is straightforward to verify that Fm is a 
functor, and (given the description of finite limits in Cr in the proof of 1.4.2) that 
it is cartesian. Moreover, if h: M — N is a homomorphism of T-models in D, 
then 1.2.9 ensures that we have morphisms [Z.¢]u — [¢.¢] for every formula- 
in-context, and these form a natural transformation Fy — Fy. So M | Fm 
becomes a functor T-Mod(D) — €att (Cr, D). 
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It is clear that Fu(Mr) = M, and that this isomorphism is natural in M. 
Conversely, if we are given any cartesian functor F': Cr — D with F(Mr) = 
M, then an easy induction over the structure of ¢@ produces isomorphisms 
F({Z.¢}) S [¥.¢]m, which are the components of a natural isomorphism 
F = Fy. And this isomorphism is itself natural in F; so we have the required 


equivalence of categories. oa 


Thus we may identify models of a cartesian theory T, up to isomorphism, 
with cartesian functors defined on Cr. In the converse direction, we have 


Example 1.4.8 For any small cartesian category C, there is a cartesian the- 
ory whose models in an arbitrary cartesian category D correspond to cartesian 
functors C — D. To construct it, we take the canonical signature ic, as defined 
before 1.3.11 (but restricted to unary function symbols, and no relation sym- 
bols), and write down all the sequents which occur in parts (i), (ii), (v) and 
(vi) of 1.3.11 (for all identity morphisms, composable pairs, terminal objects 
and pullback squares in C). It is easy to see that a structure for Uc in D which 
satisfies the sequents of 1.3.11(i) and (ii) is the same thing as a functor C — D, 
and that it satisfies the sequents of 1.3.11(v) and (vi) iff it preserves the terminal 
object and pullbacks, i.e. iff it is cartesian. 


So, up to equivalence of model categories, cartesian theories are ‘the same 
thing as’ small cartesian categories. This is the basis for the notion of Morita 
equivalence at which we have hinted earlier: we say two cartesian theories T and 
T’ are Morita-equivalent if their syntactic categories Cr and Cr are equivalent. By 
1.4.7, this is equivalent to saying that their categories of models in any cartesian 
category D are equivalent, in a fashion which is ‘natural in D’. (The term ‘Morita 
equivalence’ is borrowed from ring theory, where it is used to describe the relation 
between two rings which says that their categories of modules are equivalent; the 
analogy between modules over a ring and models of a theory is often a fruitful 
one.) 

In passing, we note the following result about Morita equivalence, which 
shows that (for fragments of logic containing cartesian logic) there was some 
redundancy in Definition 1.1.1, in that we could have done without either the 
relation symbols or the function symbols. 


Lemma 1.4.9 For any cartesian theory T, there are cartesian theories T’ and 
T”, Morita-equivalent to T, such that the signature of T' has no relation symbols, 
and that of T” has no function symbols. 


Proof For T’, we could simply take the theory constructed as in 1.4.8 from 
the syntactic category Cr. Alternatively, we may form a new signature ©’ from 
that of T, by replacing each relation symbol R — A,--- A, by a new sort [R] 
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together with n function symbols p;: |R] — A;. Then T’ contains the axiom 


( Notv) = pi(w’)) Fw w (w = w’)) 


¿=1 


& 
II 


(where w and w are variables of sort [R]), for each such R; and whenever 
a subformula R(t1,...,t,) appears in an axiom of T, we replace it by the 
subformula 


(3w) A (pilw) = ti) 


i=1 


(note that the sequent displayed above ensures that this formula is cartesian). 
It is straightforward to verify that every T-model in a cartesian category can be 
given a }/-structure which makes it a T’-model, and vice versa. 

For T”, we use the device of replacing functions by the relations which 
are their graphs: that is, we replace each function symbol f: A- An > B 
in the signature of T by a new relation symbol Ry — A+: An, B, for which 
we adopt the axioms ((Rj(£1,.-., 8n, Y) A Rg (21,---,2n,y’)) F (y = y’)) and 
(T H (ay) Re (x1,.--,2n,y)) (note that the second of these is cartesian relative 
to the first). Wherever we have a term of the form f(z1,...,2n) in a formula 4, 
we may replace the latter by 


(Ay)(Ry (21, tees Tns y) N ¢') 


where ¢’ is obtained from ¢ on replacing the term f(z,,...,2n) by the (new) 
variable y (note that this is a cartesian formula if ¢ is). Applying this procedure 
repeatedly, we may eliminate all terms other than variables from the axioms of 
T; so we arrive at a theory T”, over a signature without function symbols, whose 
models in any cartesian category are again easily seen to be ‘the same as’ models 
of T. O 


We remark that cartesian logic is the smallest fragment of first-order logic for 
which this result is true: the expressive power of Horn logic depends nontrivially 
on whether we allow our signature to contain function symbols, relation symbols 
or both — cf. 2.4.4 and 2.4.5 below. 

We now consider how the foregoing has to be modified for theories in richer 
fragments of first-order logic. If T is a regular (resp. coherent, first-order, geo- 
metric) theory, we shall write Cyë (resp. Coo, CP, C#°™) for the category 
constructed exactly as at the beginning of this section, but allowing the for- 
mulae in the definition of both objects and morphisms to range over arbitrary 
regular (resp. coherent, first-order, geometric) formulae over the relevant sig- 
nature. The proofs of 1.4.1, 1.4.2 and 1.4.4 remain valid (except for part (ii) 


848 FIRST-ORDER CATEGORICAL LOGIC D1.4 


of 1.4.4), and we have 


Lemma 1.4.10 
(i) For any regular theory T, Cņ* is a regular category. 
(ii) For any coherent theory T, C&°" is a coherent category. 
(iii) For any first-order theory T, C is a Heyting category. 


(iv) For any geometric theory T, CE°°™ is a geometric category. 


Proof (i) Given a morphism [0]: {Z.¢} > {F.Y} in Cr’, 1.4.4(iv) implies that 
we may form its image as the subobject {7.(4z)0} of {¥.w}; in particular, [6] 
is a cover iff the sequent (Y tg (47)0) is provable in T. The fact that covers are 
stable under pullback follows straightforwardly from the construction of finite 
limits in the proof of 1.4.2. 

(ii) Images in C¢°" are constructed exactly as in Cy}; to form the union of 
two subobjects {Z.w} and {7.x} of {£.¢}, we of course form {%.(w V x)}, and 
the least subobject is {7.1}. It is again easy to verify from 1.4.2 that these 
constructions are stable under pullback. 

(iii) The coherent structure on Cf is constructed as in (ii). To construct 
universal quantification along a morphism [0]: {%.¢} — {F.Y}, let {Z%.x} be 
a subobject of its domain, in the canonical form of 1.4.4(iv). Then we take 
Vie ({Z-x}) to be the subobject {7.(p A (YZ)(0=x))} of {7.y}; it is again 
straightforward from 1.4.4(iv) to verify that this works. 

(iv) is simply the infinitary version of (ii), except that we have to verify that 
ce°™ (which is no longer small, because we have a proper class of geometric 
formulae-in-context) is well-powered. The key to this is the observation made in 
1.3.8(ii) that any geometric formula is provably equivalent to a disjunction of 
regular formulae; since there is only a set of regular formulae in a given context, 
it follows that there is only a set of provable-equivalence classes of geometric 
formulae in a given context. This establishes the local smallness of C#°°™ directly 
from the definition of its morphisms; its well-poweredness follows from 1.4.4(iv). 


0O 


Lemma 1.4.5 now extends straightforwardly from cartesian formulae to all 
formulae in the fragment of the language under consideration, and so we obtain 
the analogue of 1.4.6: 


Theorem 1.4.11 (Completeness Theorem) LetT be a regular (resp. coher- 
ent, first-order, geometric) theory. If o is a regular (resp. coherent, first-order, 
geometric) sequent which is satisfied in all models of T in regular (resp. coherent, 
Heyting, geometric) categories, then it is provable in T. 


The analogue of 1.4.7 also works straightforwardly for regular, coherent and 
geometric theories. For first-order theories it breaks down, because 1.2.9 fails for 
non-geometric formulae: thus natural transformations between functors C + D 
correspond not to arbitrary homomorphisms of T-models in D, but to elementary 
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morphisms as we defined them in 1.2.10(a). (In particular, if D is Boolean, they 
correspond to elementary embeddings.) 

In passing, we remark that if T happens to be a cartesian theory, then C;* 
is (equivalent to) the regularization Reg(Cr) of the cartesian syntactic category 
of T, as constructed in A1.3.9. This follows easily from the fact that regular 
functors defined on C;® correspond (to models of T in the codomain category, 
and hence) to cartesian functors defined on Cr. 

Example 1.4.8 extends in a straightforward way to categories with the richer 
structure now under consideration. For example, to obtain the theory of regular 
functors defined on C, we simply add the sequent appearing in 1.3.11(iv), for 
each cover f: A — B in C, to the theory of 1.4.8, and for the coherent case we 
add the sequents (T +, 1) where z is a variable of sort 0, and 


(T Fe ((Ay1)(mi (y1) = x) V (Ay2)(ma(y2) = x))) 


whenever the sort A of x is the union of two subobjects m;: A; — A (i = 1, 2). 
Lemma 1.4.9 also extends to all the richer fragments of logic. 

In general, the category Cņ ° need not be effective regular, as defined in 
Section A1.3; nor need Cooh be positive coherent. However, we developed means 
of effectivizing regular categories, and of positivizing coherent ones, in A3.3.10 
and A1.4.5 respectively, and so we may obtain 


Proposition 1.4.12 

(i) For any regular theory T, there exists an effective regular category Er 
and a T-model My in Ey which is generic among T-models in effective regular 
categories, in the sense that we have an equivalence 


Reg (Er, D) ~ T-Mod(D) 


sending F to F(M T), for any effective regular category D. Moreover, the regular 
sequents satisfied in My are exactly those provable in T. 

(ii) For any coherent theory T, there is a pretopos Py and a T-model Mr 
in Py which is generic among T- models in pretoposes. Moreover the coherent 
sequents satisfied in My are exactly those provable in T. 

(iii) For any geometric theory T, there is an oo-pretopos Gr containing a 
model of T which is generic among T-models in oo-pretoposes, and in which 
satisfaction for geometric sequents is equivalent to provability in T. 


Proof (i) Take Er to be the effectivization Eff(C;*), and Mr to be the image 
of Mr under the canonical embedding. The fact that this embedding is full, and 
therefore conservative, ensures by 1.2.13 that satisfaction in Mr is equivalent to 
satisfaction in Mr; and the generic property of Mr follows from combining (the 
‘regular’ version of) 1.4.7 with A3.3.10. 

(ii) Similarly, we take Pr to be Eff(Pos(C¢°")), where Pos is the posi- 
tivization functor defined in A1.4.5. Once again, the canonical coherent functor 
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Cook — Pr is full and faithful, and therefore conservative; the remaining details 
are similar to (i). 

(iii) is similar to (ii), except that we use the infinitary analogue of the 
positivization construction, mentioned at the end of Section Al.4. Note that 
this construction, and the effectivization construction, do not destroy well- 
poweredness. o 


Since the embeddings of the syntactic categories into the categories con- 
sidered in 1.4.12 are full and therefore conservative, it follows from 1.2.13 
that we have a completeness theorem for regular (resp. coherent, geometric) 
logic involving only models in effective regular categories (resp. pretoposes, 
oo-pretoposes). 

We define Morita equivalence for regular (resp. coherent, geometric) theories 
to mean equivalence of the categories Er (resp. Pr, Gr), rather than of the syn- 
tactic categories themselves. Of course, this is a weaker notion than equivalence 
of the syntactic categories; but it suffices to ensure that the theories have equiv- 
alent model categories in any (cocomplete) topos, which is our main concern. 
(We shall see in 3.1.12 and 3.3.8 below that the converse is true.) 

Next we note a further simplification of the notion of signature, analogous to 
1.4.9, which may be made in the context of geometric logic. 


Lemma 1.4.13 Let T be a geometric theory. Then there is a geometric theory 
T’, Morita-equivalent to T, whose signature has only one sort. 


Proof First we assume, by using 1.4.9 if necessary, that the signature X of 
T has no function symbols. Let (A; | i € J) be the sorts of ©; then our new 
signature has a single sort A and a family of unary relation symbols U; — A 
corresponding to the sorts of ©, together with a relation symbol R — A---A 
for each relation symbol R — A,--- A, of ©. Among the axioms of T’, we have 
((U;(x)AU;(x)) Fz L) whenever i 4 j, and (T Fz Vier Ui(r)), which say that the 
interpretation [z.T] in a T’-model is the disjoint coproduct of the interpretations 
[z.U;(x)]. Then, for each R — A,--- An in È, we adopt the axiom 


(Roper a N U;(z:)), 
i1=1 


and finally each axiom of T gives rise to an axiom of T’ by putting lines over all 
the relation symbols, and introducing, on the left of each sequent, the additional 
hypotheses that the variables in its context satisfy the new unary relations corre- 
sponding to the sorts which they had in ©. It is then easy to see that a T-model 
M in an oo-pretopos G gives rise to a T’-model whose underlying object is the 
coproduct of the M A;, and conversely; so the oo-pretoposes Gr and Gr must be 
equivalent. 


We remark that, in contrast to 1.4.9, geometric logic is not the smallest 
fragment of infinitary first-order logic for which this reduction is possible; we 
could have done the same thing in the disjunctive logic mentioned in 1.3.6. 
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Remark 1.4.14 Before leaving this section, we should briefly discuss the 
syntactic categories of propositional theories as defined in 1.1.7(m). Since a 
propositional theory T has only one (empty) context, it is easy to see that Cr 
is a preorder: there exists a morphism {{].¢} — {{].w} iff ( Fi) Y) is provable 
in T, and the only such morphism is then [¢]. Thus Cr is equivalent to what 
is commonly called the Lindenbaum algebra of T, that is the poset of provable- 
equivalence classes of formulae, ordered by provable entailment. (For a general 
T, the Lindenbaum algebra — defined as above, but restricted to formulae in the 
empty context, i.e. to sentences — appears as the poset of subterminal objects 
in Cr.) The structure carried by the Lindenbaum algebra of course depends on 
the level of logical complexity at which T is considered to live: if T is a cartesian 
theory then it is a meet-semilattice, if T is coherent then it is a distributive 
lattice, if T is first-order then it is a Heyting algebra, and if T is geometric 
then it is a frame (the order-theoretic operations in each case being induced by 
the corresponding logical operations on formulae). Moreover, it is easily seen 
that if L is a meet-semilattice (resp. a distributive lattice, a frame) and C is 
an arbitrary cartesian (resp. coherent, geometric) category, then cartesian (resp. 
coherent, geometric) functors L — C correspond bijectively to models in C of 
the theory of filters (resp. prime filters, completely prime filters) of L, as defined 
in 1.1.7(m) (note that, since such functors preserve monomorphisms, they all 
take values in the poset of subterminal objects of C). Thus we may conclude 
that every cartesian (resp. coherent, geometric) propositional theory is Morita- 
equivalent to the theory of filters (resp. prime filters, completely prime filters) of 
a meet-semilattice (resp. a distributive lattice, a frame). 


Remark 1.4.15 The sequents appearing in the definition of morphisms of Cr, 
earlier in this section, may have reminded readers of the construction of the 
topos Set(Z) from the allegory Mat(L) in Section A3.3. This is no accident: 
for fragments of logic containing regular logic, the syntactic category may con- 
veniently be constructed by way of a ‘syntactic allegory’, which is in some ways 
a more natural generalization of the Lindenbaum algebra mentioned in the pre- 
ceding remark than the syntactic category itself. Formally, if T is (say) a regular 
theory, we take Ar” to be the allegory whose objects are a-equivalence classes 
of contexts over the signature of T, and whose morphisms from 7 to 7 (where 
these contexts are presumed disjoint) are T-provable equivalence classes of reg- 
ular formulae in the concatenated context 7, y. These morphisms are ordered by 
T-provable entailment (i.e. [Ø] < [y] iff ($ Fz,g Y) is provable in T); the opposite 
of [g] is simply [d] in the re-ordered context 7, Z; and the composite of [¢]: z + 7 
and fy]: 7 % Zis [(Ay¥)(PA y)]. It is readily verified that this is an allegory, and 
that it has a unit (the empty context) and is pre-tabular (the top element of 
Ar” (,¥) is [T], and it is tabulated by 
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So by A3.3.6(ii) and A3.2.10 Map(A7;*[K]) is a regular category, which is 
simply Cp as we have defined it in this section (and Map(A;°[S]) is its 
effectivization Er). Similarly, for a coherent (resp. first-order, geometric) the- 
ory T we obtain a union (resp. division, geometric) allegory A> (resp. Af, 
Ane which can be used to construct the appropriate syntactic category in 
each case. Unfortunately, however, this method will not work for cartesian theo- 
ries, since by A3.1.1 we cannot hope to obtain a non-regular cartesian category 
as the category of maps of an allegory. 


Suggestions for further reading: Freyd et al. [377], Makkai & Reyes [790]. 


D1.5 Classical completeness 


In the previous section we proved completeness theorems (1.4.6 and 1.4.11) in 
the form: for any theory T, there is a single model Mr such that satisfaction in 
Mr is equivalent to provability in T. Despite the evident neatness of this form of 
the theorem, the fact that Mr lives (not in Set but) in a rather ‘exotic’ category, 
constructed specially for the purpose of proving the theorem, is sometimes seen 
as a disadvantage. In ‘classical’ accounts of first-order logic, the completeness 
theorem involves only models in the ‘pre-existing’ category Set; so it is of interest 
to see whether such a theorem can be derived from the one we have proved. 

For cartesian theories, the answer is an immediate ‘yes’: we know from A1.2.2 
that for any small cartesian category C there is a jointly-conservative family of 
cartesian functors C — Set, namely the representable functors C(A,—), A € 
ob C. Applying this to Cr, and appealing to 1.2.13, we immediately obtain 


Proposition 1.5.1 Let T be a cartesian theory, and o a cartesian sequent 
relative to T. If o is satisfied in all T-models in Set, then it is provable in T. 


Proof If ø is satisfied in the T-models which correspond via 1.4.7 to the 
representable functors, then it is satisfied in Mr by 1.2.13, and hence provable 
in T by 1.4.6. EJ 


Our strategy for proving the classical completeness theorems for regular and 
coherent logic will be similar to the above: we shall show that for every small reg- 
ular (resp. coherent) category C, there is a (set-indexed) family of regular (resp. 
coherent) functors C — Set which is jointly conservative. There are various ways 
of achieving this: we shall follow an approach due to P. Freyd (see [381]), which 
has the advantage of minimizing the amount of duplication of effort required for 
the two cases. 

We recall the notion of capital regular category introduced in A1.3.8: our 
first aim is to show that every small regular category C admits a conservative 
regular functor to a (small) capital regular category C. 

We define a base to be a finite sequence B = (Bı, Bo,..., Bn) of distinct well- 
supported objects of C. (The requirement that the B; be distinct seems rather 
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bizarre, given that it doesn’t matter how many isomorphic copies of any given 
object there are in C; but it appears to be essential for technical reasons.) Given 
a base B, we write |] B for the (canonical) product Bı x --- x Bn in C; the need 
to have a well-defined product is the only reason for ordering the members of a 
base. We preorder the set of bases by setting B < C if every object in B appears 
in C (but not necessarily in the same order); clearly, the preordered set of bases 
is directed. If B < C and we are given an object f: A— BEz of C/ II 5, we 
write C*A for the product A x [[ C} (where j runs over those indices such that 
C} is not in B), i.e. the pullback of A along the appropriate product projection 
[IČ — [| Ø, and we write C* f: C*A > IČ for the pullback of f along this 
projection. 

We now define a category C; as follows. Its objects are triples (A, Bf ) where 
A is an object of C, B is a base and f: A— [Į B. Morphisms (A,B, f) > 
(A’, B'f’ ) are represented by pairs (C g) where Č is a base containing both B 
and B’ and g: C*f > © f' in C/ [| Č, modulo the equivalence relation which 
identifies (Č, g) and (Č, 9g’) if there is a base C” containing both C and Č, 
such that g and g’ pull back to the same morphism in C/ æ. (Recall that 
pullback along a cover is conservative by A1.3.2(iii); so the pairs (C,g) and 
(C, g’) represent the same morphism iff g = g’, and more importantly (C, g) 
is an isomorphism in C; iff g is an isomorphism in C/ || C.) Composition and 
identity morphisms are defined in the obvious way; it is clear that C ‘1 is a category. 

If B < C, then the object (A, B, f) is isomorphic to (C* A, C, C* f) in Cı (the 
isomorphism being represented by (C, 1q.,) in each direction). Hence if, for a 
fixed B, we consider the (non-full) subcategory C,|B of Cı whose objects are 
those (A, B', f) with B' < B, and whose morphisms are representable by pairs 
of the form (B, g), we see that C,|B is equivalent to C/ I] B (the equivalence 
sends (A, B', f) to B*f in one direction, and f to (dom f, B, f) in the other). 
In particular, clf] _(where [] denotes the empty base) is equivalent to C itself. 
Moreover, if B < C, then this equivalence identifies the inclusion C,|B C C,|C 
with the pullback functor C*: C/T] B > C/T] Č. 

But Cı is clearly the union of the subcategories G |B, as B ranges over all 
bases; so we have expressed it as a directed union of regular subcategories and 
conservative regular functors between them. Thus we obtain 


Proposition 1.5.2 For any small regular category C, there exists a small 
regular category Cı and a conservative regular functor T:C — Cı, such that 
if A’ — A is any proper subobject in C with A well-supported, there exists a 
morphism 1 — TA in Cı not factoring through TA’ — TA. Moreover, if C is 
(positive) coherent, then so is Cı and the functor T is coherent. 


Proof The construction of C}, and the proof that it is regular (and positive 
coherent if C is), is contained in the discussion above. The functor T is obtained 
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by identifying C with C,|[] C Cı; the fact that it is conservative follows from 
the fact that the functors C > C/ [| B are all conservative, since if Tf has an 
inverse in Cı then the latter must lie in Cı |B for some B. Finally, given A’ — A 
as in the statement of the proposition, we note that the singleton sequence (A) 
is a base, and that in C,|(A) ~ C/A we already have a morphism 1 — TA not 
factoring through TA’, namely the diagonal A — A x A regarded as a morphism 
14 — A*A in C/A. CJ 


The category Cı constructed in 1.5.2 will not in general be capital; for its 
well-supported objects will have proper subobjects which are not in the image 
of the functor T. However, we can achieve the capital property by iterating the 
construction just described: 


Theorem 1.5.3 For any small regular category C, there exists a small capital 
regular category C and a conservative regular functor F: C — C. Moreover, if C 
is (positive) coherent, then so is C and the functor F is coherent. 


Proof We construct a sequence of categories and functors 


To Tı Tz 
Co ————> C ———> C ———> >: 


where Co = C and Cn+1 is constructed from C, by the method of 1.5.2. We 
take Ĉ to be the (pseudo-)colimit of this sequence in Cat: explicitly, objects 
of Ĉ are pairs (n, A) with A € ob Cn, and morphisms (n, A) — (n', A’) are 
represented by pairs (m, f) where m > max {n,n’} and g is a morphism between 
the images of A and A’ in Cm, modulo the obvious equivalence relation. As 
before, the fact that the diagram is filtered ensures that its colimit inherits any 
finitary categorical structure (such as regularity or coherence) enjoyed by all the 
categories Cn and preserved by the transition functors Tn; and since all these 
functors are conservative, so are the induced functors C, — C for all n — in 
particular for n = 0. Finally, C is capital because, if f: A’ — A is any proper 
monomorphism in it with A well-supported, there exists n such that f lies in the 
image of the functor Cna — C; and then in C,,, we can find a morphism 1 — A 
which does not factor through f. O 


Corollary 1.5.4 
(i) For any small regular category C, there is a conservative regular functor 
C — Set/B for some set B. 


(ii) (Classical completeness for regular logic) If T is a regular theory. 
and o is a regular sequent over the signature of T which is satisfied in all 
T-models in Set, then o is provable in T. 
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Proof (i) Consider the family of composite functors 


Ce Gp Se 


where U ranges over the set of subterminal objects of C, F is the ‘capitalization’ 
of C/U constructed in 1.5.3, and F is the functor C/U (1,—). By A1.3.8, the 


terminal object of C/U is projective, so F (and hence also the composite above) 
is a regular functor. Moreover, if A’ — A is a proper subobject in C, let U — 1 
denote the support of A; then U*A’ — U*A is still proper, and U*A is well- 
supported in C/U, so (for this particular U) the composite above maps A’ — A 
to a proper monomorphism. Hence by A1.2.4 the family of all such composites is 
jointly conservative; equivalently, they can be combined into a single conservative 
functor C — Set/B where B is a set indexing the subterminal objects of C. 

(ii) We apply (i) to the syntactic category Cy” of T. If ø is satisfied in the 
Set-models of T which correspond to the functors in the conservative family 
constructed in (i), then by 1.2.13 it is satisfied in the generic model Mr, and 
hence provable in T. o 


Remark 1.5.5 As we mentioned after A1.3.8, there is a significant strength- 
ening of the first part of 1.5.4: every small regular category C admits a full and 
faithful regular functor into a functor category [D, Set], where D is a small cat- 
egory. However, the extra strength of this result does not yield any significant 
extra information about regular theories, and so we shall not prove it here. 


Remark 1.5.6 Itis of interest to ask when a (small) regular category C admits 
a single conservative regular functor to Set (i.e. when we can take B = 1 in 
1.5.4(i)). This will clearly be the case if 1 is the only subterminal object of C, 
i.e. if every object of C is well-supported; but it will also hold if (as in Set 
itself) the only proper subobject of 1 is a strict initial object 0, since then 
C/0 is degenerate and we can omit the functor corresponding to 0 from the 
family of 1.5.4(i) without affecting its conservativity. So we arrive at the sufficient 
condition that given any diagram of the form 


f g 
A ——> B >——__> 1 
inC, either f or g must be an isomorphism. But this condition holds in Set, and 
its truth is clearly reflected by conservative regular functors; so the condition is 
necessary as well as sufficient. 


For coherent categories, we have to do a bit more work; for the functors F 
used in the proof of 1.5.4(i), though regular, will not in general be coherent. The 
key to obtaining coherent Set-valued functors is the following result: 


Lemma 1.5.7 In a positive coherent category, a complemented subobject of a 
(cover-)projective object is projective. 
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Proof Let A’ — A be a complemented subobject of a projective object A, 
with complement A” — A. We have to show that every cover e: B —> A’ splits; 
but any such cover extends to a cover e HI 1: BI. A” + A’ II A” S A, and by 
disjointness of coproducts a splitting for this must map A’ into B, so it yields a 
splitting for e. o 


Thus, in a capital positive coherent category C, every complemented subter- 
minal object U is projective ~ equivalently, the functor C (U, —) is regular. Given 
such a category, let B denote the lattice (in fact a Boolean algebra) of comple- 
mented subterminal objects of C; and for any filter F C B, let Ty denote the 
(filtered) colimit (in [C, Set]) of the functors C (U, —), U € F. Explicitly, P(A) 
is the quotient of [[y~,-C(U, A) by the equivalence relation which identifies 
f:U — A with g: V — A if there exists a commutative square 


U 
| 
A 
with W e F. It is clear that T, as a filtered colimit of regular functors, is again 
regular. 


W >> 
g 
V ——> 


Lemma 1.5.8 The functor Tr constructed above is coherent iff F is an 
ultrafilter (i.e. a prime filter) in B. 


Proof Since C is positive coherent, Fp will be coherent iff it preserves finite 
coproducts. But I'¢ preserves 0 iff 0 ¢ F; and, given an element z of F(A B) 
(represented by a morphism f: U — ALU B, say), we can decompose U as a 
disjoint union f* A U f*B, and primeness says that just one of these will be in 
F, so x is in (the image of) just one of F(A) and [¢(B). Thus Tp preserves 
binary coproducts if F is prime; for the converse, note that a complemented 
subterminal object U satisfies .-(U) = 1 iff U € F. o 


Proposition 1.5.9 ZLetC be a small capital positive coherent category. Then 
the functors TF, as F ranges over all prime filters of complemented subterminal 
objects of C, are jointly conservative. 


Proof Let A’ — A be a proper subobject in C. Although A may not be 
well-supported, A H 1 is, and so we can find f: 1 — AII1 not factoring through 
A'TH 1> AN1. Pulling back along the first coprojection, we obtain a comple- 
mented subterminal object U and g: U — A not factoring through A’. Form the 
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further pullback 


A 


— A 


V 
| g 
U aA 


and let Z be the ideal of complemented subterminal objects contained in V. 
(Note that V itself need not be complemented.) Since U ¢ T, the Boolean Prime 
Ideal Theorem tells us that there is a prime filter F with U € F and FNAT = Ẹ; 
then g represents an element of F(A) which does not belong to F(A’). go 


Corollary 1.5.10 
(i) For any small coherent category C, there is a conservative coherent functor 
C — Set/B for some set B. 


(ii) (Classical completeness for coherent logic) [fT is a coherent theory, 
and o is a coherent sequent over the signature of T which is satisfied in 
all T-models in Set, then o is provable in T. 


Proof (i) Consider the composite functors 


ee i 
C ——+ Pos(c) “> Pos(C) —2-> Set 


where Pos(C) is the positivization of C constructed in A1.4.5, F is its capitaliza- 
tion as constructed in 1.5.3, and F ranges over all prime filters of complemented 


subterminal objects in Pos(C). By 1.5.3, Pos(C) is positive coherent as well 
as capital, so by 1.5.8 and 1.5.9 the functors Fp (and hence also the compos- 
ites above) are coherent and jointly conservative. (Recall that the embedding 
C — Pos(C) is full and hence conservative.) So we may combine them, as before, 
into a single conservative coherent functor C > Set/B, where B is a set indexing 
the filters F. 

(ii) follows from (i) exactly as 1.5.4(ii) followed from 1.5.4(i). oO 


Remark 1.5.11 Before proceeding further, we should make some comments 
about the use of the axiom of choice in the proofs of 1.5.4 and 1.5.10. Although 
the construction of 1.5.2 does not make any explicit use of choice, it does require 
the existence of a canonical choice of products in the category C from which 
we start. In the category Cı constructed in 1.5.2, products are constructed using 
pullbacks in C, so we will have a canonical choice of them provided C has a canon- 
ical choice of pullbacks; however, the construction of pullbacks in C4 requires the 
making of arbitrary choices, because of the equivalence relation involved in the 
definition of morphisms of Cı. Thus the iteration of the construction of 1.5.2, 
which is performed in the proof of 1.5.3, will eventually force us to make arbi- 
trary choices, unless we assume at the outset that we are given something like 
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a well-ordering of the objects and morphisms of C ~ it is not hard to see that 
such a well-ordering can be ‘carried along’ through all the stages of the iteration. 
Of course, if we start from the category Cy”, where T is a regular theory over 
a countable signature ©, it is not hard to construct such a well-ordering; so we 
may deduce that the classical completeness theorem for regular theories over 
countable signatures is valid without any use of choice. 

On the other hand, for coherent logic the use of the Boolean Prime Ideal 
Theorem, to which we appealed explicitly in the proof of 1.5.9, is unavoidable. 
In fact the classical completeness theorem for coherent logic is equivalent to this 
theorem, as may be seen by applying it to the propositional theory of prime 
filters of a distributive lattice Z, which we described in 1.1.7(m). Since this 
theory has a model (in fact its generic model, up to equivalence) in L itself 
considered as a coherent category, it is consistent whenever L is non-degenerate 
(i.e. satisfies 0 # 1), and hence classical completeness implies that every non- 
degenerate distributive lattice has a prime filter - which is well known to be one 
of the equivalent forms of the Prime Ideal Theorem. (See [520] for a discussion 
of the various forms in which this theorem appears in algebra and topology.) To 
put the same thing in more categorical terms: the assertion, for a distributive 
lattice L considered as a coherent category, that the coherent functors L — Set 
are jointly conservative is equivalent to saying that the prime filters of L separate 
its elements — which is another form of the theorem. 


Remark 1.5.12 By an argument similar to that just given, we may see that 
there can be no analogue of 1.5.10(ii) for geometric theories. For if we take T 
to be a geometric propositional theory, then T has a model in its Lindenbaum 
algebra L (cf. 1.4.14), or equivalently in the topos Sh(X), where X is the locale 
corresponding to L (cf. 1.2.15(m)), and so it is consistent whenever L is non- 
degenerate. But its models in Set are the same thing as points of X; so if X is 
a nontrivial locale without points (for example, if T is either of the theories of 
C1.2.8 or C1.2.9), then T is a consistent geometric theory having no models in 
Set. 

Nevertheless, there is a ‘classical completeness theorem’ for geometric logic, 
which asserts that any geometric theory has enough models to determine prov- 
ability (not in Set but) in Boolean toposes. We shall prove this theorem in 3.1.16 
below. 


There can be no completeness theorem in terms of Set-models (or even in 
terms of models in Boolean toposes) for (constructive) full first-order logic; for 
we know that the Law of Excluded Middle (1.3.3) is satisfied in all such models, 
but not provable in this logic. However, if we add this law as an axiom, thus 
obtaining classical first-order logic, we do have a completeness theorem, which 
we can deduce from that for coherent logic by the following trick. 


Lemma 1.5.13 Let T be a first-order theory over a signature £. Then there 
is a signature X' containing ©, and a coherent theory T’ over ’, such that for 
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any Boolean coherent category C we have 
T-Mod(C), ~ T’-Mod(C) . 


The theory T’ is sometimes called the Morleyization of T, in honour of 
M. Morley (cf. [446]). 


Proof We define ©’ to have the same sorts and function symbols as £, but 
(in addition to the relation symbols of ©) we add two new relation symbols 
Cy — Ai: An and Dg — Aj,---An for each first-order formula ¢ over ¥ 
(here the type A;--- A, of Cy and Dg is the string of sorts corresponding to 
the canonical context of ¢; we could have introduced new relation symbols for 
all the formulae-in-context over ©, but thanks to the Substitution Lemma 1.2.7 
there is no need to do so). In T’ we include the axioms 


(Cs (Big cing £n) A Do(T1,.--,En)) Fa. æn L) 


and 
(T Fri, en (ColT1,--- 2n) V Dol(T1,..-,2n))) 


which ensure that the interpretations of Cy and Dg in any T’-model M are com- 
plementary subobjects of M(Aj,...,An), for each formula ¢ over ©. Then we 
have a group of axioms, depending on the structure of the formula ¢, which spec- 
ify the interpretation of the relation symbol Cy in terms of the Cy for appropriate 
subformulae wy of ¢, as follows: 


(i) If ¢ is atomic, 
(Co(£) Are ¢) 


(where we use the notation introduced in 1.3.7 to indicate that we are 
taking both a sequent and its converse as axioms); 
(ii) fọ =T, 
(T Fi] Co); 
(iii) If = (Y A^ x), 
(Co(Z) Are (Cy A Cx)) 
(here the relation symbols on the right are applied to the appropriate 
substrings of 7); 
(iv) If d=, 
(Co Fy L); 
(v) F= (4V x), 
(Co(7) 1Fz (Cy V Cx)); 


(vi) If 6 = =y, 
(Cs(@) AH; Dy(@)); 
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(vii) If ọ = (b= x), 
(Co(#) 4tz (Dy V Cy)); 
(viii) If 6 = (Ay)y, 
l (C(#) Are (3y)Cy (T, y)); 
(ix) If d= (Vy)y, 
(Do(£) Abe (Ay) Dy(@,y)) - 

It will be seen that these axioms are all coherent, and that they ensure, 
for any T’-model M in a Boolean coherent category, that the interpretation of 
£.C4(£) coincides with that of %.@ (for any suitable context 7, not just the 
canonical context of ¢). Thus, to ensure that T’-models coincide with T-models 
in such categories, we have simply to add the axiom (Cy Hz Cy) for each axiom 
(dt zg Y) of T. The fact that homomorphisms of T’-models correspond to elemen- 
tary morphisms of T-models follows from the fact that the former must preserve 
the interpretations of all the Cg. oO 


Corollary 1.5.14 (Classical completeness for first-order logic) Let T be 
a first-order theory over a signature X. If a first-order sequent (ob tz Y) over X 
is satisfied in all T-models in Set, then it is provable in T using classical logic. 


Proof If (@ tz wy) is satisfied in all T-models in Set, then (in the notation 
of 1.5.13) (Cy tz Cy) is satisfied in all T’-models in Set, and so is provable 
in T’. But if we work through this proof replacing every subformula of the form 
Cy (t1, --- tm) by the corresponding substitution instance x[t/Z] of x, and every 
D, by a substitution instance of =x, we will obtain a classical proof in T of the 
sequent (f+ Y), since the effect of this transformation on each of the axioms 
of T’ is to produce either a classically derivable sequent or an axiom of T. For 
example, the second axiom in the first group becomes the Law of Excluded 
Middle itself; and the left-to-right sequent (ix) of the second group transforms 
into (7(Vy)w tz (Sy)-w), which may be proved by combining the following 
(constructively valid) derivation with the instance (T Hz ((Sy)>w V 7(Ay)7y)) 
of the Law of Excluded Middle and the sequent ((Vy)>7~ Fz (Vy)w), which 
follows straightforwardly from the Law of Excluded Middle: 


(=(4y) 7 Fz (3y) 7) 
((-Gy) 7% A (Jy) >y) Fz L) 
((dy)ay Fz a7(Gy)7y) 
(Y Fay 77 Gy)7Y¥) 
CEEA Fay D 
(=(Gy) 7 Fay =) 
GG) Fe Ww). 
Hence (¢ H y) is classically derivable in T. a 


Suggestions for further reading: Freyd [371], Freyd & Scedrov [381], 
Makkai & Reyes [790, 791]. 


D2 


SKETCHES 


D2.1 The concept of sketch 


This chapter (rather like Chapter A3) is something of a digression from our 
main line of development: we shall not make any essential use of sketches in 
anything we wish to do with categorical logic, but, since they are extensively 
used by others in this context, it seems a good idea to give an account of what 
they are and how they relate to the line that we are following. Sketches were 
first introduced by C. Ehresmann [332] and were extensively developed by his 
pupils, notably L. Coppey and C. Lair [658]; for many years they were regarded 
as the ‘exclusive property’ of the French school of category-theorists (of which 
Ehresmann was the founder), but since the publication of the books by Barr and 
Wells [87, 88], and even more since the work of Makkai and Paré [787], their use 
has become much more widespread. In this account, we shall largely follow the 
conventions of Barr and Wells regarding terminology and notation. 

Briefly, a sketch is an alternative way of ‘presenting’ a theory whose models 
we wish to study in appropriate categories, which is ‘categorical’ in character 
as compared with the ‘linguistic’ presentation of Chapter D1. A sketch is in 
some sense intermediate between a theory, as defined in 1.1.6, and its syntactic 
category as constructed in Section D1.4; it has some of the advantages of each, 
but also some of the disadvantages of each. Its principal advantage, as compared 
with a theory, is that we do not have to go through the tedious recursive con- 
struction of terms and formulae over a given signature before we can write down 
the axioms; the analogue of a ‘signature’ from which we start (the underlying 
directed graph of a sketch) contains enough to write down all the axioms directly 
in terms of it. Compared with a syntactic category, its main advantage is econ- 
omy; for example, the syntactic category of even a simple theory such as that 
of groups is infinite, and quite complicated to describe, whereas the sketch for 
groups is finite and can be written down quite explicitly. On the other hand, 
it is appreciably less economical than the linguistic presentation of a theory; 
again taking groups as an example, the usual linguistic presentation requires 
one sort, three function symbols, no relation symbols and three axioms, whereas 
the sketch for groups has four objects, sixteen morphisms and nineteen axioms 
(as we shall see in 2.1.4(b) below). And sketches share with theories the dis- 
advantage of ‘signature-dependence’: equivalence of sketches is a much stronger 
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condition than the assertion that they have equivalent categories of models in 
all appropriate categories. 

To begin the formal definition, recall that a directed graph is a ‘(small) cate- 
gory without composition or identities’; that is, it has a set of vertices or objects, 
a set of arrows or morphisms, and a source and target (or domain and codomain) 
for each arrow. (To help us distinguish categories from directed graphs, we shall 
generally use the italicized terminology, which is not in common use for cate- 
gories, when working in the latter.) Clearly, a directed graph is just a model 
of a certain two-sorted algebraic theory (cf. 1.1.7(b)); alternatively, it is a func- 
tor C — Set, where C is the category with two objects and a parallel pair of 
morphisms between them (cf. A1.3.10(d)). The notion of morphism of directed 
graphs is the obvious one. 

By a diagram in a directed graph G, we mean a graph morphism d: G’ — G; 
we say the diagram is finite if its domain is finite. If G is (the underlying directed 
graph of) a category, we say a diagram d: G’ — G commutes if, for any two 
paths (= finite strings of arrows) in G’ with the same source and target, the 
two morphisms in G obtained by composing the images under d of the arrows 
along the two paths are equal. (We allow the possibility that one of the paths 
might be empty; the composite of an empty path in a category is interpreted as 
being the appropriate identity morphism.) We say a diagram d is a cone if G” 
has a distinguished vertex go such that, for any other vertex g, there is exactly 
one arrow go — g in G’ and no arrows g — go. We say d is a discrete cone if it 
is a cone and there are no arrows in G” other than those with source go. If G” 
satisfies the corresponding conditions with ‘source’ and ‘target’ interchanged, we 
say d is a (discrete) cocone. 


Definition 2.1.1 


(a) A sketch S is a quadruple (Gs, Ds, Ls,Cs) where Gs is a directed graph, 
Dg is a set of finite diagrams in Gs, Ls is a set of cones in Gg and Cs is 
a set of cocones in Gg. (We normally omit the subscript S’s, if there is no 
danger of confusion.) 


(b) A morphism of sketches f: S — T is a morphism f: Gs — Gr of the under- 
lying directed graphs, composition with which takes each member of Ds 
into a member of Dr, each member of Lg into a member of Ly and each 
member of Cs into a member of Cr. We write Sk for the category of sketches 
and sketch morphisms. 


(c) A model of a sketch S in a category C is a graph morphism M: Gs >C 
such that 


(i) for each d: G’ — G in Ds, the composite M o d commutes; 

(ii) for each 1: G! > G in Ls, the images under M o l of the arrows with 
source go in G’ form a limit cone over the diagram formed by restricting 
M ol to the subgraph G” obtained by deleting go, and all arrows with 
source go, from G’; and 
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(iii) for each c: G’ — G in Cg, the images under M oc of the arrows with 
target go in G’ similarly form a colimit cone under the rest of the 
diagram. 


Equivalently (ignoring questions of size), a model of S in C is a sketch mor- 
phism from S to the ‘underlying sketch of C’, where the latter is defined 
to be the underlying directed graph of C, equipped with all finite diagrams 
which commute in C, all limit cones in C and all colimit cones in C. A mor- 
phism of models f: M — N is a natural transformation between graph 
morphisms (which makes sense because the codomain of M and N is not 
just a directed graph but a category). We write S-Mod(C) for the category 
of S-models in C. 


It will be noted that the definition of a model of a sketch does not presuppose 
any general information about the existence of limits or colimits in C, merely that 
certain particular (co)cones are limits or colimits. However, when considering 
the category of S-models in C, it is customary to assume that C has limits and 
colimits of all the shapes which appear in Lg and Cs. 

Just as we classified theories according to the degree of linguistic complexity 
needed to formulate their axioms, so we classify sketches according to the types 
of cones and cocones which appear in them. We next list some of the classes that 
we may wish to consider. 


Definition 2.1.2 Let S= (G, D, L,C) be a sketch. 


(a) 
(b) 


We say S is linear (or elementary) if both L and C are empty. 


We say S is a limit sketch (or projective sketch) if C is empty. If in addition 
all the cones in L are finite, we call S a finite limit sketch (or cartesian 
sketch). If C is empty and all the cones in L are discrete (and finite), we 
call S a (finite) product sketch. 


Similarly, we have the notions of (finite) colimit sketch (also called an 
inductive sketch) and of (finzte) coproduct sketch when L is empty. 

The term mized sketch is sometimes used (for emphasis) to indicate that 
both L and C are nonempty. We say S is finitary if (it is mixed and) all 
members of L and C are finite. 


We say S is a (finitary) disjunctive sketch if all cones in L are finite, and 
all cocones in C are (finite and) discrete. 


We say S is a regular sketch if L consists of finite cones, and all the cocones 
in C have the form 


—— 
p 
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(with distinguished vertex k) where 


Q 
g ————> h 
ô 
h ————> k 
ô 


(with distinguished vertex g) is a cone in L. 


(g) We say S is a coherent sketch if all cones in L are finite, and all cocones in 
C are either of the form described in (f), or finite and discrete. 


(h) We say S is a geometric sketch if all cones in L are finite, but there are no 
restrictions on the cocones in C. 


Remark 2.1.3 After seeing the definition of coherent sketch, the reader may 
have been surprised to discover that that of geometric sketch imposes no restric- 
tions at all on cocones, rather than requiring the ‘infinitary analogue’ of the 
restrictions in 2.1.2(g). We could have done this, but the result would have been 
the same as far as the expressive power of the class of geometric sketches is 
concerned; essentially, the reason for this is contained in A1.4.19, where we saw 
that an oo-pretopos has all small colimits. See also 2.2.8 below. 


Of course, the reappearance in 2.1.2 of various words (regular, coherent, 
geometric, ...) that were used to describe classes of theories in Chapter D1 is no 
accident: we shall see in the next section that these classes of sketches have the 
same expressive power as the classes of theories with the same names. Before 
turning to this, however, we give some examples of the expressive power of the 
various classes of sketches listed above. 


Examples 2.1.4 (a) IfS is a linear sketch, then an S-model in C is essentially 
the same thing as a functor from Cs = (FGs/ =) to C, where FGs is the free 
category on Gs (i.e. the category whose objects are the vertices of Gs and whose 
morphisms are the paths in Gs), and = is the smallest congruence on FGs which 
identifies all parallel pairs which occur in the diagrams in Ds. Conversely, given 
a small category D, we can construct a linear sketch whose models are functors 
defined in D, by taking G to be the underlying directed graph of D, and D to 
consist of all commutative triangles 


A 


; |- 
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in D and all loops 
Aelas 


where 14 is an identity morphism in D. Thus, up to Morita equivalence, linear 
sketches are essentially the same thing as small categories. 

(b) As promised, we give a presentation of the sketch for groups, which is 
a finite product sketch. The underlying directed graph has four vertices which 
we denote by 1, G, G? and G? (anticipating what their interpretations will be 
in a model of the sketch) and the following sixteen arrows: !: G — 1; e: 1 > G; 
two arrows lg and i: G — G; three arrows 7,72 and m: G? — G; two arrows 
e x 1 and (i, 1): G — G?; three arrows 7,72 and 73: G3 — G; and four arrows 
T12, T23, M X 1 and 1 x m: G3 — G?. L consists of the three discrete cones 


1 G? G’ 

G G G G 
(of which the first corresponds to the assertion that 1 is the limit of the empty 
diagram). D contains the loop diagram which says that lg is the identity on 
G, plus, for each of the six arrows with target G?, a pair of diagrams specifying 


that (the interpretation in a model of) the arrow is ‘what its name implies’. For 
example, the diagrams for m x 1 are 


G 


grene gp 


and those for (i, 1) are 
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Finally, the three equations usually thought of as the axioms of group theory are 
represented by the diagrams 


z mxi 2 exl (i, 1) 9 

G’ — G* < G and G ———>> G 

1xm m ; | | 
G 

CE SG eee se 


It is clear that any (many-sorted) algebraic theory, as defined in 1.1.7(a) 
and (b), can be presented in a similar way as a finite product sketch. Conversely, 
any finite product sketch S gives rise to a many-sorted algebraic theory: we take 
the vertices of Gs as sorts and its arrows as unary function symbols, plus, for 
each cone in Lg with arrows (f;: A > B; | 1 < i < n), an n-ary function symbol 
g: B,---B, — A. Each ‘commuting cell’ in a diagram in Dg gives rise to an 
equation in the obvious way, and each n-ary cone gives rise to n + 1 equations 


(alfil) <5 »fn(x)) = T) 


and (filg, Eae ,Yn)) = Yi) (1 Sis n). 

(c) Similarly, (infinite) product sketches can be used to describe ‘infinitary 
algebraic theories’ where one has operations acting on infinitely many elements 
of the structure. For example, for any infinite cardinal «x, one can formulate a 
sketch for k-complete lattices, i.e. lattices in which every subset of cardinality 
less than « has a join and a meet. (However, the smallness restrictions built 
into the definition of a sketch prevent us from describing a sketch for complete 
lattices, in which all subsets have joins and meets; this is demonstrated by the 
well-known fact that there is no free complete lattice on three generators [427]. 
See also 2.3.13 below.) These structures do not fit into the linguistic approach 
as we developed it in Chapter D1, since they require formulae with infinitely 
many free variables; but there is an evident extension of the notion of first-order 
language which would allow this. 

(d) If C is a small cartesian category, there is a finite limit sketch whose mod- 
els are cartesian functors defined on C: just take the linear sketch for functors 
defined on C, as given in (a), and add a limit cone for each finite diagram in 
C (equivalently, for the empty diagram and for each diagram formed by a pair 
of morphisms with common codomain). Thus, in view of 1.4.7, we know that 
every cartesian theory is Morita-equivalent to a finite limit sketch. (We shall 
prove the converse in 2.2.1 below.) Similarly, if C has (some) infinite limits, we 
can construct a limit sketch whose models are functors preserving these lim- 
its. In linguistic terms, such sketches correspond to the infinitary analogues of 
cartesian theories (which have been called ‘ruled theories’ by Isbell [475], mod- 
ulo some differences about size restrictions); as in (c), these theories require an 
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infinitary extension of the language as we defined it in Section D1.1, but can 
be expressed by sequents built up using (finite and) infinite conjunction, and 
existential quantification subject to the restriction of 1.3.4. 

(e) An example of a (finitary) disjunctive sketch is the sketch for fields: here 
we take a directed graph whose vertices, in addition to 1, F, F? and F’, include 
an object U which is the distinguished vertex of a cone 


U 


FP —_____- | — F 


(where m is the multiplication of F), whose effect is to ensure that, in any model 
M of the sketch, M (U) is the ‘object of units’ of M (F). Then the axioms which 
distinguish fields from commutative rings can be expressed by a discrete cocone 


1 U 


F 


(where the arrow U — F is not the same one that appears in the cone above, 
but is the composite of the arrow U — F? with one of the projections). 

(f) We shall give examples of regular and coherent sketches in the next 
section. A simple example of a finitary (mixed) sketch which is not coherent is 
given by the sketch for connected directed graphs. Directed graphs are of course 
models of the linear sketch corresponding to the directed graph 


Ase ee VG 


to express the idea of connectedness, we add a further vertex 1 with arrows 
A — 1 and V — 1, plus the discrete cone which says that 1 is a limit for the 
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empty diagram, and the cocone 


A ee re 


1 


We shall see in 2.4.12(a) below that connected graphs cannot be the models of 
a finitary first-order theory; in particular, they are not the models of a coherent 
theory. 

(g) Let (C,T) be a small site whose underlying category C is cartesian. Then 
we may construct a geometric sketch S whose models, in any geometric category 
€, are cartesian cover-preserving functors C — E (where E is equipped with the 
coverage generated by small covering families); in particular, for a Grothendieck 
topos £, we have 


S-Mod(€) ~ Top (£, Sh(C, T)) 


by C2.3.9. Explicitly, we take the sketch for cartesian functors on C, as defined 
in (d) above, and for each covering family (f;: A; — A | i € I) we add a cocone 
whose vertices are A (the distinguished vertex), the A; and the pullbacks A; x 4A, 
((i,j) € I x J), and whose arrows are the f;, the projections A; x 4 A; — A; and 
A; x4 A, — A, and the composites A; x 4 A, > A; — A. It is straightforward 
to verify that a cartesian functor F: C — E maps this cocone to a colimit iff the 
family (F(f:) |i € J) is covering in £. 

(h) Let (C,T) be a site as in (g). We may also construct a limit sketch whose 
models (in Set) are sheaves on C, as follows: we take the linear sketch for functors 
on C°P, as in (a) above, and then take each of the cocones described under (9), 
with all arrows reversed, as a distinguished cone of the sketch. Of course, this 
sketch will not be geometric unless all the covers in T can be generated by finite 
families; this will happen, for example, if C is a coherent category and T is its 
coherent coverage (cf. C2.1.12(d)). 


Suggestions for further reading: Barr [77], Barr & Wells [87, 89], Makkai & 
Paré [787], Wells [1226]. 


D2.2 Sketches and theories 


The next step in a conventional development of categorical logic in terms of 
sketches would be the construction, for each sketch S in one of the classes con- 
sidered in 2.1.2, of a classifying category for S containing a model of S which 
is generic among S-models in categories of the appropriate kind. However, since 
we have already done the corresponding job for theories in Section D1.4, we can 
by-pass this work by showing that sketches have the same expressive power as 
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theories, i.e. for every sketch in one of the appropriate classes there is a theory 
of the appropriate type having ‘the same’ models in all appropriate categories, 
and conversely. In fact we shall rely heavily on Section D1.4 in order to do this; 
we shall see how to convert sketches into Morita-equivalent theories directly, but 
in going the other way we shall appeal to the fact that (for example) any regular 
theory T is Morita-equivalent to the theory of regular functors on a small regu- 
lar category C;®, and then show how the latter may be converted into a regular 
sketch. 

For cartesian theories and finite limit sketches, the latter half of the work has 
been done already in 2.1.4(d): 


Proposition 2.2.1 Let Cart denote the 2-category of small cartesian cate- 
gories, cartesian functors and arbitrary natural transformations between them. 
For a 2-functor F: Cart — Cat, the following are equivalent: 


(i) F is representable (in the ‘up-to-equivalence’ sense). 
(ii) There exists a cartesian theory T such that F ~ T-Mod(-—). 
(iii) There exists a finite limit sketch S such that F ~ S-Mod(-—). 


Proof (ii) => (i) was proved in 1.4.7, and (i) = (iii) in 2.1.4(d). So it suffices to 
prove (iii) => (ii). 

Given a finite limit sketch S, we take a signature © whose sorts are the ver- 
tices of the graph Gs, with the arrows of Gs as unary function symbols (and 
no relation symbols, other than equality). For each parallel pair of paths in 
a diagram in Dg, we adopt an axiom of the form (T +, (s = t)), where s 
and ¢ are terms of the form f(g(---(z)---)). Now suppose we have a finite 
cone in Lg with vertices Ag, A1,..., An (Ao being the distinguished one), distin- 
guished arrows f;: Ag — A; (1 < i < n) and other arrows typically denoted by 
g: A; — A,. Then we adopt the axioms (T Fz (9(f,(z)) = f,(x))) for each such 
g, together with 


(Aue fi(y)) bay (# = 0) 
and 


(Atala) = 25) Fr. tn (AZo) Ai fı(2o) =x) 


where the conjunction on the left-hand side of the last sequent is over all arrows g 
whose source is not Ag. Clearly, the second displayed sequent. is cartesian relative 
to the first. Also, a =-structure in a (cartesian) category C is the same thing as 
a graph morphism Gg — C; it satisfies the axioms of the first type iff it maps 
the diagrams in Dg to commutative diagrams in C, and it satisfies those of the 
other three types iff it transforms the cones in Lg to limit cones in C. o 
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Remark 2.2.2 Proposition 2.2.1 appears to state less than we have actually 
proved, in that it refers only to T-models and S-models in small cartesian cate- 
gories, and we have considered them in arbitrary cartesian categories. But the 
difference is only ‘apparent, since it is clear that, given any T-model M in a 
cartesian category C, we can find a small subcategory C’ of C, closed under finite 
limits, which contains M. Similar remarks apply to the results to be proved 
below about regular and coherent theories and sketches. 


Proposition 2.2.3 Let Reg denote the 2-category of small regular categories, 
regular functors and arbitrary natural transformations between them. For a 
2-functor F: Reg — Cat, the following are equivalent: 


(i) F is representable. 
(ii) There exists a regular theory T such that F ~ T-Mod(-—). 
(iii) There exists a regular sketch S such that F ~ S-Mod(-—). 
Proof Once again, (ii) => (i) was proved in Section D1.4. 
(i) = (ili): Let C be a small regular category. We construct a regular sketch S 


whose models are regular functors defined on C, by taking the finite limit sketch 
of 2.1.4(d) and adding the cocone 


|| NSE SS Sa NS AER: 
b 
fa f 
B 


whenever f is a cover in C and (a,b) is its kernel-pair. (This is allowable under 
2.1.2(f), since the cones of S include that for the pullback of f against itself.) 
Clearly, a model of S is a cartesian functor defined on C which sends covers to 
coequalizers of their kernel-pairs; but this is the same thing as a regular functor, 
by Al.3.4. 

(iii) = (ii): Given a regular sketch S, we construct a regular theory T with 
the same models, by taking the cartesian theory described in the proof of 2.2.1 
and adding the sequent (T +, (Sz: A)(f(z) = y)) for each cocone 


a 
Po re 
b 
g f 
B 


in S. Then a model of T in a regular category D is a model M of the finite limit 
part of S, such that for each cocone as above the morphism M(f) is a cover 
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in D. But since the finite limit part of S includes the information that (a,b) is 
the kernel-pair of f, this is equivalent to saying that M (f) is the coequalizer of 
M(a) and M(b), i.e. that M is a model of S. 0 


Remark 2.2.4 Proposition 2.2.3 remains valid if we replace the 2-category 
Reg by the sub-2-category Eff Reg of effective regular categories, by 1.4.12(i). 


To get a similar result for the coherent case, we actually have to restrict 
our attention to models in positive coherent categories; this is a consequence of 
the way in which we defined coherent sketches, which allows us to talk about 
disjoint unions but not about arbitrary (finite) unions. It would be possible to 
formulate a more ‘relaxed’ version of Definition 2.1.2(g) which would not have 
this disadvantage (cf. the proof of 2.2.7(ii) below); but it would be appreciably 
more complicated. And, since we are primarily interested in models in toposes, 
the restriction is not one that worries us unduly; in fact we shall formulate the 
next proposition only for models in pretoposes. 


Proposition 2.2.5 Let PreTop denote the 2-category of small pretoposes, 
coherent functors and arbitrary natural transformations between them. For a 
2-functor F: BreTop — Cat, the following are equivalent: 


(i) F is representable. 
(ii) There exists a coherent theory T such that F ~ T-Mod(—). 
(iii) There exists a coherent sketch S such that F ~ S-Mod(-—). 


Proof (ii) > (i) was proved in 1.4.12(ii). For (i) => (iii), let C be a small 
pretopos; take the regular sketch constructed in 2.2.3, and add the discrete 
cocone which says that 0 is an initial object, and the cocones 


A B 


A+B 


for each pair of objects (A, B). A model of this sketch is then a regular func- 
tor defined on C which preserves finite coproducts; but since unions can be 
constructed from coproducts and images, this is the same thing as a coherent 
functor. 

(iii) > (ii): Given a coherent sketch S, take the theory constructed in the 
Proof of 2.2.3 from the regular part of S, and add the sequents 


(+ - VGA -= 2)). 


(alri) = fi(z)) Fr, z (£i = 24)) (for each i) and ((fi(xi) = fi(£3)) Fz,,z; L) 


t 


) 
(for each i # j) for every discrete cocone with arrows f;: Aj > A (1 <i <n). 


872 SKETCHES D2.2 


Then a T-model in a pretopos D is a model of the regular part of S, which 
sends each discrete cocone in S to a disjoint union of subobjects (equivalently, a 
coproduct) in D; so it is the same thing as an S-model in D. 


Remark 2.2.6 It will be observed that the sequents constructed from discrete 
cocones in the proof of 2.2.5(iii) => (ii) form a disjunctive theory, as defined 
in 1.3.6; this forms part of the proof (which we shall not give in detail) that 
disjunctive theories have the same expressive power as disjunctive sketches in 
the sense of 2.1.2(e). 


For geometric theories, we cannot state the analogue of 2.2.1, 2.2.3 and 2.2.5 
in precisely the same form, because of the fact that oo-pretoposes are generally 
not small. Nevertheless, we still have 


Proposition 2.2.7 
(i) Let S be a geometric sketch. Then there exists a geometric theory T such 
that, for all oo-pretoposes C, we have S-Mod(C) ~ T-Mod(C), naturally 
in C. 
(ii) Let T be a geometric theory. Then there exists a geometric sketch S such 
that, for all geometric categories C, we have T-Mod(C) ~ S-Mod(C), 
naturally in C. 


Proof (i) Given S, we construct the cartesian theory corresponding to the 
finite limit part of S, as in 2.2.1. Then, for each cocone in Cs with distinguished 
arrows (fi: A; — A | i € I), we add the sequent 


(TH, VOR e=) 
iE 


plus, for each non-distinguished arrow g: A; — Aj, the sequent 


(T Fz, (f)(9(22)) = fi(zi))) 


and, for each pair (i, j), the sequent 


(vie = f;(z;)) Fy, 2, V (An, . om ( A Pk A (x = y1) A (£) = =) 
k=1 


where the disjunction on the right is over all finite chains of non-distinguished 
arrows (in either direction) 


E ae ee te gee 


J 


and Øp is either (gk(Yk) = Yk+1) OF (gk(Yk+1) = Yk), depending on the direction 
of gx. It is straightforward to verify that, if M is a model of the finite limit part 
of S in an oo-pretopos C, then M satisfies the sequents displayed above iff MA 
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is covered by the coproduct of the M A;, i € I, and the kernel-pair of this cover 
is exactly the equivalence relation by which we have to factor the coproduct 
to obtain the colimit of the diagram formed by (the images under M of) the 
non-distinguished arrows in the cocone. (Compare the construction of arbitrary 
coequalizers in an oo-pretopos, given in Al.4.19.) Thus a model of the theory 
described above is a model of S. 

(ii) Given T, the geometric category C#°°™ is generally not small, but it is 
equivalent to a small category, since every object is isomorphic to one of the form 
{z.o} where ¢ is a disjunction of regular formulae, by 1.3.8(ii). So it suffices to 
prove that, given a small geometric category C, we can construct a geometric 
sketch S whose models correspond to geometric functors defined on C. For this, 
we take the regular sketch constructed in 2.2.3, and for each union of subobjects 
A = (J;e; Ai in C we add a cocone with distinguished vertex A, whose other 
vertices are all the A; and the pairwise intersections A; N A}, and whose arrows 
are all the inclusions A; — A, A; N A, — A, A; Aj — A; and A; N A, — A). 
By the infinitary analogue of A1.4.3, a functor F: C — D (where D is geometric) 
maps this cocone to a colimit iff FA is the union of the F A;. Hence F is a model 
of the sketch just described iff it is a regular functor and preserves arbitrary 
unions, i.e. iff it is a geometric functor. C] 


Remark 2.2.8 As we remarked in 2.1.3, our definition of geometric sketch 
in 2.1.2(k) is not simply the infinitary analogue of 2.1.2(g), in that it includes 
no restrictions on the types of cocones in S. The proof of 2.2.7(i) provides the 
justification for this choice; in a sense, it is just a reworking of the observation, 
already made in A1.4.19, that an oo-pretopos has all small colimits. Note also 
that it follows trivially from 2.2.7 that geometric sketches and geometric theories 
have the same expressive power in oo-pretoposes; however, the use of arbitrary 
geometric categories, rather than oo-pretoposes, in the proof of (ii) was essential, 
since an oo-pretopos (has all small coproducts, and hence) is almost never small. 


The reader may also be wondering whether it is possible to characterize a class 
of theories having the same expressive power as the class of (arbitrary) finitary 
sketches. At least as far as models in Grothendieck toposes are concerned, the 
answer is yes, thanks to a result which we noted in Section A2.5. In order to 
present it, let us define a o-coherent theory to be a geometric theory whose 
axioms involve only (finite or) countable disjunctions. We shall also call a sketch 
o-coherent if it contains only finite cones and (finite or) countable cocones. 


Proposition 2.2.9 
(i) Let S be a o-coherent sketch. Then there exists a o-coherent theory T such 
that, for all o-pretoposes C, we have S-Mod(C) ~ T-Mod(C), naturally 
in C. 
(ii) Let T be a o-coherent theory. Then there ezists a finitary sketch S such 
that, for all countably cocomplete toposes €, we have T-Mod(€) ~> 
S-Mod(€), naturally in E. 
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Proof (i) follows immediately from an inspection of the proof of 2.2.7(i): if 
the cocones in S are all countable, then so are all the disjunctions in the sequents 
displayed there. 

(ii) Given a o-coherent theory T, we may construct a (small) o-coherent 
syntactic category CZ°°™ in the usual way, and then positivize and effectivize it 
to a o-pretopos PF, such that for any o-pretopos € (in particular, for any topos 
with countable coproducts) we have 


T-Mod (E) ~ o-BreLVop (Pr, E) 


(this is just the ‘sigma’ analogue of 1.4.12(ii) and (iii)). But if € is a topos, then by 
A2.5.7 the objects of the category on the right-hand side (the o-coherent functors 
Pī — E) are exactly those functors which are both cartesian and cocartesian. 
So we may identify them with S-models in £, where S is the sketch formed by 
the underlying directed graph of PZ, together with all commutative diagrams, 
all finite limit cones and all finite colimit cocones in this category. E 


Thus the expressive power of finitary sketches coincides with that of 
o-coherent theories, and with that of o-coherent sketches. In particular, we 
note that the theory of K-finite objects, introduced in 1.1.7(k), is sketchable 
by a finitary sketch, although (as we shall see in 2.4.12(c) below) it is not 
Morita-equivalent to any finitary theory. 


Suggestions for further reading; Adámek et al. [19], Makkai [785]. 


D2.3 Sketchable and accessible categories 


One of the most remarkable results in the theory of sketches is that the class 
of sketchable categories — that is, those categories which occur as S-Mod(Set) 
for some sketch S — can be characterized up to equivalence in purely categorical 
terms, as the class of accessible categories. We shall not give the proof of this 
result in full detail, since it is somewhat peripheral to our main concerns, and it 
is adequately covered in the books by Makkai and Paré [787] and by Adámek and 
Rosicky [23]. Nevertheless, it seems appropriate to give at least a rough outline 
of the argument. 

We begin with some terminology. Throughout this section we shall be working 
with locally small categories (that is, categories enriched over a fixed category 
Set of sets), and the letters x and A will denote (infinite) regular cardinals in 
Set. A category C is called K-small if the class of all its morphisms is a set of 
cardinality less than «, and C is called «-filtered if every k-small diagram in C 
(that is, every functor D — C where D is k-small) has a cone under it. (This is 
in particular the case when C has all «-small colimits, in which case we call it 
k-cocomplete. ) 

The usual notion of filteredness, studied in Section B2.6 for internal cate- 
gories, is equivalent to w-filteredness, where w is the cardinality of the set of 
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natural numbers. For a general «, «-filteredness can also be expressed in more 
‘elementary’ terms, similar to the conditions of B2.6.2: it is sufficient to con- 
sider discrete diagrams (i.e. families of objects) of cardinality less than «, and 
diagrams consisting of two objects and a family of fewer than « parallel mor- 
phisms between them. We shall also need the ‘«k-analogue’ of (the specialization 
to Set of) the main result of Section B2.6: namely, that a small category C is 
«-filtered iff the colimit functor lime: [C, Set] — Set preserves all «-small limits. 
The proof is a straightforward generalization of the familiar one for K = w. 


Definition 2.3.1 Let C be a locally small category with «-filtered colimits 
(that is, colimits over all small «-filtered categories). 


(a) An object A of C is called k-presentable if the functor C (A, —): C — Set 
preserves «-filtered colimits. (In the special case x = w, we normally say 
‘finitely presentable’ rather than ‘w-presentable’.) 


(b) C is called k-accessible if it contains a set G of k-presentable objects, such 
that every object of C is expressible as a «-filtered colimit of objects in G. 


A category is called accessible if there exists a regular cardinal « for which it 
(has «-filtered colimits and) is «-accessible. It is clear that a K-presentable object 
is A-presentable for all regular A > «; it is not true in general that a «-accessible 
category is A-accessible for all A > «x, but for any accessible category C there 
exists a cardinal À such that C is ’-accessible for all regular ’ > A. 

The set G appearing in part (b) of the definition is clearly a generating set 
for C, as defined in Section A1.2; but the condition imposed in (b) is stronger 
than this. There is (up to equivalence) a ‘canonical’ choice for G: 


Lemma 2.3.2 Let C be a «-accessible category. Then 


(i) The full subcategory C, of k-presentable objects of C is essentially small 
(i.e. equivalent to a small category). 


(ii) For any object A of C, the comma category (C, | A) (whose objects are mor- 
phisms B — A with B € ob Cx, and whose morphisms are commutative 
triangles) is «-filtered. 

(iii) C,, is dense in C; i.e., for any object A of C, the canonical cone under the 
forgetful functor (Cx | A) — C with vertex A is colimiting. 


Proof If A is a x-presentable object of C, then by expressing it as a «-filtered 
colimit of objects G; in the set G given by 2.3.1(b), we see that the identity 
morphism A — A must factor through some G; — A, i.e. A is a retract of some 
G;. Since each G; has only a set of idempotent endomorphisms, it follows that 
there is only a set of isomorphism classes of k-presentable objects of C. So (i) 
is established. (Conversely, any retract of a K-presentable object is easily seen 
to be «-presentable; so Cx is precisely the full subcategory of C on those objects 
which are retracts of members of G.) 


876 SKETCHES D2.3 


Now let A be an arbitrary object of C, and let D: J — C be a diagram in 
C with colimit A, where J is «-filtered and the objects D(j), j € ob J, are in 
G. We have an obvious comparison functor F: J — (Cx | A) sending j to the jth 
leg of the colimiting cone; and from the very definition of k-presentability, it is 
easy to see that this functor is final (that is, each object of (C, | A) admits an 
‘essentially unique’ map to an object in the image of F). From this, it follows 
by standard arguments that, for any functor G: (Cx | A) — E where E is cocom- 
plete, the canonical morphism lim (Go F) — lim,e,) 4)(G) is an isomorphism. 
Taking € = Set and using the characterization of «-filtered categories mentioned 
earlier, we deduce that (Cx | A) is «-filtered; and taking G to be the forgetful 
functor (Cx | A) — C we obtain (iii), since the composite of this functor with F 
is simply D. o 


Thus, given that C is k-accessible, we may take the witnessing set G to be 
any representative set of objects of the subcategory Cx, and we shall normally 
do this from now on. 

In general, the denseness of a small full subcategory D of C is equivalent 
to saying that the canonical functor V: C — [D°?, Set] which sends A to (the 
restriction to D of) the functor C (—, A) is full and faithful: for a morphism 
V(A) — V(B) in [D°?, Set] is essentially the same thing as a cone under the 
forgetful functor (D | A) — C with vertex B. (Recall that ob D is a separating 
(resp. generating) set for C iff V is faithful (resp. conservative).) 

This leads to an alternative characterization of accessible categories as sub- 
categories of functor categories. We shall call a functor F: D°? — Set a «-torsor 
on D if it is expressible as a «-filtered colimit of representable functors; this 
is equivalent to saying that the domain F of the discrete fibration F — D 
corresponding to F is «-filtered (cf. B3.2.3). Using this, one may show that 
the category k-Tors(D) of «-torsors on D is closed under «-filtered colimits in 
[DP Set]. 


Proposition 2.3.3 A category C is K-accessible iff it is equivalent to the 
category «-Tors(D) for some small category D. 


Proof One direction follows from 2.3.2: if C is K-accessible, we take D to be 
a (small) skeleton of C,, and then the functor V: C — [D° Set] induces one 
half of the required equivalence. Conversely, we must show that «-Tors(D) is 
always «-accessible. We have observed above that it has «-filtered colimits; and 
since these are preserved by the inclusion x-Tors(D) — [D°?, Set], the Yoneda 
lemma ensures that the representable functors D°? — Set are «-presentable as 
objects of k-Tors(D). So they will serve as the members of the set G required 
for 2.3.1(b). z 


We remark that the «-presentable objects of «-Tors(D) are exactly the 
retracts of representable functors, by the argument in the proof of the first part 
of 2.3.2. (The 2-element monoid {1,e} with e? = e is a «-filtered category for 
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any K, So retracts of representable functors are always «-torsors.) Thus, provided 
D is Cauchy-complete, we can recover it (up to equivalence) from «-Tors(D). 

An important special case of 2.3.3 occurs when the category under consider- 
ation is (co)complete: 


Proposition 2.3.4 For a category C, the following are equivalent: 


(i) C ts K-accessible and cocomplete. 
(ii) C is K-accessible and K-cocomplete. 


(iii) C is equivalent to the category k-Cts (D°, Set) of K-continuous functors 
DP — Set (that is, functors preserving «-small limits), where D is a 
k-cocomplete small category. 


(iv) C is K-accessible and complete. 


Proof (i)=(ii) is trivial. If (ii) holds, it follows easily from the fact that «-small 
limits commute with «-filtered colimits in Set that C, is closed under k-small 
colimits in C; in particular it is k-cocomplete (and so is any skeleton D). But it is 
easy to see that any «-torsor on D preserves «-small limits, since it is a «-filtered 
colimit of functors preserving all limits; and conversely if D is k-cocomplete, then 
any «-continuous functor is a «-torsor, by the argument of B3.2.5. So condition 
(iii) follows from 2.3.3. For (iii) > (iv), note that k-Cts (D°?, Set) is closed under 
arbitrary small limits in [D°P, Set], since limits commute with limits. 

It remains to show that (iv) implies (i). For this, we shall employ the General 
Adjoint Functor Theorem, to construct a left adjoint for the diagonal functor 
A:C — [J,C] for any small category J. It is clear that (iv) implies that C has, 
and A preserves, all small limits, so we have to verify the solution set condition 
for a typical object D: J — C of [J,C]. Note first that since every object A of C is 
expressible as a small («-filtered) colimit of objects which are «-presentable (and 
hence A-presentable for any À > K), we can find A such that A is A-presentable. 
Hence, as J is small, we can find A such that J is A-small, C is A-accessible, and 
all the objects in the image of D are A-presentable. Now let G) be a representative 
set of A-presentable objects of C; then it is easy to see that the set of all cones 
under D with vertices in G, is a solution set for A at the object D. E 


A direct proof of (iii) > (i) in 2.3.4 may be given by using the Adjoint Functor 
Theorem to show that «-Cts(D°?, Set) is reflective in [D°P,Set], for any x; 
see [587]. 

Categories satisfying the equivalent conditions of 2.3.4 are called locally 
k-presentable (or simply locally presentable, if we do not wish to specify the 
cardinal x). They have been studied for considerably longer than general acces- 
sible categories; see [386]. In contrast to the situation noted after 2.3.1, a locally 
k-presentable category C is always locally A-presentable for all regular A > «x; 
for the full subcategory C, is closed under A-small colimits, and hence (C) | A) is 
A-filtered (indeed, \-cocomplete) for all A. However, in other respects the theory 
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of locally presentable categories generalizes with remarkably little difficulty to 
accessible categories. 


Remark 2.3.5 Although, as we indicated at the start of this section, our con- 
cern is really with infinite regular cardinals «, it is instructive to consider what 
happens to the results above if we take « to be the (trivially regular) cardinal 
0. Since no categories are 0-small, we see that every category is 0-filtered, and 
so an object A of a cocomplete category C is 0-presentable iff C (A, —) preserves 
arbitrary small colimits. Since a 0-accessible category is necessarily cocomplete, 
we thus deduce that a category C is 0-accessible iff it is locally 0-presentable, iff 
it is equivalent to [D°?, Set] for some small category D. Note also that, if C is 
an oo-pretopos, then an object A of C is 0-presentable iff C (A, —) is a geometric 
functor, iff A is an indecomposable projective (cf. A1.1.10); thus the result just 
quoted yields another proof of the characterization of functor categories amongst 
Grothendieck toposes given in C2.2.20. 


Locally presentable categories correspond, in a fairly precise sense, to limit 
sketches: 


Theorem 2.3.6 For a category C, the following are equivalent: 


(i) C is locally K-presentable. 


(ii) There exists a limit sketch S, in which all the distinguished cones have 
cardinality less than K, such that C ~ S-Mod (Set). 


Proof Suppose (i) holds. We construct a limit sketch S = (G, D, L) as follows: 
G is the underlying directed graph of (a small skeleton of) (C,,)°?, D consists of 
all finite commutative diagrams in this category, and L consists of all limit cones 
of cardinality less than «x. Then models of S are precisely «-continuous functors 
(C,,)°? — Set, so by 2.3.4(iii) they form a category equivalent to C. 

For the converse, consider first the case k = w. Then S is a finite limit 
sketch, so it corresponds to a cartesian theory T by 2.2.1, and its models can 
be identified with cartesian (that is, w-continuous) functors Cr — Set. So by 
2.3.4(iii) S-Mod(Set) is locally finitely presentable. For larger values of «, the 
argument is similar: we identify S with an appropriate theory in ‘«-cartesian 
logic’, that is the fragment of infinitary logic in which we allow conjunctions of 
cardinality less than «x, and existential quantification subject to the restriction 
in 1.3.4, and then we generalize the construction of the syntactic category Cy in 
1.4.1, by allowing ‘contexts’ to be families of variables of cardinality less than 
k. In this way we obtain a -complete small category D such that models of S 
correspond to «-continuous functors defined on D. E 


Since we are concerned mainly with sketches in which all cones are finite, 
we shall generally be interested only in the case k = w of the above theorem. 
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However, it is worth pausing to note one consequence of the general case: 
Corollary 2.3.7 Any Grothendieck topos is locally presentable. 


Proof We saw in 2.1.4(h) that, for any small site (C,T) whose underlying 
category C is cartesian, there is a limit sketch whose models are T-sheaves on 
C. But any Grothendieck topos can be presented as the category of sheaves on 
such a site, by C2.2.8(iii), so the result is immediate from 2.3.6. C] 


Generalizing the first half of the proof of 2.3.6, we have 


Proposition 2.3.8 Any accessible category is sketchable. More precisely, if C 
is K-accessible, then there is a sketch S such that C ~ S-Mod(Set), and all the 
cones in S have cardinality less than « (although the cocones may be larger). 


Proof Let D be a small category such that C œ «-Tors(D), as in 2.3.3. We 
shall construct a sketch whose models are (equivalent to) «-torsors on D. 

Let F: D°P — Set be a functor, and F — D the corresponding discrete 
fibration. The assertion that F satisfies the first of the ‘elementary’ conditions 
for «-filteredness, mentioned before 2.3.1, is tantamount to saying that, for each 
family (A; | i € I) of fewer than «x objects of D, the canonical map 


IFE) — [[ Fo. 


ie] 


where the coproduct is over all families of morphisms (fi: A; — B |i € I) with 
common codomain, is epic. But we may construct a sketch whose models are 
functors satisfying this condition, starting from the linear sketch for functors 
defined on DP, by adding new vertices and arrows to the graph for the product 
and coproduct above, their projections and coprojections, the canonical mor- 
phism between them, and the identity morphism on Į |,e; F(Ai), together with 
the diagrams which say that the two latter morphisms are what they ought to 
be, the cone and cocone which say that the product and coproduct are what 
they ought to be, and the cocone which says that the pushout of the displayed 
morphism above against itself is the identity. If we do this for all families of 
objects (A; | i € I) of cardinality less than « in D, and perform a similar con- 
struction for the second ‘elementary’ condition for filteredness, we obtain the 
required sketch. E 


In particular, we note that any finitely accessible category is sketchable by 
a geometric sketch; equivalently, by 2.2.7, it is (equivalent to) the category of 
models in Set of a geometric theory. However, the converse of 2.3.8 is not true 
in this ‘sharp’ form: a bound for the cardinalities of the cones in a sketch does 
not necessarily yield an ‘index of accessibility’ for its category of models. 


Example 2.3.9 Consider the theory T of directed graphs in which every vertex 
is the source of some arrow. This is clearly a regular theory, and so is sketchable 
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by a finitary (indeed, finite) sketch. However, we shall show that its category of 
models is not w-accessible. Let N< be the graph whose vertices are the natural 
numbers, with one arrow m — n whenever m < n, and N< the graph obtained 
from it by adding an arrow n — n for each n; clearly, both N< and N< are 
models of T. If A is a nonempty finitely presentable T-model and there is a 
homomorphism f: A — N<, then its composite with the inclusion N< — N< is 
also a homomorphism; but N< is the directed union of its finite full subgraphs 
(all of which are models of T), and so the finite presentability of A ensures that 
f has finite image. But this means that there is a vertex v of A such that f(v) 
is an upper bound for the image of f, which contradicts the requirement that 
there must exist some arrow v — w in A. So there can be no such f; hence Ne 
cannot be expressed as a filtered colimit of finitely presentable T-models. 


In proving the converse of 2.3.8, the key ingredient is a ‘downward 
Lowenheim-Skolem theorem’ for models of sketches. Given a Set-model M of a 
(small) sketch S, we define the underlying set |M/| of M to be the disjoint union 
of the sets MA, as A ranges over the vertices of Gs; the cardinality of M is the 
cardinality of its underlying set. Then we have 


Theorem 2.3.10 For any sketch S, there exists a regular cardinal x such that, 
for any S-model M in Set, each subset of |M] of cardinality less than kK is 
contained in (the underlying set of) a submodel of cardinality less than «K. O 


We omit the proof of 2.3.10, for which we refer the reader to [787, Section 3.3], 
(which also gives information on how k may be calculated from the size of the 
various components of S). 


Corollary 2.3.11 For any sketch S, the category S-Mod(Set) is accessible. 


Proof Let C be the small category such that models of the linear part of S 
(i.e. the sketch obtained from S by deleting all its cones and cocones) correspond 
to functors defined on C, as in 2.1.4(a). (Note that the objects of C may be 
identified with the vertices of Gs.) Then we may consider S-Mod(Set) as a full 
subcategory of the functor category [C, Set]. Let « be a cardinal such that 2.3.10 
holds; we may assume that « is larger than the cardinality of mor C (and larger 
than the cardinality of each cone in Lg). From the latter fact, and the fact that 
«-small limits commute with «-filtered colimits in Set, we may conclude that 
the colimit in [C, Set] of any «-filtered diagram of S-models is again an S-model: 
in particular, S-Mod(Set) has «-filtered colimits. 

Let us call a functor F:C — Set x«-small if the cardinality of |F] (i.e. of 
Llacob c FA) is less than «x. Since colimits in [C, Set] are computed pointwise. 
it is easy to see that any «-small functor is K-presentable in [C, Set]. Since the 
inclusion S-Mod(Set) — [C, Set] preserves «-filtered colimits, it follows that if 
F as above is actually an S-model then it is k-presentable in S-Mod(Set). But 
by 2.3.10, any S-model can be written as the «-directed union (and hence the 
K-filtered colimit) of its k-small submodels; hence also any K-presentable object 
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of S-Mod(Set) must itself be k-small, and there are (up to isomorphism) only 
a set of such models. 0O 


A substantially different proof of 2.3.11, which avoids the use of the 
Löwenheim-Skolem theorem but is appreciably longer, will be found in [23]. 


Remark 2.3.12 Let F be a locally presentable category, and S a sketch. By 
2.3.6, we may identify F with T-Mod(Set) for some limit sketch T; it is then not 
hard to construct a sketch S @ T such that (S @ T)-models in Set are ‘the same 
thing as’ S-models in T-Mod(Set). Thus we deduce that, for any such F and S, 
the category S-Mod(F) is accessible; in particular, combining this observation 
with 2.3.7 and 2.2.7, we deduce that for any Grothendieck topos F and any geo- 
metric theory S, the category S-Mod(F) is accessible. In view of 3.1.13 below, 
this is equivalent to saying that the 2-category BYop/Set of Grothendieck 
toposes is ‘locally accessible’, i.e. all its hom-categories are accessible. 

In particular this implies that, for any two Grothendieck toposes E and F, 
there exists a small full subcategory K of BYop/Set (F,E) such that every object 
of the latter category is a filtered columit of objects of K. Recalling the fact 
that filtered colimits in BYop/Set (F, €) are computed pointwise at the level of 
inverse image functors (B3.4.8), this yields an alternative proof of C2.2.11. 


For the particular case when F = Set, a direct proof of the italicized assertion 
in the second paragraph of 2.3.12, without appealing to the theory of accessible 
categories, may be found in [504, 7.16]. 


Example 2.3.13 Before leaving this section, we should give at least one exam- 
ple of a category which is not accessible. We claim that Set°? is not accessible. 
To see this, let x be an infinite regular cardinal, let A, be a set of cardi- 
nality «, and let J, be the poset of subsets of A, of cardinality less than 
K, ordered by inclusion. It is clear that J, is «-filtered. We have a diagram 
D: JP — Set which sends a subset S to A, \ S, and an inclusion S’ C S 
to the inclusion (A, \ S) C (Ax \ S’); clearly, the limit of this diagram in 
Set is ð. Now if B is any set with more than one element, then any two dis- 
tinct constant maps A, 3 B remain distinct when restricted to any vertex of 
the diagram D, but not when restricted to its limit; thus the canonical map 
lim 7(Set (D(S), B)) — Set (lim jop(D(S)), B} is not bijective. So B is not 
k-presentable in Set°?, for any «K; hence also it cannot be expressed as a colimit 
in Set°? of «-presentable objects. 

On the other hand, we know that Set°? is monadic over Set (A2.2.7); so, by 
the well-known identification of monads on Set with infinitary algebraic theories, 
it can be regarded as the category of models of such a theory. (The theory in 
question can be presented explicitly as the theory of complete Boolean algebras 
which are completely distributive, or alternatively as the theory of compact 
Hausdorff topological Boolean algebras, as we mentioned in A2.2.8.) And any 
infinitary algebraic theory can be presented by an ‘infinite product sketch’, as 
we observed in 2.1.4(c); however, if the theory corresponds (as in this case) to a 
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monad on Set whose functor part is inaccessible (i.e. does not preserve «-filtered 
colimits for any «), then the underlying graph of the sketch will be ‘large’, i.e. it 
will have proper classes of vertices and arrows. Thus the inaccessibility of Set°P 
is telling us that, in this case at least, the largeness is essential; that is, there is no 
way of presenting a theory Morita-equivalent to that of completely distributive 
complete Boolean algebras by means of a sketch based on a small directed graph. 


Suggestions for further reading: Adámek & Rosicky [23], Ageron [33], 
Diers (286, 287], Lair [658], Makkai & Paré [787], Makkai & Pitts [788]. 


D2.4 Properties of model categories 


The purpose of this section is to justify a number of remarks which were made 
in 1.1.7 and elsewhere, to the effect that certain particular theories do not lie in 
certain of the fragments of first-order logic that we have considered. It is easy to 
see how one shows that a particular theory is regular (for example): one simply 
writes down a set of axioms for it, and observes that they are all regular sequents. 
But how do we show that a theory T has no axiomatization by regular sequents? 

The answer, in most cases, is to look at its category of models (in Set): if we 
can show that this category fails to enjoy some categorical property enjoyed by 
the model categories of all regular theories, then T cannot be regular. Thus we 
are interested in finding properties which are in some sense characteristic of the 
model categories of theories in particular fragments of first-order logic. There 
are two types of properties in which we shall be interested: ‘absolute’ properties 
of T-Mod(Set) as a category (for example, the possession of certain limits or 
colimits), whose failure for a given T would show that T cannot be Morita- 
equivalent to a regular theory over any signature, and ‘relative’ properties of 
T-Mod(Set) as a subcategory of &-Str(Set) (for example, closure under certain 
limits or colimits), whose failure would show that T cannot be axiomatized by 
regular sequents over the particular signature © over which it is expressed. 

In general, we shall work through the classes of theories in order of increasing 
complexity, starting with algebraic theories. Before we do so, however, we need 
to establish some further information about the structure of model categories 
of cartesian theories. By 2.3.6, we know that such categories are locally finitely 
presentable (and that every locally finitely presentable category, up to equiva- 
lence, occurs in this way). Also, if T is such a theory, we know that the finitely 
presentable objects of T-Mod(Set) are exactly (the T-models corresponding to) 
the representable functors Cr — Set, since Cr has equalizers and so its idempo- 
tents split (cf. the remarks after 2.3.3). Let {¢.¢} be an object of Cr (where, by 
1.4.4(ii), we may as well assume that ¢ is a finite conjunction of atomic formu- 
lae): we shall write (Z : ¢) for the corresponding finitely presentable T-model. 
This notation is justified by 
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Lemma 2.4.1 (Z: ¢) is a representing object for the functor T-Mod(Set) — 
Set which sends a model M to [Z.¢] (cf. 1.2.9). 


Proof By the Yoneda lemma, morphisms (7 : ¢) — M correspond to elements 
of Fu({ĒZ.¢}), where Fm is the cartesian functor Cr — Set corresponding to 
M. So the result is immediate from the definition of this functor in the proof 
of 1.4.7. o 


Equivalently, we may think of (Z : ¢) as the T-model M freely generated by a 
finite string of elements (a; € MA; | 1 < i < n) (where A; is the sort of x;), sub- 
ject to the finite conjunction of conditions corresponding to the formula ¢. Thus 
the notion ‘finitely presentable’, for models of cartesian theories, corresponds to 
the traditional notion ‘finitely presented’. (This is, of course, the origin of the 
former term.) 

Now we turn to algebraic theories. 


Lemma 2.4.2 Let T be a finitary algebraic theory as in 1.1.7(a). Then 


(i) T-Mod(Set) is monadic over Set. 
(ii) T-Mod(Set) is effective regular. 


Proof These are well-known consequences of the characterization of monads 
on Set (see, for example, [726]; cf. also 5.3.1 and 5.3.2 below). o 


Similarly, the category of models of a many-sorted algebraic theory as defined 
in 1.1.7(b) is monadic over some power of Set, and hence effective regular. Thus, 
for example, the theory of torsion-free abelian groups, described in 1.1.7(d), 
cannot be Morita-equivalent to an algebraic theory, for we saw in Section A1.3 
that its category of models is (regular but) not effective. The latter state of affairs 
is typical of Horn theories over signatures having no relation symbols (other than 
equality), as we shall see in 2.4.4 below. But first we need 


Lemma 2.4.3 Let T be a Horn theory over a signature 5. Then 


(i) An arbitrary product (in U-Str(Set)) of T-models is a T-model. 


(ii) Any substructure of a T-model is a T-model. 


Proof (i) It is easy to verify that, for any family (M; | i € I) of D-structures 
and any Horn formula-in-context Z., the interpretation of z.¢ in the product 
[ier Mi is simply the product of the [Z. 4] ,. Hence any Horn sequent satisfied 
in each M, is satisfied in the product. 

(ii) Similarly, if N is a substructure of M, then as we remarked in 1.2.10(c) 
the interpretation in N of any quantifier-free formula-in-context (in particular, 
of any Horn formula-in-context) Z.¢@ is simply the intersection of [Z.¢]a¢ with 
N(Aj,...,An). So N satisfies any Horn sequent satisfied by M. g 
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Part (i) of 2.4.3 remains true for arbitrary regular theories, by essentially 
the same proof, but part (ii) does not (even for cartesian theories): for the 
theory of categories as described in 1.1.7(e), it is easily seen that there are 
substructures of models which are not models. Hence the theory of categories 
cannot be axiomatized by Horn sequents over the particular signature considered 
in 1.1.7(e). After 2.4.6 below, we shall be able to prove that it cannot be a Horn 
theory over any signature. 


Corollary 2.4.4 Let T be a Horn theory over a signature © such that }X-Rel 
is empty. Then 


(i) T-Mod(Set) is reflective in U-Str(Set), the unit of the reflection being 
regular epic. 


(ii) T-Mod(Set) is a regular category. 


Proof (i) Given a ¥-structure M and a family of congruences (R; | i € I) 
on M (that is, ‘sortwise’ equivalence relations on M which are sub-b-structures 
of M x M) such that each of the quotients M/R; is a T-model, the same is 
true of M/((),<,; Ri) (since it maps injectively into the product of the M/R;). 
Hence there is a unique smallest congruence R on M such that M/R is a 
T-model, namely the intersection of all such congruences. It is then clear that 
the assignment M ++ M/R defines a left adjoint to the inclusion T-Mod (Set) — 
y-Str(Set), the unit of the adjunction being the quotient map. 

(ii) It follows easily from (i) that regular epimorphisms in T-Mod(Set) are 
preserved by the inclusion into U-Str(Set), and hence that they are exactly the 
(sortwise) surjective homomorphisms of T-models. Hence they are stable under 
pullback, and T-Mod(Set) is regular. o 


The category Poset of partially ordered sets is not regular: if n denotes an 
n-element totally ordered set, then there is a regular epimorphism 2 H 2 — 3 
in Poset (obtained by identifying the top element of one copy of 2 with the 
bottom element of the other), whose pullback along the third injection 2 — 3 
is (monic, and hence) not a cover. Since we saw in 1.1.7(c) that posets are the 
models of a Horn theory, this shows that the restriction on X in the statement 
of 2.4.4 cannot be omitted — and also that there can be no analogue of 1.4.9 for 
Horn theories, as we claimed in Section D1.4. 

However, Horn theories over signatures without function symbols also have 
their limitations. We note that Poset, although not a regular category, resembles 
a coherent category to the extent of having a strict initial object (cf. A1.4.1), 
and this state of affairs is again typical: 


Lemma 2.4.5 Let © be a signature such that X-Fun is empty, and T a Horn 
theory over E. Then T-Mod(Set) has a strict initial object. 


Proof The initial object M of T-Mod(Set) (the free T-model on no genera- 
tors) may be constructed as follows. MA is empty for each sort A, which forces 
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MR to be empty for each relation symbol R of arity > 1. For nullary relations 
R, we also take MR = @ unless we are forced to do otherwise by the fact that 
(T Fy R) is provable in T. It is straightforward to verify that this &-structure 
is a T-model, and that it is strict initial in T-Mod(Set). o 


Since the initial object of TF is also terminal, we see that the theory of 
1.1.7(d) cannot be Morita-equivalent to a Horn theory over a signature without 
function symbols. (However, 2.4.5 does not characterize the model categories of 
Horn theories over relational signatures: the same argument would work if & 
contained function symbols of arity > 1, provided there were no constants.) 

The property which distinguishes Horn theories (over arbitrary signatures) 
from arbitrary cartesian theories is the presence of ‘enough projectives’ in their 
categories of models. 


Lemma 2.4.6 Let T be a Horn theory over a signature X. Then 


(i) Any cover h: M =N in T-Mod(Set) is a sortwise surjection, i.e., for 
each sort A of 3, ha: MA — NA is surjective. 


(ii) T-Mod(Set) has a separating set of objects which are projective with 
respect to covers. 


Proof (i) Given a homomorphism h: M — N of T-models, it is clear that the 
sortwise image of h can be made into a substructure of N, and hence a T-model 
by 2.4.3(ii). So if h is a cover this image must be the whole of N, i.e. h is sortwise 
surjective. 

(ii) For each sort A of X, the functor T-Mod(Set) — Set which sends M 
to MA is representable, by 2.4.1. Part (i) implies that the representing objects 
(x: T) are projective with respect to covers; and since the family of all such 
functors, as A ranges over the sorts of X}, is clearly faithful, they form a separat- 
ing set. 0 


If the signature © has no relation symbols, we may strengthen ‘separating’ 
to ‘generating’ in 2.4.6(ii), by essentially the same proof. 


Example 2.4.7 We may now prove, as promised above, that the theory of 
categories cannot be a Horn theory over any signature: for covers (in particular, 
regular epimorphisms) in Cat do not have to be surjective on morphisms, from 
which one may easily deduce that the only cover-projectives in Cat are discrete 
categories. And since it is possible for two functors to agree on objects but not 
on morphisms, these do not form a separating family for Cat. 


For any cartesian theory T, the category T-Mod(Set) is closed in =-Str(Set) 
not only under products, as we observed after 2.4.3, but also under equalizers 
(by a similarly straightforward proof) and hence under arbitrary limits. (Of 
course, we knew already from 2.3.6 that T-Mod(Set) has all small limits - and 
colimits.) Thus the theory of divisible abelian groups, mentioned in 1.1.7(f), 
is not expressible as a cartesian theory over the signature for groups, since the 
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category Div of divisible abelian groups is not closed under equalizers in Ab. In 
this case Div is complete and cocomplete, since it is coreflective in Ab; but it is 
not locally finitely presentable — it can be shown that the only finitely presentable 
object of Div is the trivial group — and so it cannot be the category of models 
of a cartesian theory over any signature. Similar remarks apply to the regular 
theory considered in 2.3.9. 

For regular theories, we still have closure under products, as mentioned after 
2.4.3, and hence in particular the category of models has products, though it may 
lack equalizers. (A simple counterexample is the theory O; of inhabited objects 
(cf. C5.2.8(c)), i.e. the theory over the signature with one sort and no primitive 
symbols except equality, whose only axiom is (T Hp (dz)T): its category of 
models is the category of nonempty sets, and the two morphisms 1 3 2 in 
this category have no equalizer.) Thus the theory of fields, considered in 1.1.7(h), 
cannot be Morita-equivalent to a regular theory over any signature, since its 
category of models lacks a terminal object. 


Remark 2.4.8 As previously, we shall not give detailed consideration to the 
class of disjunctive theories, introduced in 1.3.6; but it seems worth mention- 
ing that if T is a disjunctive theory over a signature Ł, then T-Mod(Set) is 
closed under arbitrary connected limits in U-Str(Set), and in particular it has 
connected limits. The proof is, as usual, a straightforward induction to show 
that the interpretations of disjunctive formulae are ‘preserved’ under connected 
limits. Using this result we may show that, for example, the theory of local rings 
(1.1.7(g)) cannot be Morita-equivalent to a disjunctive theory: for the diagram 


Z2) 


Z(3) >> Q 


of local rings (where Zp) denotes the ring of rationals whose denominators are 
prime to p) has no pullback — indeed, there is no cone over it in the category of 
local rings, since in any local ring either —2 or +3 is invertible. 


We saw in the last section that the category of models of a geometric theory, 
although it is always accessible, need not be finitely accessible. Nevertheless, it 
does always possess filtered (that is, w-filtered) colimits. To prove this, we need 
another closure result, similar to those already established: 


Lemma 2.4.9 Let T be a geometric theory over a signature ©. Then 
T-Mod(Set) is closed under filtered colimits in U-Str(Set). 


Proof Let (M; |i € I) be the vertices of a filtered diagram of X-structures, 
with colimit N. Since the structure used in interpreting geometric formulae is 
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all derived from finite limits and arbitrary colimits, and these commute with 
filtered colimits in Set, it is straightforward to verify that, for any geometric 
formula-in-context 7. ¢, the interpretation [Z.¢]x is the colimit of the JZ. o] m,- 
Hence N satisfies any geometric sequent which is satisfied by all the M;. 0 


In view of 3.1.13 below, 2.4.9 is really no more than a restatement of 
(a particular case of) Lemma B3.4.8. 


Example 2.4.10 We may now show that the theory T of posets in which 
every element lies below a maximal element, described in 1.1.7(2), is not Morita- 
equivalent to a coherent theory (or even a geometric theory). For every natural 
number n, the n-element totally ordered set n = {0,1,...,n — 1} is a model 
of T, and the inclusions n — m for n < m are T-model homomorphisms, 
which form a filtered diagram in T-Mod(Set). The colimit of this diagram in 
&-Str(Set) is the ordered set of natural numbers, which is not a model of T; 
so it is immediate that T cannot be axiomatized by geometric sequents over the 
signature ©. But in fact it is not hard to see that this diagram has no colimit 
in T-Mod(Set) — although we can construct a cone under it by adjoining a top 
element to the set of natural numbers, this is not a colimit because maximal 
elements do not have to be preserved by T-model homomorphisms - and so T 
cannot be equivalent to a geometric theory over any signature. A straightforward 
generalization of this argument, using successor ordinals regarded as T-models, 
shows that T-Mod(Set) does not have «-filtered colimits for any «; hence it 
is not accessible, and so T is not ‘Morita-equivalent’ to any sketch, let alone a 
finitary one. 


On the other hand, if T is any (finitary) first-order theory, then it fol- 
lows from 1.5.13 and 2.4.9 that T-Mod(Set), does have filtered colimits. And 
M. Richter {1017] has shown that, for any such T, any filtered colimits which 
exist in T-Mod(Set) must be preserved by the forgetful functor to U-Str(Set): 
that is, if the colimit in ©-Str(Set) of a filtered diagram of T-models fails to 
be a T-model, then the diagram cannot have a colimit in T-Mod(Set). (See 
also [1059].) 

In a similar vein, the theory of metric spaces cannot be axiomatized by 
geometric sequents over the signature of 1.1.7(l), since its models fail to be closed 
under filtered colimits (take a directed diagram of two-point spaces indexed 
by the natural numbers, such that the distance between the two points in the 
nth space is 27”). However, the category of metric spaces (and non-expansive 
mappings) does have filtered colimits, since it is reflective in the category of 
pseudometric spaces, and we observed in 1.1.7(l) that the latter are the models 
of a geometric theory. 

To distinguish finitary first-order theories from infinitary ones, the best tool 
we have available is the notion of ultraproduct, which in most cases yields a 
‘relative’ rather than an ‘absolute’ criterion. Given a family of D-structures (A; | 
i € I) and a filter ® on (the power-set of) I, we define the filterproduct [| Ai 
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to be the quotient of the product Ile r A; obtained by identifying two elements 
(a; | i € I) and (a; | i € J) iff the set {i € I | a; = a4} lies in ®. (This definition 
requires some modification if (some of the sorts of) some of the A; are allowed 
to be empty, but we shall not bother about this.) It is clear that [], A; can be 
given a -structure in a canonical way. If ® is an ultrafilter (in which case we call 
I] A: an ultraproduct), then an easy induction shows that for any first-order 
formula-in-context Z.¢, an n-tuple of elements b),...,6, of [[ẹ Ai lies in the 
interpretation of Z.¢ iff, given any choice of representatives (a,; | i € I) for each 
b,, the set 


{i € I | (aiis. aera) € [z.d] 4, } 
is in ®. We thus obtain the result known to model-theorists as Los’s Theorem: 


Lemma 2.4.11 For any (finitary) first-order theory T over a signature X, the 
class of T-models is closed under ultraproducts in the class of S-structures. O 


The reason why this does not lead to an absolute condition characterizing 
model categories of first-order theories is that the ultraproduct construction 
cannot be characterized in abstract categorical terms; it is a piece of extra 
structure carried by the model category. The ‘ultracategories’ studied by 
M. Makkai [778, 780] are in effect categories equipped with extra structure of 
this type. 


Examples 2.4.12 (a) Nevertheless, we may now show (as promised in 
2.1.4(f)) that the theory of connected directed graphs, although it can be axiom- 
atized by a finitary sketch, cannot be axiomatized by any finitary first-order 
theory over the signature for directed graphs. For each natural number n > 0. 
let Gn, denote the graph with n vertices 1,2,...,n and a chain of (n — 1) arrows 
joining them; then Gn is connected for each n. But if ® is any non-principal 
ultrafilter on the set N of natural numbers, then []ẹ Gn is not connected, since 
the vertices which are the equivalence classes of the sequences (1 | n € N) and 
(n | n € N) cannot be joined by any finite chain. 

(b) In the same way, we may construct ultraproducts of torsion groups 
which have elements of infinite order; so the geometric theory of torsion groups. 
described in 1.1.7(7), cannot be axiomatized by finitary sequents over the 
signature for groups. 

(c) The theory K of K-finite objects, axiomatized in 1.1.7(k), cannot be 
Morita-equivalent to a finitary theory over any signature. To see this, note that 
if (A; | 7 € I) is any family of models of a finitary theory, and © is an ultrafilter 
on J, then the ultraproduct [],, Aut(A,) of the automorphism groups of the A, 
embeds in the obvious way as a subgroup of the automorphism group of [[ẹ 4:- 
But K has models (in Set) with arbitrarily large finite automorphism groups; sO 
a nontrivial ultraproduct of these would have to have an infinite automorphism 
group. 
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D3 
CLASSIFYING TOPOSES 


D3.1 Classifying toposes via syntactic sites 


We saw in Section D1.4 that, for each cartesian (resp. regular, coherent, 
geometric) theory T, there is a category Cr of the appropriate kind contain- 
ing a model of T which is generic, in the sense defined there. Since we are mainly 
interested in models which live in toposes, it would be convenient to be able to 
construct a topos containing a model of a given theory which is generic amongst 
models in toposes. There are various methods for doing this. One, which was 
sketched in Section B4.2, relies on the 2-categorical structure of (Grothendieck) 
toposes, in order to build up (the topos containing) the generic model by an 
inductive process. Here we shall give a more ‘pedestrian’, but also more explicit, 
method, which has the advantage that the explicit description of the generic 
model which we obtain allows us to establish results about models of a the- 
ory by first proving them in the generic case, and then transferring them along 
inverse image functors. (We shall see several examples of this technique in the 
subsequent sections of this chapter, as well as in Chapters F1 and F4.) 

The essence of this construction is to use the syntactic category Cr itself as 
the underlying category of a site (the syntactic site of the theory), by imposing 
a suitable coverage J on it in such a way that cartesian (resp. regular, coher- 
ent, geometric) functors defined on Cr are equivalent to geometric morphisms 
(over Set) into Sh(Cr, J). (As in Section B4.2, we shall denote this topos by 
Set[T], and it will often be fruitful to think of it as the category of all ‘geometric 
constructs’, in the sense defined there, in the theory T.) There are (at least) 
four different versions of the construction, corresponding to the four different 
constructions of Cr appropriate to the different fragments of logic under con- 
sideration. As in Section D1.4, we shall take them in order of increasing logical 
complexity. (Of course, every cartesian theory is regular, every regular theory is 
coherent, and every coherent theory is geometric, so we don’t really need four 
different constructions — the one for geometric theories would suffice. However, 
the emphasis in this section is not on the mere existence of classifying toposes, 
but rather on obtaining the most explicit description possible of the classifying 
topos, and in particular of the generic model, of a given theory.) 
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For cartesian theories, in fact, no coverage is needed. We saw in 1.4.7 that, 
for any cartesian theory T, we have a natural equivalence 


T-Mod(€) ~ Catt (Cr, €) 


for any cartesian category €. But if € is a Set-topos, then we also have a natural 
equivalence 


Cart (Cr, E) ~ Top/Set (E, [(Cr)°?, Set]) 


by Diaconescu’s theorem (B3.2.7). So, on combining these two equivalences, we 
deduce 


Theorem 3.1.1 Let T be a cartesian theory. Then the topos [(Cr)°?, Set] 
contains a T-model Gr such that, for any Set-topos E, the functor 


Top /Set (E, [(Cr)°?, Set]) —> T-Mod(€) 


which sends a geometric morphism f to f*(Gr) is one half of an equivalence of 
categories. 0 


Since the second of the two equivalences above is obtained (in one direction) 
by composing with the Yoneda embedding Cr — [(Cr)°?, Set], the generic model 
Gr of 3.1.1 is simply the image under this embedding of the generic model Mr of 
1.4.7. But in 2.4.1 we saw that the representable functors on Cr may be identified 
with finitely presented T-models in Set. Thus we obtain 


Corollary 3.1.2 For a cartesian theory T, the classifying topos Set[T] may be 
identified with the functor category [T-Mod(Set),,, Set], where T-Mod(Set),, 
denotes the category of finitely presented T-models; and Gr may be identified with 
the inclusion functor T-Mod(Set),, + T-Mod(Set), regarded as in 1.2.14(i) as 
a T-model in the functor category — that is, for each sort A of the signature of 
T, GrA ts the functor which sends a finitely presented T-model M to MA, and 
similarly for function symbols and relation symbols. o 


This explicit description of Gr will be of use on many occasions later on. 
Notice, by the way, that the syntactic category Cr is recoverable up to equivalence 
from the classifying topos Set[T], since it is dual to the full subcategory of finitely 
presentable objects of Top/Set (Set, Set[T]). That is, the classifying topos of a 
cartesian theory determines the theory up to Morita equivalence. 

As an example of the applications of 3.1.2, we give 


Corollary 3.1.3 Let T be a finitary algebraic theory in the sense of 1.1.7(a). 
Then the generic T-model Gr is functionally complete in the sense that, for any 
natural number n, each morphism (Gr)” — Gr in Set[T] is the interpretation 
in Gr of some n-ary derived operation of T (that ts, of some term-in-context 
L1,...,Z7.t). Moreover, two such terms-in-context define the same morphism 
(Gr)" — Gr iff the sequent expressing their equality is provable in T. 
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Proof Since (the underlying object of) Gr is the forgetful functor from finitely- 
presented T-models to Set, it is representable by the free T-model F(1) on one 
generator; similarly, (Gr)” is representable by the free model F(n) on n gener- 
ators. So the Yoneda lemma tells us that morphisms (Gr)” — Gr correspond 
bijectively to morphisms F'(1) — Fn), or equivalently to elements of the under- 
lying set of F (n); but the latter are exactly the provable-equality-classes of n-ary 
derived operations. go 


For example, if T is the theory of commutative rings (with 1), then the 
morphisms (Gr)” — Gr in Set/T] are exactly the polynomial functions of n 
variables (with integer coefficients). Although we stated 3.1.3 only for single- 
sorted theories, it remains valid (with the same proof — but a more complicated 
statement) for many-sorted algebraic theories as defined in 1.1.7(6). 

We turn now to regular theories. We recall that if C is a small regular cate- 
gory, the regular coverage R on C has as covering sieves all sieves which contain 
a cover in the sense defined in Section A1.3. In A2.1.11(a), we saw that the 
representable functors C°P — Set are all sheaves for this coverage, so that the 
Yoneda embedding C — [C°?, Set] factors through Sh(C, R); and the result- 
ing full embedding C — Sh(C, R) is (not only cartesian but) a regular functor. 
Further, for any Grothendieck topos €, a cartesian functor F’: C — E is cover- 
preserving iff it maps each sieve generated by a single cover in C to an epimorphic 
family, iff it maps covers in C to epimorphisms in €, iff it is a regular functor. So 
by C2.3.9 we have an equivalence 


Reg (C, E) ~ Top/Set (E, Sh(C, R)) 
for any such topos €. Applying the above to the syntactic category Cy” of a 
regular theory T, we immediately obtain 


Theorem 3.1.4 For any regular theory T, the topos Sh(C;®,R) contains a 
T-model Gr such that, for any Grothendieck topos E, the functor 


Top /Set (E, Sh(C%*, R)) —> T-Mod(€) 


sending f to f*(Gr) is one half of an equivalence of categories. Moreover, the 
regular sequents satisfied by Gr are precisely those provable in T. 


Proof Combine the regular version of 1.4.7 with the remarks above. The T- 
model Gr is of course the image of Mr under the embedding Cy’ — Sh(C;*); 
the last assertion of the theorem follows from the fact that this embedding is full 


and therefore conservative. O 


Remark 3.1.5 Since any topos € is effective as a regular category, any regular 
functor Cr ® — E factors through the effectivization Er of Cy’ (cf. 1.4.12(i)). So 
we could have used Er, with its regular coverage, as a site of definition for the 
classifying topos of T. This fact, combined with the existence of small regular 
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categories which are not effective, indicates that we cannot hope to recover the 
syntactic category Cņ® up to equivalence from the classifying topos Set[T], as 
we could in the case of cartesian theories. However, we shall see in 3.3.10 below 
that we can recover Er up to equivalence from Set/T] — that is, we can recover 
the theory T up to Morita equivalence as defined after 1.4.12. 


Remark 3.1.6 We noted after 1.4.11 that if T is a cartesian theory then its 
regular syntactic category Cy ® is equivalent to the regularization of its cartesian 
syntactic category Cr. But in C2.2.19 we noted that, for any small cartesian 
category C, the topos of sheaves on Reg(C) for the regular coverage is equivalent 
to [C°P, Set]. Thus we have a direct proof that 3.1.1 and 3.1.4 produce equivalent 
constructions of Set[T]. 


There is an alternative construction of Set[T], less ‘canonical’ but often more 
convenient to use, which in some sense stands in the same relation to 3.1.4 as 
3.1.2 does to 3.1.1. Given a regular theory T, we can always find a cartesian 
theory To over the same signature, all of whose axioms are provable in T. (We 
could take To to be the empty theory in this signature, or the theory consisting 
of all the Horn sequents provable in T; but Tp need not be a Horn theory - 
for example, if T were the theory of strongly connected categories (categories 
in which, given any two objects, there exists a morphism from the first to the 
second; cf. A4.6.9), it would be natural to take To to be the theory of categories.) 
Then, by 1.3.10(ii), we may write the axioms of T (other than those which occur 
as, or are derivable from, axioms of To) in the form (¢ Fz (Sy)w), where £.¢ 
and z,y.% are cartesian formulae-in-context relative to To (indeed, they could 
be Horn formulae), and the sequent (Y tz, ¢) is provable in To. This latter 
condition means that, in the syntactic category Cr, of To, we have a morphism 


wWA(E= a2! eae 
£90) OS eae a 


and a To-model M in a regular category C is a T-model iff, for each axiom of 
T, the functor Fm corresponding to M via 1.4.7 sends this morphism to a cover 
in C. Thus we obtain 


Proposition 3.1.7 With the above notation, the classifying topos Set|T| 
may be identified with Sh(C, Jr) where C is the syntactic category of To (equiv- 
alently, the opposite of the category of finitely presented To-models in Set), and 
Jr is the smallest coverage on this category for which the sieve generated by the 
single morphism displayed above is Jy-covering for each axiom of T not occur- 
ring among (or derivable from) the axioms of To. O 


The usefulness of this result is greatly dependent on the extent to which 
we can give an explicit description of the coverage Jr, and in particular on 
whether we can prove it to be subcanonical (i.e. whether we can prove that all 
representable functors are sheaves for it). This is because the generic T-model Gr 
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is simply the associated Jr-sheaf of the generic To-model; but since the latter 
consists of representable functors by 3.1.2, it will already be a sheaf if Jr is 
subcanonical, and therefore we shall have an explicit description of Gr at no 
extra cost. Clearly, the question whether Jr is subcanonical will depend (not 
just on T but) on the choice of the theory To: the stronger we make this theory, 
the fewer the covering families we shall have to put into Jr, and the more likely 
it becomes that we shall obtain a standard site. 


Example 3.1.8 If T is the theory of strongly connected categories, mentioned 
above, and we take To to be the theory of categories, then 3.1.7 tells us to find 
the smallest coverage on the opposite of the category Cat,, of finitely presented 
categories for which the inclusion 
EE 

where 2 is the two-object discrete category and 2 is the two-element totally 
ordered set, generates a covering sieve on 2. By the results of Section C2.1, a 
functor F: Cat,, — Set will be a sheaf for this coverage iff it satisfies the sheaf 
axiom for (the sieves generated by) the pushouts of i along arbitrary morphisms 
2 — C, i.e. for the sieves generated by morphisms of the form C — C[f], where 
C[f] denotes the category obtained by freely adjoining a morphism f between 
two given objects of C. But it is easy to verify that all such morphisms are regular 
monomorphisms in Cat (i.e. they are the equalizers of their cokernel-pairs), and 
hence in Cat,,; so the representable functors are indeed sheaves. Thus we may 
describe the generic strongly connected category explicitly: it is simply given by 
the same data as the generic category, but regarded as an internal category in 
the sheaf topos Sh(Cat°?, Jr) rather than in the functor category [Cat,,, Set]. 

On the other hand, if we took Tg to be the ‘Horn part’ of the theory of 
categories (i.e. the theory obtained from that of categories by omitting the third 
displayed axiom of 1.1.7(e); we may think of models of this theory as ‘partial 
categories’, in which some composites may be undefined), then our coverage on 
C°P (where C is the category of finitely presented partial categories) would have 
to contain the sieve generated by the inclusion 


e——_~> °°. 6+ 


where, in the right-hand partial category, z is specified as being the composite of 
x and y. And this morphism is a non-invertible epimorphism in C, so it cannot 
be a regular monomorphism; hence not all representable functors are sheaves for 
this coverage. 
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We leave it to the reader to verify that the classifying topos for the theory of 
divisible abelian groups (1.1.7(f)) may be constructed by means of a subcanon- 
ical coverage on the opposite of the category Ab,, of finitely-presented abelian 
groups. 

For coherent theories, the arguments are not very different from those for 
regular theories. If C is a small coherent category, we defined the coherent cover- 
age P on C in A2.1.11(b) to consist of all finite families (f;: A; ~ A|1<i<n) 
such that the union of the images of the f; is the whole of A. We observed 
that this coverage is subcanonical, so that the Yoneda embedding becomes a full 
embedding C — Sh(C, P), and that this functor is coherent. And it is easy to 
verify that a cartesian functor from C to a topos sends finite families as above 
to epimorphic families iff it preserves covers and finite unions of subobjects, i.e. 
iff it is a coherent functor. So we obtain 


Theorem 3.1.9 For any coherent theory T, the topos Sh(C&", P) contains a 
T-model Gr such that, for any Grothendieck topos E, the functor 


Top /Set (£, Sh(C—o", P)) —> T-Mod(€) 


sending f to f*(Gr) is one half of an equivalence of categories. Moreover, the 
coherent sequents satisfied by Gr are precisely those provable in T. o 


As in 3.1.5, we could have replaced C&°" by its pretopos completion Pr 
(cf 1.4.12(ii)) in the statement of the above theorem; and in 3.3.8 below we 
shall see that Pr, though not C$, is recoverable up to equivalence from the 
topos Set|T]. 

In the opposite direction, we can make the underlying category of the site 
smaller, as in 3.1.7, by regarding T as a ‘quotient’ of a cartesian theory To 
obtained by adding new axioms of the form 


n 


($ He VGE) 


i=l 


where z.¢ and F, y;.y; are cartesian formulae-in-context relative to To and the 
sequents (Ypi Hz, @) are provable in To (cf. 1.3.10(iii)). For each such axiom, we 
then have a family of morphisms 


{Z, yi. vi} DA {2 .g|x'/z]} 


(1 < i < n) in the syntactic category Cr,, and a To-model is a T-model iff 
the cartesian functor on Cr, corresponding to it sends these families to covering 
families. Thus we obtain 


Proposition 3.1.10 With the above notation, the classifying topos Set|T] may 
be identified with Sh(C, Jr) where C is the syntactic category of To (equivalently, 
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the opposite of the category of finitely presented To-models in Set), and Jr is the 
smallest coverage on this category for which the sieves generated by the families 
described above are Jr-covering. O 


As with 3.1.7, this result is most useful when the coverage Jr can be explicitly 
determined and shown to be subcanonical. We give a couple of examples: 


Examples 3.1.11 (a) If T is the theory of local rings, as described in 1.1.7(g), 
then it is natural to take To to be the (algebraic) theory of rings (throughout 
these examples, the word ‘ring’ will mean ‘commutative ring with 1’). In this 
case, the resulting coverage on the opposite of the category Rng, of finitely 
presented rings has an explicit description: it is exactly the Zariski coverage Z 
described in A2.1.11(f), i-e. the covering sieves are those which contain finite 
families of ring homomorphisms of the form (A — A[s,"] | 1 < i < n), where 
{5,,...,5n} is a set of elements of A contained in no proper ideal of A (equiva- 
lently, such that there exist elements a),...,an of A with }>"_, aisi = 1). It is 
also a well-known result that this coverage is subcanonical; so the generic local 
ring is simply the forgetful functor Rng,, — Set, regarded as a ring object in the 
topos Sh(Rng??, Z). In particular, we note that it inherits the functional com- 
pleteness property of 3.1.3 from the generic ring. (We shall see some applications 
of this fact in Section F1.1.) 

(b) Similarly, if T is the (coherent) theory of fields described in 1.1.7(h), we 
might take To to be the theory of rings. If we did so, then our coverage on Rng, 
would have the property that, for any finitely presented ring A and any s € A, 
the two morphisms A — A[s~'] and A — A/(s) (where (s) is the principal ideal 
generated by s) would generate a covering sieve. Since both these morphisms 
are epic in Rng, and their pushout is the degenerate ring 1 (which is covered 
by the empty sieve, because of the nontriviality axiom), it follows that if a 
functor F: Rng, — Set is a sheaf then we have F(A) S F(A[s~*]) x F(A/(s)) 
for all such A and s. But the representable functors do not in general satisfy 
this condition (for the forgetful functor, it holds iff s is idempotent), and so the 
coverage is not subcanonical. 

Nevertheless, there is a cartesian theory To such that the classifying topos for 
fields may be presented using a subcanonical coverage on the syntactic category 
of To, namely the theory of (von Neumann) regular rings. This is obtained from 
the theory of rings by adding the axiom 


(T Hr (Ay)((x?y = 2) A (z4? = y))). 


It is an easy exercise in ring theory to verify that the latter axiom is cartesian 
relative to the theory of reduced rings (rings without nonzero nilpotent elements). 
and that it implies the Horn sequent ((z? = 0) +, (x = 0)) which says that a 
ring is reduced. For more details, see [520, V 2.6]. 


For geometric theories T, we may proceed in the same way as for coherent 
ones: that is, we may equip the geometric category C ™ with its canonical cov- 
erage (consisting of all sieves generated by small covering families, cf. C2.1.12(e)) 
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and then show as in 3.1.9 that the topos of sheaves on this site is a classifying 
topos for T. Alternatively, we could proceed as in 3.1.10, by finding an appropri- 
ate cartesian theory To of which T is a quotient, and imposing a suitable coverage 
on the syntactic category of To. However, in this case we have a possible short 
cut. The oo-pretopos Gr generated by C#°°™" (cf. 1.4.12(iii)), though not small, 
has a separating set of objects, namely (the images of) those objects {%.¢} of 
CR °™ for which ¢ is in the canonical form of 1.3.8(ii). (It is clear that these form 
a separating set for CF°°™", since they meet every isomorphism class of objects of 
this category; and since every object of Gr is a quotient of a coproduct of objects 
of C°°™, the result easily extends to the larger category.) Hence by Giraud’s 
Theorem ©2.2.8 Gr is itself a Grothendieck topos. Moreover, by C2.2.10 every 
geometric functor from Gr to a Grothendieck topos is an inverse image functor 
(i.e. has a right adjoint), and so we may conclude 


Proposition 3.1.12 For any geometric theory T, the oo-pretopos Gr of 
1.4.12(iii) (is a topos and) has the universal property of a classifying topos Set[T] 
for T. o 


Of course, we could view this result the other way round, and regard the pas- 
sage from C#°°™ to the topos Sh(C#°™) as an alternative construction of the 
oo-pretopos generated by this geometric category. Note also that, in this case, 
the fact that the classifying topos determines the theory up to Morita equiva- 
lence is a triviality, since Morita equivalence of geometric theories was defined 


as equivalence of the oo-pretoposes generated by their syntactic categories. 


Remark 3.1.13 Let (C, J) be a small site whose underlying category is carte- 
sian. In 2.1.4(g) we saw how to construct a geometric sketch S (and hence, via 
2.2.7, a geometric theory T) whose models in any Grothendieck topos € cor- 
respond to cartesian cover-preserving functors C — E. More explicitly, T may 
be obtained from the theory of cartesian functors defined on C, as described in 
1.4.8, by adding the sequent 


(The VGu: BQ) = z)) 


i€lI 


for each J-covering family (f;: B; — A | i € I). Combining this with C2.3.9, we 
see that Sh(C,T) has the universal property of a classifying topos for T. But 
since every Grothendieck topos may be generated by such a site (C2.2.8(iii)), 
this says that every Grothendieck topos is (equivalent to) the classifying topos of 
some geometric theory. In combination with the remarks above, we may interpret 
this result as saying that a Grothendieck topos is ‘the same thing’ as a Morita 
equivalence class of geometric theories - or perhaps more accurately, that the 
notion of Grothendieck topos represents the ‘extensional essence’ of the intuitive 
(intensional) notion of a geometric theory ‘up to Morita equivalence’. 


Remark 3.1.14 Suppose T is a geometric propositional theory as defined in 
geom 


1.1.7(m). In 1.4.14 we saw that the syntactic category C$ is equivalent to 
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a frame (L, say), and that T is Morita-equivalent to the theory of completely 
prime filters of L. It follows that Set[T] ~ Gr ~ Eff(Pos,,(L)) may be identified 
with the topos Set(L) ~ Sh(X), where X is the locale defined by O(X) = L 
(cf. the remarks at the end of Section A3.3, and C1.3.11). Thus we may extend 
the equivalence of 1.2.15(m) from localic toposes to arbitrary Set-toposes: for 
any such topos €, we have 


T-Mod(€) ~ Top (E, Sh(X)) . 


In particular, we recover the result of B4.2.12 that the classifying topos of any 
propositional geometric theory is localic over Set (and that any localic Set-topos 
occurs as the classifying topos of such a theory). 


Clearly, if T is Morita-equivalent to a propositional geometric theory (equiv- 
alently, if Set[T] is localic over Set), then T-Mod(€) is an essentially small 
preorder for any Set-topos €. However, the converse is false: the following 
counterexample is due to M. Makkai. 


Example 3.1.15 Let T be the theory over the signature © with one sort A 
and two infinite sequences Rn, Sn of unary relation symbols, whose axioms are 
(T Fe (Rn(£) V Sn(x£))) and ((Rn(£) A Sn(x)) ke L) for all n, plus 


(Trey (=v V (Balt) A Sato) ¥ (Sala) A Raly)))) ) 


n>0 


Thus a T-model M in a Set-topos € consists of an object MA together with a 
complemented subobject [R] — MAx N (where N is the natural number object 
of €, i.e. the countable copower of 1), satisfying a condition which in particular 
implies that the transpose M A — Q” of the classifying map of this subobject is 
monic. (If € is Boolean, the last axiom says precisely this; but in a non-Boolean 
topos it is stronger.) Moreover, since a morphism f: M — M’ of T-models must 
preserve the interpretations of both the Rn and the Sn, it is easy to see that it 
must make the diagram 


MA ^> M'A 


QN 
commute; thus T-Mod(£) is equivalent to a sub-poset of Sube (Q^) (and to the 
whole of this poset if € is Boolean). 
However, Set[T] is not localic over Set. To see this, note first that T has 2° 
non-isomorphic models in Set, where c = 2% is the cardinality of the continuum. 
But 1.3.8 imples that the Lindenbaum algebra of T is countably generated as a 
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complete join-semilattice, since there are only countably many regular formulae 
over X; in other words, the corresponding locale X has a countable basis in 
the sense of C2.2.4(b). And a locale with a basis of cardinality x can have at 
most 2” points, since a completely prime filter in O(X) is determined by the 
basis elements which it contains (cf. C1.2.2); so Sh(X) cannot be equivalent to 
Set[T]. 


We next fulfil a promise made in 1.5.12: 


Proposition 3.1.16 (‘Classical completeness’ for geometric theories) 
Let T be a geometric theory, and o a geometric sequent over the signature of 
T. If o is satisfied in all T-models in Boolean toposes, then it is provable in T. 


Proof By A4.5.23, there exists a surjective geometric morphism 
f: B — Set[T], where B is Boolean. So ø is satisfied in the T-model f*(Gr), 
and hence by 1.2.13 in Gr itself; but the latter is equivalent to provability in T. 

o 


In particular, it follows that if a geometric sequent ø is derivable from 
the axioms of a geometric theọry T using ‘classical geometric logic’ (i.e. the 
rules of geometric logic plus the Law of Excluded Middle), then there is also a 
constructive derivation of ø, not using the Law of Excluded Middle. 

We remark that, since every Grothendieck topos is a surjective image of 
a localic one (cf. C5.2.1), and since the surjection constructed in A4.5.23 is 
localic (cf. A4.6.2(e)), there is a possible further strengthening of 3.1.16: in fact 
provability in a geometric theory T may be determined by considering satisfaction 
in T-models in localic Boolean toposes. This has the advantage that all such 
toposes satisfy the Axiom of Choice (cf. 4.5.15 below); so we obtain a further 
conservativity result like that just mentioned, asserting that uses of the Axiom 
of Choice may be eliminated from any derivation of a geometric sequent from 
geometric hypotheses. 

To conclude this section, we digress for a while from its main theme to con- 
sider full first-order theories. Since inverse image functors are not in general 
Heyting functors, we cannot hope to have a classifying topos (in the sense we 
have considered so far) for such a theory. Nevertheless, it is of interest to ask 
whether there is a ‘completeness theorem’ for such theories in terms of a single 
model in a Grothendieck topos, similar to that which we derived for geometric 
theories from the existence of a classifying topos. The answer is yes; indeed the 
topos in question may be taken to have a particularly simple form, as we shall 
see. To show this, we need a result of which the first part, at least, could have 
been included in Section A1.4 (and the second in Section A2.1): 


Lemma 3.1.17 Let C be a small Heyting category. Then 


(i) The Yoneda embedding Y : C — [C°?, Set] preserves universal quantification. 


(ii) If J denotes the coherent coverage on C, then the Yoneda embedding 
Y:C — Sh(C, J) is a Heyting functor. 
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Proof (i) Let f: A — B be a morphism of C, let A’ — A be a subobject 
of A and let B’ = V;(A’). We must show that if R — Y(A) is any subobject 
whose pullback along Y(f) is contained in Y(A’), then R < Y(B’). We recall 
that subobjects of Y(B) in [C°?, Set] correspond to sieves on B in C, and that 
the pullback of R along Y(f) is the sieve of all morphisms g: C — A such that 
fg € R. Now if h: D— B € R, form the pullback 


C ———> D f 


A P ae B 
then g € f*(R), so by assumption it factors through A’ — A. Let D > I — B 
be the image factorization of h; since image factorizations in C are stable under 
pullback, we have f*(I) < A’, and hence I < V;(A’) = B’, as required. 

(ii) It is easy to see that the argument of (i) continues to work if we replace 
[c°P, Set] by Sh(C, J) for any subcanonical coverage J on C. But in A2.1.11(b) 
we observed that the coherent coverage on a small coherent category is always 
subcanonical, and also that for this particular coverage the Yoneda embedding 
becomes a coherent functor. Since it also preserves universal quantification, it is 
a Heyting functor. 0 


Corollary 3.1.18 For any first-order theory T, there is a Grothendieck topos 
E and a T-model Ny in E such that the first-order sequents satisfied in Ny are 
exactly those provable in T. 


Proof Take € = Sh(C%, J) where J is the coherent coverage, and let Nr be 
the image of Mr under the Yoneda embedding. Since the Yoneda embedding 
is full and faithful, it is conservative; so by 1.2.13 and 3.1.17 it preserves and 
reflects the truth of arbitrary first-order sequents. Hence the result follows from 
1.4.11. O 


It is possible to sharpen the result of 3.1.18. We again recall from C5.2.1 
that, for any Grothendieck topos £, there exists an open surjection f: F > € 
where F is localic (over Set); and from C3.1.7 that the inverse image of an open 
surjection of toposes is a conservative Heyting functor. Thus we may require the 
topos E of 3.1.18 to be localic over Set. In fact P. Freyd [381] has shown, by a 
substantially different method, that we may require it to be the topos of sheaves 
on a Stone space (that is, a compact zero-dimensional space ~ equivalently, the 
prime ideal space of a Boolean algebra); and that if our signature © is countable 
(so that the category clo is also countable) we may actually take E£ to be the 
particular topos Sh(K), where K is the Cantor space. (In other words, Sh(K) 
plays a role in constructive first-order logic analogous to that of Set in classical 
first-order logic.) 
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It is also possible to generalize 3.1.18 to infinitary first-order logic, as defined 
in 1.1.3(g). However, in contrast to the coherent/geometric case, we cannot 
work with the whole of infinitary first-order logic in constructing our syntactic 
category: the reason is that the resulting category will not in general be small, 
or even equivalent to a small category. (Even in the case of a propositional 
Signature with no axioms, the syntactic category would be equivalent to the 
free complete Heyting algebra on the set of atomic propositions, and if there is 
more than one atomic proposition then the latter is known to be a proper class; 
cf. [550].) We must therefore restrict ourselves to «-first-order logic for some 
infinite regular cardinal «, that is the fragment in which we allow conjunctions 
and disjunctions indexed by sets of cardinality less than x, but not any larger 
ones. (Provided we retain the convention that a theory is specified by a set 
of axioms, it is clear that for any infinitary first-order theory T we can find a 
cardinal « such that T lies in «-first-order logic.) Given a theory T in «-first-order 
logic, we may build its syntactic category C%' in the usual way, and verify that 
it is a (small) x-Heyting category (that is, a Heyting category having unions 
and intersections of families of subobjects of cardinality less than «x, which are 
stable under pullback), and that «-Heyting functors from Cf! to another such 
category D correspond to T-models in D. If we equip C&'° with the coverage 
J,, whose covering families are ‘T-provably surjective’ families of cardinality less 
than x, then we may further verify that J, is subcanonical and that the Yoneda 
embedding C%'° — Sh(C&', Jẹ) is a k-Heyting functor. Thus we obtain 


Proposition 3.1.19 Let T be any theory in «-first-order logic. Then there 
exists a Grothendieck topos E (which may be taken to be localic over Set) con- 
taining a T-model which satisfies exactly the K-first-order sequents which are 
provable in T. a 


However, this construction is not universal in any sense: in particular, dif- 
ferent choices of the cardinal « will yield inequivalent toposes. For more details. 
see [212]. 


Suggestions for further reading: Butz & Johnstone [212], Cole [245], Dubuc 
& Reyes [309], Tierney [1169], Wraith [1246]. 


D3.2 The object classifier 


In this section we shall study one particular classifying topos: namely that for 
the theory of objects, i.e. the empty theory over the signature with one sort and 
no primitive symbols except equality. (As in Section B4.2, we shall denote this 
theory by O.) Clearly, there is not very much of interest. to be said about O as 
a theory; but its classifying topos Set[O] has some interesting structure, which 
it seems appropriate to explore at this point. 

By 3.1.2, Set[O] may be identified with the functor category [Set ;, Set]. 
where Set is the category of finite sets (or a small skeleton thereof, such as the 
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category of finite cardinals and all maps between them), and the generic object 
Go itself is just the inclusion functor Sets — Set. 

Given an arbitrary object A of a Set-topos €, we have a corresponding geo- 
metric morphism € — [Set ,,Set], which we shall denote by A, and which is 
characterized by the property that A (Go) = A. Explicitly, A corresponds to 
the Set ;-torsor ha: Set? — E given by ha(n) = A”, and A’ sends a functor 
T: Sets — Set to the tensor product hA @set, T, i.e. to the coequalizer of 


[asa [Į æ 


a:m—n,x€T(m) nEN, <ET (n) 


where the maps are induced respectively by the actions of Set; on T and on ha. 
Equivalently, we can describe A’ (T) as the coend f” T(n) x A”. 

In particular, given an object T of Set{O], we have a geometric mor- 
phism T: Set(O] — Set{O} (unique up to isomorphism) such that T” (Go) & T. 
Composition of geometric endomorphisms of Set[O] gives rise to a bifunctor 
Q: Set[O] x Set[O] — Set[O}, defined by 


S@T=T'S (Go), 


so that S&T = S- T. The properties of this bifunctor are summarized by 


Proposition 3.2.1 (Set[O], 9, Go) is a (non-symmetric) monoidal category, 
which is closed on the right, i.e. for each object T the functor (—)@T has a right 
adjoint. 


Proof The associativity and unit isomorphisms for ® are obtained by 
transposing the associativity and unit identities for composition of geometric 
morphisms across the equivalence 


Top /Set (Set[O], Set[O]) ~ Set[O}] . 


It is straightforward to verify that they satisfy the coherence conditions for a 
monoidal structure. The second assertion follows from the fact that (—) ® T is 
isomorphic to the functor T”, which has a right adjoint 7’. O 


What can we say about the monoids for this monoidal structure? Clearly, 
an object T of Set[O] induces an endofunctor Tg of every Set-topos £, defined 
by Te(A) = A(T) (2 A T’(Go)). If T has a monoid structure, then Te has 
a monad structure, which is ‘natural’ with respect to arbitrary inverse image 
functors between Set-toposes. We shall see in Section D5.3 below that such 
monads may be naturally identified with finitary algebraic theories. 

We may give an explicit description of the tensor product, as follows. Spe- 
cializing the coend formula above for A (T) to the case when € = Set[O] and A 
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is a functor S: Set; — Set, we obtain 


(Pe@ sym) = f T) x (S(m))" : 


that is, (T8 S)(m) is the quotient of the disjoint union of the sets T(n) x (S(m))” 
by the smallest equivalence relation which identifies (T f(x), (yi,---,Yn)) with 
(x, (yfi,---,Yyn’)) whenever f:n’ — n in Sets, x E€ T(n’) and yi,...,yn E 
S(m). It is easy to see directly that, if T is (the restriction to Sets of) the 
functor underlying a finitary monad on Set (i.e. a finitary algebraic theory), then 
it carries a ®-monoid structure: for we may identify the elements of T(n) with 
n-ary operations of the theory T, and the multiplication sends (a, (4,,...,8n)) 
to the composite operation 


(z1,---, 2m) RO By Bis wenden) i -< -, Pri 2034 er) 


(Similarly, the unit map Go — T sends the ith element of n = Go(n) to the ith 
projection, considered as an n-ary operation of T.) 

We note that the generic object Go is a functor taking finite sets as values. 
This does not imply that, as an object of Set[O], it is finite in the Kuratowskian 
sense defined in 1.1.7(k); for K-finiteness is preserved by inverse image functors, 
and so if Go were K-finite then every object of every Set-topos would be K-finite. 
However, it does satisfy another, non-geometric, notion of finiteness, namely the 
sequent 


(“(z = y) Fry L), 


which says that it does not have two disjoint elements. For the interpretation of 
this sequent is that the pseudo-complement of the diagonal subobject of Go x Go 
is empty, and this is easily seen to be true since any nonempty subobject of 
Go x Go must contain the unique element of (Go x Go)(1). On the other hand, 
Go is also ‘potentially infinite’ in the sense that, for each natural number m, it 
satisfies the non-geometric sequent 


(Wa) ((@ = y1) V ++ V (£ = Ym)) Fyn, ym +) 


which says that it cannot be ‘covered by m elements’. (We interpret the case 
m = 0 of this sequent as ((Vz)1 Hij) 1), i.e. the assertion that Go is nonempty 
in the ‘negative’ sense, as opposed to the positive sense (T Hj) (4xz)T).) More 
generally, it may be shown that Go satisfies 


((Wa)((z = yi) V ++- V (£ = Ym) V $) Fyi ,...iym $) 


where ¢ is any coherent formula with FV(¢) C {yi,..-,ym}. (The previous 
displayed sequent is of course the special case ¢ = L of this scheme.) We may 
therefore conclude that the above sequents are ‘geometrically inconsequential’, 
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in the following sense: 


Lemma 3.2.2 Leto be a geometric sequent over the signature © with one 
sort and no primitive symbols except equality, and suppose o is (constructively) 
provable in the theory T whose axioms are the sequent (=(4 = y) Fz,, L) and all 
instances of the last displayed sequent on the previous page. Then o is provable 
in the empty theory over X. C) 


In fact it can be shown (cf. [212]) that the theory T of 3.2.2 is the full 
(infinitary) first-order theory of Gg: that is, any sequent satisfied by Go is 
deducible from T. (Equivalently, if A is any T-model in a Grothendieck topos €, 
then A: E — Set[O] is an open map, so that A satisfies all the infinitary first- 
order sequents satisfied by Gg.) 

The study of such ‘inconsequential’ non-geometric properties of generic mod- 
els is not entirely a pointless curiosity. Although 3.2.2 has no known applications, 
we shall see in Section F1.1 how the fact that the generic local ring satisfies a suit- 
able non-geometric ‘field axiom’ enables us to establish results about projective 
geometry over the generic local ring, and then (since these results are expressible 
by geometric sequents) to ‘transport’ them to results about projective geometry 
over an arbitrary local ring. 

An interesting fact about Set[{O] is that every Grothendieck topos € admits 
a localic geometric morphism to it. In order to prove this, we shall introduce a 
weakening of the notion of a bound for a geometric morphism, which we met in 
Section B3.1. 

We recall from B3.1.7 that an object B of an S-topos p: € — S is called 
a bound (for E over S) if every object of E is a subquotient of one of the form 
p*I x B, where I is an object of S. When S = Set, by B3.1.8(6), this is equivalent 
to saying that the subobjects of B form a separating set for €. The generic 
object Go is not a bound for Set[O] over Set; but it is ‘very nearly so’, in the 
following sense. By the Yoneda lemma, the representable functors Sets (n, —) 
form a separating set for Set[O], so their coproduct forms a bound for it; but 
these functors are just the finite powers of the generic object Sets (1, —), so their 
coproduct is the list object L(Go) over Go, as defined in A2.5.17. 


Definition 3.2.3 Given a geometric morphism p:€ — S, where S (and 
hence €) has a natural number object, we shall say that an object B of € is 
a pre-bound for E over S if the list object LB is a bound for E over S. 


If S is the topos Set, this is equivalent to saying that the subobjects of 
finite powers of B form a separating set for E£: one direction is trivial, and 
the converse follows from the fact that any subobject of LB decomposes as a 
coproduct of subobjects of finite powers of B. We recall that any object which 
contains a bound as a subobject (or has one as an epimorphic image) is itself a 
bound; hence in particular any bound is a pre-bound, since LB contains B as a 
subobject. 
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Lemma 3.2.4 For a geometric morphism f:F —€E, the following are 
equivalent: 


(i) f is localic. 


(ii) For any geometric morphism p: E — S and any bound B for E over S, 
f*B is a bound for F over S. 


(iii) There exists a geometric morphism p: E — S and an object B of E such 
that f*B is a bound for F over S. 


(iv) (if E has a natural number object) For any geometric morphism p: E > S 
where S has a natural number object, and any pre-bound B for E over S, 
f*B is a pre-bound for F over S. 


(v) (if E has a natural number object) There exists a geometric morphism 
p:E€ — S where S has a natural number object, and an object B of €, 
such that f*B is a pre-bound for F over S. 


Proof (i)> (ii) follows easily from inspection of the proof, given in B3.1.10(i), 
that a composite of bounded geometric morphisms is bounded. 

(ii) = (iii) is trivial (take p to be the identity morphism on £, and B to be 
its terminal object). 

(iii) > (i): by assumption, every object of F is a subquotient of one of the 
form (pf)*I x f*B, for some I € ob S. But this is isomorphic to f*(p*I x B), 
and hence in the image of f*; so f is localic. 

(ii) > (iv) follows easily from the fact that f* commutes up to isomorphism 
with the list-object functor, as we remarked after A2.5.15. 

(iv) => (v) is similar to (ii) > (iii) (and (iii) > (v) is also trivial). And (v) > (i) 
is similar to (iii) > (i), again using the fact that f* commutes with the list-object 
functor. o 


Theorem 3.2.5 ZLetE be a Grothendieck topos, B an object of E. The following 
are equivalent: 


(i) B is a pre-bound for E over Set. 
(ii) The geometric morphism B: E — Set[O] corresponding to B is localic. 
(iii) There exists a single-sorted geometric theory T and an equivalence E ~ 


Set/T] which identifies B with the underlying object of the generic T- 
model Gr. 


Proof (i) (ii) follows immediately from 3.2.4 and the fact, noted above, that 
Go is a pre-bound for Set[{O] over Set. 

(iii) > (ii): Suppose (iii) holds. Then, given an arbitrary Grothendieck topos 
F and an object A of F, factorizations of the classifying map A: F — Set[O] 
through B correspond to T-model structures on the given object A of F. In other 
words, as a topos over Set[O], € ~ Set[T] becomes the classifying topos for a 
propositional geometric theory in the sense of 1.1.7(m) — the theory of T-model 
structures on the particular object Go of Set{O]. So by B4.2.12 the geometric 
morphism B is localic. 
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Alternatively, we could give a direct proof of (iii) > (i), by observing that 
in the syntactic category C#°™ every object is a subobject of a finite power of 
(the underlying object of) My; but by 3.1.12 Set[T] may be taken to be the oo- 
pretopos generated by C$°°™, and so every object of it is a quotient of a coproduct 
of objects of C#°°™. So the subobjects of finite powers of the underlying object 
of Gr form a separating set for it. 

(i) = (iii): We take a single-sorted signature Ug having an n-ary relation 
symbol [R] for each subobject R — B” in E. (We could also add n-ary function 
symbols corresponding to the morphisms B” — B in E, but since their graphs 
will appear as (n + 1)-ary relation symbols, this is not necessary.) Then B itself 
carries an obvious Ug-structure in E; we define T to be the set of all geometric 
sequents over Xp which are satisfied in this structure. Thus B is a T-model; we 
shall show that the geometric morphism € — Set[T] which corresponds to it is 
an equivalence. 

Consider first the geometric functor Fp : CZ°°™ — E which corresponds to B. 
Since every object {11,...,2n.b} of C#°™ is a subobject of {21,...,an.T} = 
(Mr)” for some n, every object in the image of Fg is a subobject of a finite 
power of B; but the converse is also true, by the construction of Ug. Also, if 
f: R — S is any morphism of € between subobjects of finite powers of B, then 
the sequents which say that (the formula over Lg corresponding to) the graph 
of f represents a morphism of Cf°°™ are satisfied in B and hence provable in 
T; so Fg is full. Similarly, if two parallel morphisms [6] and [6’] of C#°°" have 
the same image under Fg, then @ and 6’ must be provably equivalent in T; so 
Fz is faithful. Since Fg is a geometric functor, it carries covering families in 
ce°™ to epimorphic families in E; but, once again, the definition of T ensures 
that the converse is true. Finally, since B is a pre-bound for € over Set, the 
objects in the image of Fg form a separating set for E; so we are in a position to 
apply the Comparison Lemma (C2.2.3) and conclude that E is equivalent to the 
topos of sheaves on C$ ™ for the canonical coverage. But the latter is exactly 
the classifying topos Set/T]. oO 


Since, by definition, every Grothendieck topos has a bound (and hence a 
pre-bound) over Set, we obtain as an immediate corollary the result, promised 
above, that every such topos admits a localic morphism to Set[O]. (We also 
obtain a new proof of 1.4.13; however, 1.4.13 could alternatively have been used 
to prove this result, by the argument given in the proof of (iii) > (ii) above.) 

We note the following easy consequences of 3.2.5: 


Corollary 3.2.6 Let T be a single-sorted geometric theory, and M a 
T-model in a Grothendieck topos E, classified by a geometric morphism 
f:& — Set[T]. Then 


(i) f is localic iff the underlying object of M is a pre-bound for E over Set. 


(ii) f is surjective iff M is a conservative T-model (i.e. satisfies only those 
geometric sequents derivable from T). 
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Proof (i) Let g: Set[T] — Set[O] classify the underlying object of the generic 
T-model. Then g is localic by 3.2.5, so by A4.6.2(e) and (f) the morphism f is 
localic iff the composite gf is. But gf classifies the underlying object of M, so 
the result is immediate from 3.2.5. 

(ii) One direction is immediate from 1.2.13: if f is surjective, then any geomet- 
ric sequent satisfied in M is satisfied in Gr, and hence provable in T. Conversely, 
suppose M is a conservative T-model, and consider the class C of objects A of 
Set|T] such that f* maps proper subobjects of A to proper subobjects of f* A. 
Conservativity of M says that C contains all subobjects of finite powers of Gr; 
but since f* preserves coproducts and epimorphisms, it is easy to see that C 
must be closed under coproducts and quotients, and hence (since Gr is a pre- 
bound for Set[T]) that it is the class of all objects of Set[T]. So by A1.2.4 f* is 
conservative. O 


We conclude this section by describing the classifying toposes of two theories 
fairly closely related to O: the theory D of decidable objects and the theory K of 
K-finite objects. D is written over the signature © with one sort and one binary 
relation # apart from equality (we write this relation in infix rather than prefix 
notation, i.e. (x # y) instead of #(x, y)). The axioms of D are the sequents 


((t#x) Fs L) 


and 
(T Fay (2 #y) V (£ = y))) 


which say that in any D-model A the interpretation of # is the complement 
of the diagonal subobject of A x A. Thus, for any topos E, D-Mod(€) is the 
category of decidable objects of E (that is, objects with complemented diagonal) 
and monomorphisms between them. (Recall from B4.2.3 that we cannot have 
a geometric theory whose category of models in an arbitrary € is the category 
of all objects of E and monomorphisms between them; D achieves this for all 
Boolean toposes €, which is best possible.) 

We may construct a classifying topos for D using the method of 3.1.10. For 
this purpose, we could take Do to be the empty theory over X, but it will simplify 
matters slightly if we allow it to include the Horn sequent 


((c#y) try (Y#2)) 


which says that # is symmetric (it is clear that this sequent is provable in D). 
Thus Do-models are objects equipped with a symmetric binary relation: what 
are sometimes called undirected graphs (with possible ‘loops’ joining a vertex 
to itself, but without multiple edges). The classifying topos for Do is clearly 
[Gph f Set], where Gph ș ìs the category of finite undirected graphs and graph 
homomorphisms. The axioms of D correspond to a coverage on Gph*? in which 
the graph with one vertex and a loop is covered by the empty family (ad hence, 
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by pushout, any finite graph with a loop is covered by the empty family), and 
the discrete graph 2 with two vertices is covered by the pair consisting of the 
inclusion 2 —» 2 (where 2 is the connected graph with two vertices) and the 
unique morphism 2 — 1. By pushing out and composing copies of these covering 
families, we may deduce that, for any finite graph (A, #), we have a covering 
family of morphisms 


((A, #) > (A/E, #e)|E£ € I) ’ 


where T is the set of all equivalence relations on A which are disjoint from # 
(note that this set is empty if # meets the diagonal), A/F is the quotient of A by 
the equivalence relation E, and #¢ is the complement of the diagonal relation on 
A/E. But it is easy to check that, if we define a cosieve on (A, #) to be covering 
whenever it contains the above family, then we do obtain a Grothendieck coverage 
on Gph’®?: so this is the coverage Jp which we require. Further, we see that if 
an object (A, #) has the property that # is the complement of the diagonal 
(i.e. if it is a model of D), then its only covering cosieve is the maximal one 
(cf. C2.2.18(a)). So, if we identify the category Set sm of finite sets and injections 
with the full subcategory of Gph, on these objects, then by C2.2.4(d) we have 
an equivalence of categories 


[Set jm, Set] ~ Sh(Gph?, Jp) . 


Thus we have proved 


Proposition 3.2.7 The classifying topos Set(D] for the coherent theory D of 
decidable objects may be taken to be the functor category [Set sm, Set]. oO 


The generic decidable object Gp is of course the inclusion functor Set sm —> 
Set; recall that an object F of a functor category [C, Set] is decidable iff F(a) 
is injective for each morphism a of C (A1.4.16(a)). 


Remark 3.2.8 The fact that the classifying topos for D is a functor cate- 
gory is not unrelated to the fact that D is a disjunctive theory in the sense 
of 1.3.6. It is not true that every disjunctive theory has a functor category 
as its classifying topos (this fails, for example, for the theory of fields as 
described in 1.1.7(h)); but for every such theory T over a signature ©, there 
is a disjunctive theory T’ over a larger signature ©’ (sometimes called the Diers 
completion of T) whose classifying topos is a functor category, and such that 
the forgetful functor &’-Str(Set) — L-Str(Set) restricts to an isomorphism 
T’-Mod(Set) = T-Mod(Set). (In the case when T is coherent, i.e. involves only 
finite disjunctions, we may replace Set by an arbitrary Boolean Grothendieck 
topos in the above.) For more details, see [508]. 


We mention in passing that the method of constructing a classifying topos 
for a theory T by way of a ‘simpler’ theory To of which it is a quotient, which 
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we sketched in 3.1.7 and 3.1.10, will continue to work even if To is not cartesian, 
provided we have a suitable representation of Set[To] as a functor category 
[C, Set]. We shall exploit this observation, with Ty = D, in 3.4.10 below. 


Remark 3.2.9 An alternative proof of 3.2.7 could be given ‘from the oppo- 
site end’, by starting with a presentation of the geometric theory classified by 
[Set rm, Set] (that is, the theory of torsors on Set;,,), and showing that it is 
Morita-equivalent to D. This is done, for example, in [549]. 


We gave an explicit presentation of the theory K of K-finite objects in 
1.1.7(k). Once again, we approach the problem of describing its classifying topos 
by the method of 3.1.10: this time we take Ko to be the Horn theory specified by 
(all instances of) the last two displayed sequents of 1.1.7(k). Thus a Kg-model is 
a set A equipped with a notion of ‘distinguished n-tuple’ of elements for every 
n > 0, such that if (a),...,a@n) is a distinguished n-tuple then so is any m-tuple 
obtained from it by re-ordering and either introducing or removing duplication 
of elements. Equivalently, therefore, we can think of it as a set equipped with a 
family of (nonempty) distinguished finite subsets. (In addition, a structure for 
the appropriate signature involves a truth-value [Ro], about which the theory Ko 
makes no assertions; we can think of Ro as the assertion that the empty subset 
is distinguished.) Once again, it is clear that the finitely presented Ko-models 
are just the finite ones; let us write C for the category of finite Ko-models. 

The case n = 0 of the first displayed sequent of 1.1.7(k) tells us that the 
coverage Jg on C°P must include the empty covering cosieve on the object given 
by a singleton with the empty set as the only distinguished subset; it follows that 
any nonempty finite Ko-model with the empty set distinguished is covered by the 
empty cosieve, and so we can effectively forget about these objects (cf. C2.2.4(e)). 
For n > 0, the first sequent of 1.1.7(k) tells us that, if A is the set {a),...,@n} 
with the whole of A as the only distinguished subset, and B is obtained from A by 
adding a new element b but retaining A as the only distinguished subset, then the 
family of all splittings for the inclusion A C B generates a covering cosieve on B. 
It follows that, for any object C of C with at least one nonempty distinguished 
subset, the family of all surjections f: C —> D, where the only distinguished 
subset of D is the whole of D, generates a covering cosieve. Finally, the second 
displayed sequent of 1.1.7(k) says that the initial object of C (the empty Ko- 
model with no distinguished subsets) is covered by the cosieve consisting of the 
identity map to the empty Ko-model with Ø distinguished, together with all 
maps to nonempty Ko-models with at least one distinguished subset. So, as 
with the theory D, we see that a cosieve on an arbitrary object A of C is Jg- 
covering iff it contains all morphisms with domain A whose codomain lies in the 
full subcategory (isomorphic to the category Set, of finite sets and surjections 
between them) whose objects are the (finite) K-models. Applying C2.2.4(d) once 
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again, we deduce 


Proposition 3.2.10 The classifying topos Set|K] for the geometric theory K 
of K-finite objects may be taken to be the functor category [Set fe, Set]. o 


As before, the generic K-model is the inclusion functor Setfe — Set; it 
follows from 5.4.14 below that this is a K-finite object of [Set fe, Set]. And, once 
again, 3.2.10 could alternatively have been proved by obtaining a presentation 
of the theory classified by [Set je, Set] and showing that it is Morita-equivalent 
to K; see [549]. 


Suggestions for further reading: Johnstone & Wraith [549], Ščedrov [1085]. 


D3.3 Coherent toposes 


In this section we study the particular properties of those Grothendieck toposes 
which occur as the classifying toposes of coherent theories. Such toposes are 
called coherent toposes; from our point of view, this name is slightly unfortunate, 
since (as we saw in A2.3.5) all toposes are coherent categories. However, the term 
‘coherent topos’ represents the original use of the word ‘coherent’ in topos theory 
(it was borrowed from the ‘coherent sheaves’ which occur in algebraic geometry), 
and ‘coherent theory’ and ‘coherent category’ are successive back-formations 
from this. 
We begin with 


Theorem 3.3.1 Let € be a Grothendieck topos. The following conditions are 
equivalent: 


(i) There exists a coherent (resp. regular) theory T such that E ~ Set/T}. 


(ii) There exists a site of definition (C,T) for E such that C is small and 
cartesian, and T 1s generated by finite (resp. singleton) covering families. 


(iii) There exists a standard site of definition (C,T) for E such that T is 
generated by finite (resp. singleton) covering families. 


(iv) There exists a small pretopos (resp. a small effective regular category) C 
such that E ~ Sh(C,T), where T is the coherent (resp. regular) coverage 
on C. 


Proof (ii) => (i) follows from the argument given in 3.1.13 to show that every 
Grothendieck topos is a classifying topos; for, in the axioms of the geometric 
theory T described there, the only disjunctions which appear are indexed by 
(generating subsets of) the covers in T. (iv) = (iii) and (iii) > (ii) are trivial; and 
(i) => (iv) follows from the construction of Set[T] mentioned after 3.1.9 (resp. 
in 3.1.5). (Note that the alternative construction of 3.1.10 (resp. 3.1.7) yields a 
site of definition for € satisfying the hypotheses of (ii), but not necessarily those 
of (iii).) o 
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Before proceeding further, we remark that the presence of finite limits in 
the underlying category, as well as the finiteness of the covering families, is an 
essential feature of the characterization in 3.3.1(ii). For example, we saw in 3.2.10 
that the theory K of K-finite objects has a classifying topos which is a functor 
category (i.e. has a site of definition with no nontrivial covering families); but we 
also saw in 2.4.12(c) that this theory cannot be Morita-equivalent to any finitary 
first-order theory, let alone a coherent one. 

If € satisfies the hypotheses of 3.3.1, we shall call it a coherent topos (resp. a 
regular topos). For the rest of this section, we shall deal in detail only with the 
coherent case, though we shall briefly discuss the corresponding results for the 
regular case in 3.3.10 below. 

In the search for a characterization of coherent toposes, the property of com- 
pactness plays an important rôle. We say that an object A of a topos (or more 
generally a geometric category) is compact if the top element of the lattice Sub( A) 
is compact; equivalently, if every covering family (A; — A | i € J) contains a 
finite covering subfamily. 


Example 3.3.2 In the topos Sh(X) of sheaves on a locale X, an object A 
is compact iff the domain of the corresponding local homeomorphism EF — X 
(cf. C1.3.11) is a compact locale, since subobjects of A may be identified with 
open sublocales of E. 


More generally, an object A of an arbitrary Grothendieck topos € is compact 
iff the topos €/A is compact (over Set) in the sense of C3.2.12; but we shall 
not make use of any results from Section C3.2 here. We shall need the following 
stability properties of compactness: 


Lemma 3.3.3 
(i) Let A be a compact object of a geometric category. Then, for any cover 
f: A— B, B is also compact. 


(ii) A disjoint union AL B is compact iff both A and B are compact. 


Proof (i) follows from the fact that f*: Sub(B) — Sub(A) preserves unions 
and is conservative. (ii) follows from the isomorphism Sub(A H B) S Sub(A) x 
Sub(B). 


The link with 3.3.1 is provided by 


Lemma 3.3.4 Let (C,T) be a small site, U € ob C. Then the associated sheaf 
\(U) of the functor C(—,U) is compact in Sh(C,T) iff U is T-compact’ in C; 
i.e. every T-covering sieve on U contains a finite family which generates a 
T-covering sieve. 


Proof This is immediate from the fact (C2.3.2(c)) that a sieve R on I(U) in 
Sh(C,T) is epimorphic iff the sieve {f: V — U | (f) € R} is T-covering in C. 
o 
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For an arbitrary sheaf A on (C,T), we always have an epimorphic family of 
morphisms (U) — A (as U ranges over all the objects of C); so a necessary 
condition for compactness of A is the existence of a finite epimorphic family 
(fi: (U;) + A|1<i< n). If the 1(U;) are all compact, then this condition is 
also sufficient, by 3.3.3. 

However, the presence of finite limits in C implies an important additional 
property of the objects 1(U). We shall say that an object A of a geometric 
category is stable if, whenever we are given a morphism f: B — A where B is 
compact, the domain R of the kernel-pair R = B of f is compact. And we 
shall call A coherent if it is both compact and stable. (Stability is more usually 
defined by reference to pullbacks of pairs of (not necessarily equal) morphisms 
from compact objects to A; but, using 3.3.3(ii), it is easy to prove that this 
definition is equivalent to the simpler one just given.) 

In a general topos, stability may be a vacuous condition, because there are 
‘not enough’ compact objects: for example, in the topos Sh(R) of sheaves on the 
real line, it follows easily from 3.3.2 that the only compact object is the initial 
object, and so every object is stable. But in the topos of sheaves on a site where 
every covering family is finite, 3.3.4 assures us of a plentiful supply of compact 
objects, and so stability becomes a nontrivial restriction. 


Example 3.3.5 Let G be a group, and consider the topos [G, Set} of (left) 
G-sets. It is trivial to verify that a G-set A is compact iff it has finitely many 
G-orbits; in particular, G itself (with G-action via left translations) is a compact 
G-set. Hence, if A is a stable G-set and x € A, the domain of the kernel-pair of 
f: G — A (where f is the unique G-equivariant map sending 1 to x) must have 
finitely many orbits. But it is easy to see that the orbits of this object correspond 
bijectively to elements of the stabilizer subgroup Gz = {g € G | gx = z}; so this 
must be finite. Conversely, if A is any G-set whose point-stabilizers are all finite, 
it is easy to verify that A is stable. 


Lemma 3.3.6 
(i) A subobject of a stable object is stable. 


(ii) A coproduct of stable objects is stable. 


Proof (i) is easy since if A’ — A is monic then the kernel-pair of any morphism 
B — A’ coincides with that of the composite B — A’ — A. 

(ii) Suppose A; and Ag are stable, and we are given a morphism 
f: B — A, U A with B compact. Then B decomposes as a disjoint union 
B, U B} (where B, is the pullback of B along the coprojection vi), and we 
have B xana, B S (Bi xa, By) I (B2 xa, B2), which is compact by 3.3.3(ii). 
This proves the result for finite coproducts; that for infinite coproducts follows, 
because if f: B — [],.,; A, is a morphism from a compact object to an infinite 
coproduct, there can be only finitely many 7 such that v7(B) is nonzero. o 


Theorem 3.3.7 Let (C,T) be a site satisfying (the coherent version of) the 
hypotheses of 3.3.1(ii). Then, for every object U of C, the object I(U) 18 coherent 
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in SH(C,T). If (C,T) satisfies the hypotheses of 3.3.1(iv), then the converse 
holds: every coherent object of Sh(C,T) is isomorphic to l(U) for some U. 


Proof By 3.3.4, we already know that I(U) is compact, so we have only to 
prove that it is stable. Suppose given a morphism A — (U) with A compact; 
then by the remarks after 3.3.4 we have a finite epimorphic family (I(V;) > A | 
1 < i < n). So A xu) A is covered by the pullbacks 1(Vi) xuyl(Vj), 1 < i,j <n, 
and it suffices by 3.3.3 to prove that each of these is compact. Since we are not 
assuming that the coverage T is subcanonical, the composites [(V;) + A — I(U) 
need not derive from morphisms V; — U in C; but, at the cost of replacing each 
V; by a finite T-covering family, we may assume that they do. But then since l 
is cartesian we have I(V;) xu) (Vj) S (Vi xu Vj), which is compact by 3.3.4. 
For the second assertion, let A be an arbitrary coherent object of Sh(C,T) 
(where we are now assuming that (C,T) is a pretopos with its coher- 
ent coverage). Since A is compact, we can find a finite epimorphic family 
(l(U;) + A|1<%<n); and since l: C > Sh(C,T) is a coherent functor, we can 
reduce this to a single epimorphism (U) = [[?_, (Ui) > A, where U = [[7_, Ui 
in C. Let R = 1(U) be the kernel-pair of |(U) —> A; then R is compact since A is 
stable, and so we can find a further epimorphism L(V) > R. Now the composite 
lV) > R = WU) x I(U) = UU x U) is in the image of l, since l is full; so its 
image R is also in the image of l. Finally, since C has and | preserves coequalizers 
of equivalence relations, we see that A is in the image of l. o 


Theorems 3.3.1 and 3.3.7 together tell us that, up to equivalence, coherent 
toposes are ‘the same thing as’ small pretoposes. We may thus fulfil a promise 
made in Section D3.1: 


Corollary 3.3.8 A coherent theory T is determined up to Morita equivalence 
by its classifying topos Set[|T]. 


Proof We defined Morita equivalence, for coherent theories, to be equivalence 
of the pretoposes Py generated by their syntactic categories. But by 3.3.7 we can 
recover Pr from Set[T], as its full subcategory of coherent objects. ca 


Remark 3.3.9 It also follows immediately from 3.3.7 that, in a coherent topos 
E, the full subcategory Econ of coherent objects is closed under finite limits, finite 
coproducts and coequalizers of equivalence relations. (In fact it is possible to give 
a direct proof of this, without invoking the theory of classifying toposes — this is 
done in [36] and [504]. However, these closure properties, unlike the elementary 
ones of 3.3.3 and 3.3.6, do not hold in an arbitrary topos.) But Econ need not 
be closed under arbitrary coequalizers, even if it has them. Let Set, be the full 
subcategory of Set whose objects are all finite or countable sets; then Set, is a 
pretopos, and so we obtain a coherent topos Sh(Set., P) of sheaves on it for the 
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coherent coverage. However, the coequalizer diagram 
s 
Ns SNe 
1 


(where N is the set of natural numbers) is not preserved by the embedding 
l: Sete — Sh(Set., P). For if it were, then since the coproduct diagram of 
A2.5.5(i) is also preserved by l, it would follow from 5.1.2 that 1(N) would be 
a natural number object in Sh(Set,, P). But the natural number object in any 
Grothendieck topos is a countable copower of 1, so it cannot be compact unless 
the topos is degenerate. Hence this contradicts 3.3.4. (See also C4.2.4, for a close 
relative of this counterexample.) 


Remark 3.3.10 As we remarked earlier, it is possible to give an analogous 
characterization of the classifying toposes of regular theories, replacing the notion 
of compactness by that of supercompactness (an object A of a geometric category 
is called supercompact if every covering family with codomain A contains a sin- 
gleton cover), and coherence by the corresponding notion of supercoherence. In 
particular, if € is a topos satisfying the ‘regular’ version of the conditions of 3.3.1, 
then the full subcategory of supercoherent objects of € is a small effective regu- 
lar category, and € is equivalent to the category of sheaves on it for the regular 
coverage; conversely, if C is any small effective regular category, then the super- 
coherent objects of Sh(C) are exactly the representable functors. We omit the 
proofs, which are very similar to those that we have given in the coherent case. 


Using the notion of supercompactness introduced in 3.3.10, we may deduce 
an interesting syntactic property of regular theories, which will be of use in 
Section D3.5: 


Lemma 3.3.11 Let T be a regular theory, and let ¢ and (y; |i € I) be regular 
formulae over the signature of T such that the geometric sequent ($ Fz Vic; Vi) 
is derivable in T (using geometric logic). Then, for some i € I, the regular 
sequent (ġ Fz Yi) is also derivable. 


Proof If (¢'z Vier yi) is derivable, then it must be satisfied in the generic 
T-model in Set[T]: that is, ({#.¢}) is contained in the union of the l({Z.,}) 
as a subobject of I({£.T}). (Here l, as usual, denotes the Yoneda embedding 
Cy” — Set[T].) But 1({Z.¢}) is supercompact, and so it must be contained in one 
of the 1({Z.1,}). Since l is conservative, this means that we have {#.6} < {Z.y} 
in Cy*, and hence by (the regular analogue of) 1.4.4(iv) the sequent (¢ tz Yi) 
is derivable in T. o 


There is of course an analogue of 3.3.11 for coherent theories, in which an 
arbitrary disjunction is reduced to a finite one; we leave the detailed formulation 
of this result to the reader. We should also mention that there is an alternative, 
more ‘classically model-theoretic’, proof of 3.3.11 using the classical completeness 


D3.3 COHERENT TOPOSES 915 


theorem 1.5.4(ii) and the fact (which we noted after 2.4.3) that the class of models 
of a regular theory is closed under products: for if the conclusion of 3.3.11 fails, 
then for each i we can find a T-model M; in Set containing a tuple of elements 
which belongs to the interpretation of ¥.@ but not of %.y;, and then the product 
Tier Mi is easily seen to contain a tuple which belongs to [%.¢] but not to 
IZ. Vie Yi]. (Using this argument, we may also prove a converse to the remark 
after 2.4.3: if T is a coherent theory over a signature ©, such that T-Mod(Set) is 
closed under arbitrary products in &-Str(Set), then T is equivalent to a regular 
theory over ©.) 


Remark 3.3.12 There is a close connection between the notion of coherence 
for objects of a topos, studied in this section, and the notion of finite presentabil- 
ity studied in Section D2.3. It is easy to see that a finitely presentable object 
A of a geometric category is compact; for if A is the union of a directed family 
of subobjects A;, then the identity morphism A — A must factor through some 
A; — A. Conversely, if (C, T) is a site such that T is generated by finite covering 
families, then it is easy to see that Sh(C,T) is closed under filtered colimits in 
[C°P, Set], and hence that the sheaves I(U) are finitely presentable in Sh(C,T); 
so it follows from 3.3.7 that every coherent object in a coherent topos is finitely 
presentable. However, neither of these implications is reversible. In any cocom- 
plete category, the finitely presentable objects are closed under arbitrary finite 
colimits, so that (for example) in the topos mentioned in 3.3.9, the coequalizer 
of 1(N) = I(N) is finitely presentable but not coherent. And if X is any compact 
space which is not strongly compact in the sense of Section C3.4 (for example, 
the space of C3.4.1(a)), then the terminal object of Sh(X) is compact (by 3.3.2) 
but not finitely presentable. 

We note in particular that a coherent topos is a locally finitely presentable 
category (this could of course have been deduced from 2.1.4(h) and 2.3.7). Once 
again, the converse is false: the topos Set[K] of 3.2.10 provides a counterexample, 
as does the topos [G, Set] for any infinite group G (cf. 3.3.5). 


The classical completeness theorem for coherent theories (1.5.10) may be 
translated into an important property of coherent toposes: 


Proposition 3.3.13 A coherent topos E has enough points; that is, the inverse 
image functors E — Set are jointly conservative. 


Proof Suppose € ~ Set[|T], and let (M; | i € I) be a set of T-models in 
Set which are sufficient to determine provability in T. We may regard these as 
a single T-model M in Set/I, and we need to show that its classifying map 
f: Set/I — Set[|T] is a surjection. We do this by the method of 3.2.6(ii): let 
C be the class of all objects A of Set/T] such that f* preserves properness of 
subobjects of A. Since M is conservative for coherent sequents, we see that C 
includes (the images in Set[T] of) all the objects of the syntactic category C0", 
but these form a separating set for Set[T], and C is closed under coproducts and 
quotients, so it contains all the objects of Set[T]. o 
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Proposition 3.3.13 may also be proved by ‘purely topos-theoretic’ methods, 
without reference to coherent theories or classifying toposes; this is done in [36] 
and [504]. However, inspection of the proof shows that it involves precisely the 
same sort of iterative construction that went into the proof of the completeness 
theorems in Section D1.5. This is not surprising, since the two are formally 
equivalent: the deduction of 1.5.10(ii) from the statement of 3.3.13 is even easier 
than the proof of 3.3.13, given that validity in the generic model of T is equivalent 
to provability in T. 


Remark 3.3.14 If T is a coherent propositional theory, then Set[T] is localic 
over Set (by B4.2.12, or alternatively because C¢°" is a preorder, as we observed 
in 1.4.14); so by 3.3.13 it is actually spatial, i.e. of the form Sh(X) for a topo- 
logical space X. The spaces X which occur in this context are easy to describe: 
they are exactly the coherent spaces, i.e. those whose topology has a basis which 
is closed under finite intersections (including the empty intersection X) and con- 
sists of open sets which are compact. If X is a coherent space, then the poset L of 
all compact open subsets of X is a distributive lattice, and X is homeomorphic 
to the spectrum spec L, i.e. the space whose points are the prime filters of L, 
with topology having as a basis the sets {F € spec L | U € F}, U € L; the 
Comparison Lemma (C2.2.3) then ensures that Sh(X) is equivalent to the topos 
of sheaves on L for the coherent coverage. So the fact that every localic coherent 
topos is of this form is another reflection of the fact, observed in 1.4.14, that 
every coherent propositional theory is Morita-equivalent to the theory of prime 
filters of a distributive lattice. 


In connection with the foregoing remark, we note 


Lemma 3.3.15 The localic reflection functor BYop/Set — LTop/Set pre- 
serves coherent toposes. 


Proof Suppose € œ~ Sh(C,7) is a coherent topos, where C is a small pretopos 
and T is its coherent coverage. Then (C,T) satisfies the hypotheses of C2.3.22, 
so by that result we may identify the localic reflection of E with Sh(Sube(1), T’) 
where T” is the coverage induced on Subc(1) by T. But Subc(1) is a distributive 
lattice by Al.4.2, and T’ may easily be identified with its coherent coverage; so 
Sh(Subc(1), T’) is a coherent topos. 0 


Lemma 3.3.15 may alternatively be proved starting from a site for € which 
merely satisfies the hypotheses of 3.3.1(ii). In this case, however, we have to use 
the more complicated description of the localic reflection provided by C2.3.20. 

We conclude this section by noting two further closure properties of the class 
of coherent toposes: 


Lemma 3.3.16 
(i) If E is a coherent topos, then the slice category E/B is a coherent topos iff 
B is a coherent object of E. 
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(ii) If E and F are Grothendieck toposes, the product category E x F is a 
coherent topos iff both E and F are. 


Proof (i) It is easily verified that an object f: A — B of E/B is compact 
(resp. coherent) iff A is compact (resp. coherent) in €. So if B is coherent, the 
subcategory (E/B)con of coherent objects of €/B is simply €.on/B; hence it is a 
pretopos, and its objects form a separating family for €/B. The converse follows 
from the fact that the terminal object of a coherent topos is a coherent object. 

(ii) By 3.3.3(ii) and 3.3.6(i), a complemented subterminal object in a coher- 
ent topos is coherent; so the left-to-right implication follows from (i). For the 
converse, it is again easy to verify that an object (A,B) of E x F is compact 
(resp. coherent) iff A is compact (resp. coherent) in € and B is compact (resp. 
coherent) in F. So if E and F are coherent, the coherent objects of € x F form 
a pretopos and a separating family. O 


Remark 3.3.17 An alternative ‘syntactic’ proof of the last part of 3.3.16 can 
be given as follows: let Tı and T2 be the coherent theories classified by € and 
F respectively. Since E x F is the coproduct of E and F in Yop, and since 
such coproducts are stable under pullback, a geometric morphism H — E x 
F corresponds to a complementary pair of subterminal objects (U,,U2) in H 
together with a T,-model in H/U, and a T2-model in H/U2. Now it is easy 
to construct a coherent theory T3 whose models correspond to the above data: 
specifically, if ©; and U2 are the signatures of T) and T2, we take a signature X3 
which is the disjoint union of these two together with two new sorts U, and U2, 
and new function symbols f4: A — U, (resp. ga: A — U2) for each sort A of ©) 
(resp. £2). As axioms, we take the axioms of T, (with a free variable of sort U, 
added to the context, if it happens to be empty), the axioms of T2 (similarly 
modified), and the sequents (T Fz, y, (vi = yi)) (i = 1,2), 


(T Hi] ((3x,)T V (dr2)T)) , and 


(((Ari)T A (3z2)T) Fi] L) ; 


where x; and y; are variables of sort U; (i = 1,2). It is now clear that if M is 
a T3-model in a topos H, then MU, and MU? are complementary subterminal 
objects, and the remaining data yield models of Tı and Tz in the corresponding 
slice categories. 

At least in the case when B is (the image in € of) an object of the syntactic 
category C&°h, where T is the coherent theory classified by £, a similar proof 
may be given for the first part of 3.3.16. For if B is the object {#.¢}, then €/B 
classifies the theory obtained from T by adding a string of new constants ¢ of 
the same length and type as 7, together with the new axiom (T Fy) ¢[c/7]). 


We note that a regular topos is connected and locally connected over Set, 
by C3.3.11(b). It follows that the second part of 3.3.16 does not hold for reg- 
ular toposes, though the first part does (by either of the two proofs given in 
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3.3.16 and 3.3.17), and so does the regular analogue of 3.3.15. Also, although 
3.3.16(ii) does not hold for regular toposes, it does hold for the classifying toposes 
of disjunctive theories, as may readily be seen from the proof given in 3.3.17. 


Suggestions for further reading: Artin et al. [36], Makkai [777]. 


D3.4 Boolean classifying toposes 


Given the ‘equivalence’ between Grothendieck toposes and geometric theories 
established in Section D3.1, it is naturally of interest to ask how familiar prop- 
erties of toposes relate to familiar (syntactic or model-theoretic) properties of 
the theories which they classify. In this section, we shall be concerned with the 
question of what can be said about a theory whose classifying topos is Boolean; 
we shall be mainly interested in coherent theories, for which this question has a 
particularly satisfying answer, first described by A. Blass and A. Séedrov [133]. 
We shall briefly discuss infinitary geometric theories whose classifying toposes 
are Boolean at the end of the section. 

To begin with, we need to investigate what can be said about Boolean toposes 
which are coherent, in the sense described in the last section. We start with a 
particular case: when is the topos Cont(G) of continuous G-sets for a topological 
group G (cf. A2.1.6) coherent? We recall that, given subgroups H and K of a 
group G, a double coset of the pair (H, K) is a subset of G of the form Hg Kk = 
{hgk | h € H,k € K} for some g € G. Just as for ordinary cosets, it is easy 
to prove that the double cosets of a given pair of subgroups form a partition 
of G (that is, if HgK and Hg’K overlap, they must be identical). We shall be 
particularly concerned with the case H = K; we shall say that a subgroup H 
has finite bi-index in G if the number of distinct double cosets HgH, g € G, 
is finite. And we shall say that a topological group G is coherent if every open 
subgroup of G has finite bi-index. Note that for a normal subgroup H, double 
cosets HgH coincide with ordinary cosets gH; hence, for a pro-discrete group G 
(one expressible as an inverse limit of discrete groups), coherence is equivalent 
to profiniteness. However, for non-normal subgroups, having finite bi-index is a 
weaker condition than having finite index: 


Example 3.4.1 Let G be the group of all permutations of N, and let H be the 
pointwise stabilizer of a finite subset S C N, i.e. the set of all g € G satisfying 
gn = n for all n € S. It is clear that, except in the case S = @, H has infinite 
index in G. However, it has finite bi-index: for if two elements g and g’ of G 
satisfy {n € S| gn € S} = {n € S | g'n € S} and the restrictions of g and g’ 
to this subset are equal, then it is easy to see that we can obtain g’ from g by 
pre- and post-multiplying by elements of H, and hence HgH = Hg'H. Thus the 
number of double cosets of H is }>;_9(;)?k!, where n is the number of elements 
of S; although this number increases quite rapidly with n, it is undeniably finite. 
Hence if we equip G with the topology of pointwise convergence, in which the 
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pointwise stabilizers of finite subsets of N form a base of neighbourhoods of the 
identity element, then it becomes a coherent topological group. 


Lemma 3.4.2 Let G be a topological group. Then the topos Cont(G) is 
coherent iff G is coherent in the sense defined above. 


Proof First we note that, as in the topos [G, Set] of all G-sets (3.3.5), the 
compact objects of Cont(G) are exactly the continuous G-sets with finitely 
many orbits. If Cont(G) is coherent, then its terminal object 1 must be stable, 
i.e. A x A must be compact for every compact A; so in particular the product 
G: H x G: H, where H is an open subgroup of G (and G : H denotes the set 
of left cosets of H, with G acting by left translation), must have finitely many 
G-orbits. But in fact this condition (for all open subgroups of G) is sufficient; for 
it implies that G: H x G : K is compact for any two open subgroups H and K 
(since HN K is open if H and K are, and G : H and G : K are both epimorphic 
images of G: (H N K)). Hence the square of any compact object, being a finite 
coproduct of objects of this form, is compact, and so (since any subobject of a 
compact object is compact) it follows that every object is stable. Given this, it is 
easy to see that the compact (equivalently, coherent) objects of Cont(G) form 
a pretopos, and that they form a site of definition for Cont(G) when equipped 
with the coherent coverage. 

Thus we are reduced to determining when G : H x G: H has finitely many 
orbits. But (cH, yH) is in the same orbit as (z’H, y’H) iff there exist elements 
g € G and h,k € H such that grh = 2’ and gyk = y', iff the double cosets 
Hx~)yH and Hr'~'y’H are equal. So a necessary and sufficient condition for 
coherence of Cont(G) is that each open subgroup H should have finite bi-index. 

0 


Theorem 3.4.3 For a Grothendieck topos E, the following are equivalent: 


(i) E is Boolean and coherent. 
(ii) E€ is atomic (over Set) and coherent. 


(iii) E is equivalent to a finite coproduct of toposes of the form Cont(G;), where 
the G; are coherent topological groups. 


Proof (ii) = (i) is trivial since atomic toposes are Boolean (C3.5.2), and 
(iii) > (ii) follows from 3.4.2 and the fact that a finite coproduct of coherent 
toposes is coherent (3.3.16(ii)). 

(i) > (ii): Suppose € is a Boolean coherent topos. By 3.3.3(ii) and 3.3.6(i), 
any subobject of a coherent object A of £ is coherent; moreover, since the lattice 
Sub(A) is Boolean and its top element is compact, it must actually be finite and 
hence atomic. So every object of Econ is a finite coproduct of atoms (and hence 
covered by atoms in the coherent coverage), and so we may replace E,on, as a 
site of definition for £, by its full subcategory Eat consisting of atoms. And this 
category, with the coverage induced by the coherent coverage on Econ, is easily 
seen to satisfy the hypotheses of C3.5.8; so € is atomic. 
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(ii) => (iii): If we add to (ii) the hypothesis that € is connected (i.e. that its 
terminal object is an atom), then (since it must have a point, by 3.3.13) it satisfies 
the hypotheses of C5.2.13, and so must be of the form Cont(G); moreover, by 
3.4.2, the topological group G must be coherent. In general, the terminal object 
1 is a finite coproduct [[j__, A; of atoms (since it is coherent), and so £ is a finite 
coproduct of connected atomic toposes €/A;, which are coherent by 3.3.16(ii). 
So the result follows from the particular case already discussed. o 


We note that the equivalence of (i) and (ii) in 3.4.3 could alternatively be 
deduced from 3.3.13 and C3.5.2. 

We now turn to the question of what can be said about a coherent theory T 
if its classifying topos Set[T] satisfies the conditions of 3.4.3. First we note 


Scholium 3.4.4 Let T be a coherent theory over a signature X, such that 
Set[T] is Boolean. Then, for any context £ over ©, there are only finitely many 
T-provable-equivalence classes of coherent formulae in the context T. 


Proof Such provable-equivalence classes correspond to (isomorphism classes 
of) subobjects of {Z.T} in Coo», by 1.4.4(iv). But the canonical functor CS9 — 
Set[T] is conservative, and we have seen in the proof of 3.4.3(i) > (ii) that any 
coherent object of Set[T] (in particular, any object in the image of this functor) 
has finite subobject lattice. o 


In order to proceed further, we shall find it convenient to work not with 
the whole syntactic category C£°™ but with its full subcategory C’ of consistent 
objects, i.e. those objects {%.¢} for which the sequent (¢ tz L) is not provable 
in T. The point of this is that the inconsistent objects are precisely those which 
are covered by the empty family in the coherent coverage P on C%°"; so if we 
delete them we obtain (as in C2.2.4(e)) a site (C’, P’) which gives rise to the same 
topos, but in which every covering family is nonempty — equivalently, Sh(C’, P’) 
is a dense subtopos of [C'°?, Set]. So, if the classifying topos of T is Boolean, we 
may identify P’ with the coverage corresponding to the double-negation local 
operator on [C‘°P, Set] (cf. A4.5.21); in other words, a sieve R on an object A of 
C' is P'-covering iff it is ‘stably nonempty’, i.e. its pullback along any f: B — A 
in C’ is nonempty. 


Lemma 3.4.5 Suppose T is a coherent theory such that Set|T] is Boolean. 
Then, for every coherent formula-in-contert T.p over the signature of T, there 
exists a coherent formula x.y in the same contezt such that the sequents 
((GAw) kz L) and (T Fz (@V w)) are provable in T. 


Proof By 3.4.4, there are (up to T-provable equivalence) only finitely many 
coherent formulae y for which the first sequent holds, so (by taking their dis- 
junction) we may find a single such y which is entailed by all the others. 
Suppose this y does not satisfy the second sequent; then the sieve on the object 
{Z.T} of C9" generated by the canonical monomorphisms {#.¢} — {7. T} and 
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{z.p} = {Z.T} is not covering for the coherent coverage. (We shall assume 
that the domains of both these morphisms are consistent; if either of them is 
inconsistent, the argument is similar but easier.) 

From the description given above of the coverage P’, we deduce that there 
exists a consistent object {y.x} and a morphism 


E l 
U=] 


such that no morphism with consistent domain factors through both [0] and one 
of the two monomorphisms above. In particular, this implies that {Z, 7. (0 A ¢)} 
and {z,7.(@Aw)} are both inconsistent. But the first of these implies that 
(((Sy)@ A p) Fz L) is provable in T, so by the construction of y the sequent 
((4y¥)@ tz wW) is provable, and hence (6 + zz (0 A w)) is provable. So {£, ¥.9} is 
inconsistent; but since 0 represents a morphism of C<°? with domain {7.x}, the 
sequent (x Hg (3Z)0) is provable, contradicting the consistency of {y¥. x}. O 


We are now ready for the main result of this section. 


Theorem 3.4.6 Let T be a coherent theory over a signature X. Then the 
following conditions are equivalent: 


(i) The classifying topos Set[T] is Boolean. 


(ii) T satisfies the following two conditions: 


(a) for every first-order formula-in-context £.@ over X, there exists a 
coherent formula T. in the same context, such that the sequents 
(ġ Fz Y) are provable in T using classical logic; and 


(b) in any context over ©, there are only finitely many first-order formulae 
up to classical T-provable equivalence. 


Proof First suppose Set[T] is Boolean. To prove that (ii)(a) holds, we pro- 
ceed by induction over the structure of ¢. Since provable equivalence is stable 
under compounding, we need only consider the cases when ¢ is obtained by nega- 
tion, implication or universal quantification from formulae for which the result is 
known to be true. Lemma 3.4.5 takes care of negation; for the other two cases, we 
use the fact that the sequents ((@=> x) 4b (-@Vx)) and ((Vz)@ 4+ 7=(Szr)-6) are 
classically provable. Condition (ii)(b) is now immediate from (ii)(a) and 3.4.4. 
Conversely, suppose T satisfies the conditions in (ii). By the classical com- 
pleteness theorem 1.5.10 (or by 3.1.16, if you prefer), the relations of classical 
T-provable equivalence and constructive T-provable equivalence coincide for 
coherent formulae, and so 1.4.4(iv) implies that the subobject lattice of any 
object of Cé° is a finite Boolean algebra, and in particular atomic. As before, let 
C’ be the full subcategory of C{°™ obtained by deleting the inconsistent objects, 
and let P’ be the coverage induced on it by the coherent coverage on C$”; we 
shall show that P’ coincides with the coverage N consisting of stably nonempty 
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sieves, so that Set[T] =~ Sh(C’, P’) ~ sh_.|C’°?, Set]. The inclusion P’ C N is 
clear, since every object of C’ is consistent. So suppose R is an N-covering sieve 
on an object {£.¢} of C’; then, for each atom {#.7} of Sub({Z.¢}), there exists 
a morphism f: {7.x} — {£.¢} in R which factors through {F.Y} — {Z.}. 
But {7.x} is consistent and {Z.y} has no proper subobjects in C’, so the image 
of this morphism must be precisely {Z.y}. Thus if we choose one such morphism 
for each atom, we obtain a finite family of morphisms in R, the union of whose 
images is the whole of {7.6}. So R is P’-covering. O 


Condition (ii)(b) of 3.4.6 is well-known to model-theorists as ‘Ryll- 
Nardzewski’s criterion for No-categoricity’. Specifically, we have the following 
result: 


Proposition 3.4.7 Let T be a (consistent) complete first-order theory over 
a signature X. If there are only finitely many classical-T-provable-equivalence 
classes of first-order formulae in any context over X, then T is No-categorical; 
that is, it has only one isomorphism class of (finite or) countable models in Set. 
If the signature © is countable (i.e. has countably many sorts, function symbols 
and relation symbols), then the converse holds. 0 


The proof of 3.4.7 is beyond the scope of this book; see [446, 7.3.1]. We recall 
that ‘T is complete’ means that every sentence over È is T-provably equivalent to 
T or L (but not both); in our context, this corresponds to saying that the clas- 
sifying topos Set[T] is two-valued (equivalently, connected). Ryll-Nardzewski’s 
criterion is normally stated only for single-sorted theories; but in fact it works 
equally well for many-sorted ones, provided that the cardinality of a Set-model 
M of a many-sorted theory is understood to be the cardinality of the disjoint 
union [J ses-sor, MA. 


Corollary 3.4.8 Let T be a coherent theory such that Set[T] is Boolean. Then 
T has finitely many completions Ty,...,Tn, all of which are coherent and Xo- 
categorical; and in fact Set/T| is the coproduct of the Set[T;] in BYop/Set. 
Moreover, each Set|T,} is equivalent to Cont(G;), where G; is the automorphism 
group of the unique countable model M; of T;, equipped with the topology in which 
the pointwise stabilizers of finite subsets of M; form a base of neighbourhoods of 
the identity. 


Proof The completions of a first-order theory T correspond to ultrafilters in 
the (classical) Lindenbaum algebra of T, i.e. the Boolean algebra of classical-T- 
provable-equivalence classes of sentences over the signature of T. But 3.4.6 tells 
us that this algebra is finite, so there are only finitely many such ultrafilters; 
moreover, they are all principal, and generated by the atoms of the Lindenbaum 
algebra. If ¢; is one such atom, the corresponding completion T; is obtained 
from T by adding the axiom (T Hj) ¢i); its classifying topos is the slice category 
Set[{T]/[[]- di], where G is the generic T-model, and hence it is still Boolean. 
So all the completions of T satisfy the Ryli-Nardzewski criterion. Finally, since 
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the Set[T,] are connected as well as Boolean, they are of the form Cont(G;) by 
C5.2.14(c), where G; may be taken to be the localic automorphism group of any 
point of Set[T;]. But if we take the point corresponding to the unique countable 
model, we obtain the group G; described in the statement. 0 


Condition (a) of 3.4.6(ii) is perhaps less familiar to classical model-theorists 
than (b); however, we note that it is satisfied by the Morleyization (as defined 
in 1.5.13) of any first-order theory T. (In 1.5.13 we showed that every first-order 
formula over the original signature © of T is classically provably equivalent, in 
its Morleyization T’, to a coherent formula over the enlarged signature X’ of 
the latter; but it is easy to see that the same is true for any first-order formula 
over X’, since we can reduce it to a first-order formula over © by replacing each 
instance of one of the new relation symbols by the corresponding instance of 
the first-order formula over © which it ‘represents’.) It is also clear that the 
Morleyization of T satisfies condition (b) iff T itself does. 

We may also decompose condition (a) as the conjunction of two simpler 
conditions (a,) and (a2): 


(a1) the negation of every atomic formula over © is (classically) T-provably 
equivalent to a coherent formula; and 


(a2) every first-order formula over © is classically T-provably equivalent to an 
existential formula (i.e. one of the form (AZ) where ¢ is quantifier-free 
and £ is a (possibly empty) string of variables). 


For both these conditions are clearly implied by (a); conversely, if they both 
hold, we may replace an arbitrary first-order formula by a T-provably equiva- 
lent existential formula, then replace formulae of the form (=y) by (=¢ V w), 
pull negations inside conjunctions and disjunctions using the De Morgan laws 
(cf. 4.6.1), and finally replace any negated atomic formulae in the resulting 
formula by their coherent equivalents. Now we have 


Proposition 3.4.9 Let T be a coherent theory over a signature X. Then 


(i) T satisfies condition (a1) iff every homomorphism of T-models in Set is 
an embedding (cf. 1.2.10). 
(ii) T satisfies condition (a2) iff every embedding of T-models in Set is an 
elementary embedding. 
(iii) T satisfies condition (a) of 3.4.6(ii) if every homomorphism of T-models 
in Set is an elementary embedding. 


Proof Part (ii) of this proposition is familiar to classical model-theorists, who 
call a first-order theory model-complete if it satisfies these conditions. We shall 
prove only part (i) in detail, and refer the reader to [446, 8.3.1], for the proof 
of (ii) (which is in any case rather similar to that of (i)); part (iii) is of course 
immediate from (i) and (ii). 
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First suppose T satisfies (a1). Then, for any atomic formula-in-context Z.¢ 
and any homomorphism h: M — N of T-models in Set, we have a commutative 
diagram 

[Z] >—> M(A,..-,An) << [7.74] u 


[z.d] >—~> N(4;,..., An) <—<K [Z.-¢] 


(where Aj,...,An is the sequence of sorts of the variables in 7), by 1.2.9. Since 
the pairs of subobjects in each row are complementary, it follows that the left- 
hand square must be a pullback; but the latter condition, for all atomic ¢, is the 
definition of what it means for h to be an embedding. 

Conversely, suppose every homomorphism is an embedding. Let £.¢ be an 
atomic formula-in-context; we shall find it convenient to extend our signature © 
to X’ by adding a string of new constants Z of the same length and type as the 
context Z. Let T be the set of all coherent formulae y in the context Z such that 
((y A ¢) kz L) is provable in T; we need to show that (T fz (yV ¢)) is provable 
for some y €T. Let T’ be the theory over X’ obtained by adding 


Mea Fp Dy eT} 


to the axioms of T; suppose T’ has a model M in Set. Let ©” be the signature 
obtained from X’ by adding a new constant of sort A for each element of MA, 
for each sort A of X, let X be the set of all atomic sentences x over X” such that 
(T Hi x) is satisfied in M, and let T” be the theory obtained by adding all these 
sequents to the axioms of T (not T’). (It is then easy to see that a T”-model 
is essentially the same thing as a T-model N equipped with a homomorphism 
h: M — N.) For each x € X, let X be the formula over © obtained from it on 
replacing the constants in ¢ by the corresponding variables in 7, and any other 
new constants which appear by a string of new variables y. Then, for each finite 
subset {x1,---;Xn} of X, the coherent formula-in-context 


T. (Ay) (XTA \Xn) 


is not in I’, since it becomes true in M when we substitute ¢ for Z. So 


(x1 A+ A Xn A gfé/]) Fy +) 


is not provable in T; hence the theory obtained from T” by adding (T Hj) ¢[€/Z]) 
is consistent, since each finite subset of it is consistent, and by completeness it 
has a model hk: M — N. But h is an embedding, and ¢[¢/Z] is satisfied in N, so 
it must be satisfied in M. Since this is true for any model M of T’, we deduce 
by classical completeness that (T Hp) ¢[¢/#]) must be provable in it; hence it 
is deducible from some finite subset of the axioms of T’. If y1,...,Ym are the 
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members of T such that the sequents (7;[¢/Z] Hi} -L) occur among these axioms, 
then it is easy to see that the sequent 


(T kz (71 VV Im V @)) 


must be satisfied in every T-model, and hence provable in T. But (y1 V-V Ym) 
is in I; so it is the coherent formula we seek. o 


We remark that the conditions (a1) and (a2) are independent, even for the- 
ories which satisfy 3.4.6(ii)(b). Of course, any propositional theory satisfies (a2) 
(indeed, any embedding between models of a propositional theory is an isomor- 
phism), and it satisfies (b) provided its signature has only finitely many relation 
symbols; but the theory of prime filters of a non-Boolean distributive lattice fails 
to satisfy (a). We shall give an example of a theory which satisfies (a1) but not 
(a2) after 3.4.11 below. 

We next discuss some important examples of coherent theories whose 
classifying toposes are Boolean (and connected). 


Example 3.4.10 The theory Dæ of infinite decidable objects is obtained from 
the theory D of 3.2.7 by adding the axioms 


(T TE) N (y # ti )) 


for each natural number n (for n = 0, we interpret this as the sequent (T Hi] 
(dy)T)). As we remarked after 3.2.8, we may construct a classifying topos for 
this theory by taking the classifying topos [Set fm, Set] for D and imposing a 
coverage J on Set’), ‘generated’ by the extra axioms of Do. Specifically, the 
nth axiom says that the inclusion map n — (n+ 1), where n as usual denotes 
the n-element set {0,1,...,2—1}, generates a J-covering cosieve on n; it is then 
easy to see that every morphism of Set;,, must generate a J-covering Cosieve, 
and hence that J consists of all nonempty cosieves on objects of Set sm. So the 
classifying topos Set[D,.] ~ Sh(Set$m J) is equivalent to sh__[Set fm, Set]; 
note that (Set m> J) is an atomic site, as we saw in C3.5.9(c). 

It is obvious that Da is No-categorical, since its models in Set are just infinite 
sets (and homomorphisms of models are injections between them). If we take its 
‘standard’ countable model to be the set N of natural numbers, then the group 
which we obtain from it as in 3.4.8 is just the coherent group G discussed in 
3.4.1. In fact it is not hard to give a direct proof that Sh(Set{?,, J) is equivalent 
to Cont(G): given a natural number n, let H„ denote the pointwise stabilizer of 
n (ie. of {0,1,...,n—1}) in G, and let A, be the set of left cosets of H, with G 
acting by left translation. Clearly, A, is a continuous G-set; and since every open 
subgroup of G contains a conjugate of H,, for some n, we see that the atoms of 
Cont(G) are precisely the quotients of the An. Given any injection f: m => n, 
we may extend f to a permutation f’ of N, uniquely up to right multiplication by 


926 CLASSIFYING TOPOSES D3.4 


elements of Hm, and hence we obtain a well-defined G-equivariant map A, — Am 
sending gH,, to gf’ Hm; it is easy to see that every G-equivariant map A, — Am 
is of this form, and hence that the full subcategory of Cont(G) on the objects 
Án is equivalent to Set“? . Given that the A, form a separating set for Cont(G), 
the claimed equivalence now follows from the Comparison Lemma (C2.2.3). 


Example 3.4.11 Our second example is the theory Lə of dense linearly 
ordered objects without endpoints. This is defined over a single-sorted signa- 
ture © having one binary relation-symbol < apart from equality; we define the 
theory L of (decidably) linearly ordered objects to have axioms 


(((£ < y) A (y <2)) Fay 4) and 


(T Fs ((£ =y) V (£ < y) V (y < 2))) , 
and Læ is obtained from L by adding the further axioms (T Hp (4z)T), 


(E < y) Fay (BZ)((£ < 2) A(z < y))) and 


(T Fe (3y, z)((y < 2) A (z < z))). 


(We could have added the axioms which say that < is irreflexive and transitive, 
but they are both deducible from the axioms of L.) We note that L is a disjunc- 
tive theory in the sense of 1.3.6; as we did for D in 3.2.7, we may identify its 
classifying topos with a functor category, namely [Ord fm, Set], where Ord sm 
is the category of finite ordinals and order-preserving injections between them 
(equivalently, the category of finite models of L in Set). As in the previous 
example, we may generate a coverage J on Ord}, from the additional axioms 
of Læ: the first one says that the injection 0 >—> 1 generates a covering cosieve 
on 0, the second that the injection 2 — 3 which omits the middle element of 3 
generates a covering cosieve on 2, and the third that the injection 1 — 3 which 
corresponds to the middle element also generates a covering cosieve. It is then 
straightforward to verify that each individual morphism of Ord fm must gener- 
ate a covering cosieve, so that J consists of all nonempty cosieves, and hence the 
classifying topos Set[L.,] ~ Sh(Ord$m J) is equivalent to sh__[Ord fm, Set]. 

As before, the theory La is well known to be No-categorical: the fact that 
every countable dense linearly ordered set without endpoints is isomorphic to the 
ordered set (Q, <) of rational numbers is probably the best-known categoricity 
result in mathematics. So we may also identify Set[L..] with Cont(G) where 
G is the group of order-preserving permutations of Q, topologized so that the 
(pointwise) stabilizers of finite subsets of Q form a base of neighbourhoods of the 
identity; the fact that G is a coherent topological group (i.e. that these subgroups 
have finite bi-index) may be proved by methods like those of 3.4.1. And we may 
also establish the equivalence Sh(Ord*?,, J) = Cont(G) directly, by identifying 
each object n of Ordsm with the action of G on the left cosets of the stabilizer 
of an n-element set. Details of the proof can be found in [1085]. 
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Note that if we had omitted the nonemptiness axiom (T H (3x)T) from 
our definition of Læ, we should have obtained a theory which satisfies (b) and 
(a1) but not (a2): it has just two completions, both of which are No-categorical 
(one is Læ, and the other is the theory whose only model is empty), but the 
formula =(3x) TF is not provably equivalent in it to any existential formula (and 
the inclusion @ — Q is an embedding of Set-models of the theory which is not 
elementary). 


Example 3.4.12 For completeness, we also discuss the topos of sheaves on 
the atomic site (Setje, J) of finite sets and surjective functions, mentioned in 
C2.1.12(c) and C3.5.9(c), as a classifying topos. The Stone duality between 
Set; and the category Bool, of finite Boolean algebras restricts to an equiv- 
alence between Set and the category Bool;,, of finite Boolean algebras and 
injections. Now the functor category [Bool ;, Set] is a classifying topos for the 
algebraic theory of Boolean algebras, by 3.1.2; and arguments like those of 3.2.7 
show that [Set%?, Set] = [Bool ,,,Set} classifies decidable Boolean algebras, 
that is Boolean algebras equipped with a binary relation # and axioms say- 
ing that # is the complement of the diagonal. And imposing the coverage on 
Bool}, in which every inhabited sieve covers corresponds to adding the axiom 
which says that the Boolean algebra is atomless, i.e. 


(O#@) Fz (Ay)((y < £) A (0# y) A(y#2))) - 


Once again, it is well known that the theory of atomless Boolean algebras is 
complete and No-categorical; its unique countable model is the free Boolean 
algebra on a countable infinity of generators, or equivalently the algebra of clopen 
subsets of the Cantor space K. Thus we may conclude that Sh(Sety;., J) is 
equivalent to the topos Cont(G), where G is the group of automorphisms of 
this Boolean algebra, topologized as usual so that the pointwise stabilizers of 
finite subalgebras form a base of neighbourhoods of the identity. Equivalently, 
by Stone duality, we may regard G as the group of self-homeomorphisms of the 
Cantor space, equipped with a topology which turns out to be generated by sets 
of the form 
{g€G|geU andg EV} 


where U and V are open sets in the usual compact-open topology on the set of all 
continuous maps K — K. (The compact-open topology itself does not restrict 
to a group topology on G; the operation g ++ g~! is not continuous, which is 
why we have to modify the topology as above.) 


Finally in this section, we consider how much of the theory developed above 
survives when we generalize from coherent to geometric theories. The answer, 
unfortunately, is ‘not very much’. Lemma 3.4.5 remains true with ‘coherent’ 
replaced by ‘geometric’ throughout, and with a simpler proof — the reason is 
that every subobject in Set[T] ~ Gr of an object of CF°™ lies in C2°°™, and 


so the Booleanness of Set[T] implies that C#*°" has Boolean subobject lattices. 
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Hence, by the easy direction of 3.4.9(i), we can deduce that if T is a geometric 
theory whose classifying topos is Boolean, then every homomorphism of T-models 
in Set is an embedding. But the absence of the finiteness result 3.4.4 makes it 
much harder to prove any substantial results in the opposite direction. 

From another point of view, the difficulty is that, once we have removed the 
restriction to coherent toposes, the properties of Booleanness and atomicity no 
longer coincide. In fact we can give a syntactic condition (albeit a less useful 
one than 3.4.6) for an arbitrary geometric theory T to have an atomic classifying 
topos: we now describe this. Given T, we say that a geometric formula-in-context 
£.d is T-complete (or simply complete) if the sequent (@ tz +) is not provable 
in T, but for every geometric formula y in the same context either (¢ Fz y) or 
((@ Aw) tz L) is provable. This is clearly equivalent to saying that the theory 
obtained from T by adding a string of new constants € of the same length and 
type as Z, plus the extra axiom (T F;, ¢[{€/2]), is a complete theory; in particular, 
T itself is complete iff []. T is T-complete. 

By an atomic system for a geometric theory T, we mean a function assigning 
to each context £ over the signature of T a set Az of geometric formulae in that 
context, such that 


(i) (T Fe Voca: @) is provable in T, for each Z; 
(ii) for any two distinct members ¢ and y of Az, the sequent ((¢ A Y) Fz L) 
is provable in T; 
(iii) if 6 € Azy, then either @ is inconsistent or there is a member of A; 
T-provably equivalent to (Sy); 


(iv) if € Az and w is any atomic formula in the context 7, then either (¢ Fz Y) 
or ((ġ Aw) Fz L) is provable in T. 


Proposition 3.4.13 For a geometric theory T, the following are equivalent: 
(i) The classifying topos Set{T] is atomic (over Set). 


(ii) For every contezt £ over the signature of T, there is a set Bz of T-complete 
formulae in that context such that (T Fz Voeg, 9) is provable in T. 


(iii) T has an atomic system. 


Proof (i) © (ii): By 1.4.4(iv), condition (ii) is equivalent to saying that, for 
every context Z, the subobject lattice of {Z.T} in C#°°™ is atomic (and hence 
Boolean). This is clearly implied by (i); but it easily implies that every subobject 
lattice in C#°°™ is atomic, and hence we may obtain the same result for the 
oo-pretopos Gr, i.e. for Set[T]. (Alternatively, we could apply the Comparison 
Lemma (C2.2.3) to the inclusion in CF°"" of its full subcategory whose objects 
are T-complete formulae-in-context, and then observe that the latter, with its 
induced coverage, is an atomic site as defined in C3.5.8.) 

(ii) e (iii): Clearly, if we are given a family of sets Bẹ as in (ii), we may 
obtain an atomic system from them by eliminating duplication from each Bz: 
that is, we define A; to contain one member from each provable-equivalence 
class of formulae in Bz. Conversely, if we are given an atomic system of sets Ar. 


D3.4 BOOLEAN CLASSIFYING TOPOSES 929 


we obtain a family of sets Bz as in (ii) by eliminating any inconsistent formulae 
(that is, formulae ¢ for which (@ Fz L) is provable) from each Az: to prove that 
this works, we must show that each consistent member of Az is T-complete, i.e. 
that if @ € Az and Z.w is any geometric formula-in-context then either (¢ Fz w) 
or ((@A Ww) Fz L) is provable. We do this by induction on the structure of 
w: for atomic y, it is just clause (iv) of the definition of an atomic system. If 
Y = V,-,9i, then by induction we have either (¢ Fg 0;) for some i (whence 
(p Fz w)) or ((6AG;) Fz L) for all i (whence ((@ A Y) Fz L)). Similarly if w is a 
finite conjunction. Finally, suppose y = (dy). Let Ar be the set of formulae 
x in Az, for which (Ay)x is provably equivalent to ¢; we note that if x’ is any 
other member of Az y, then (dy)x’ is inconsistent with ¢ by conditions (ii) and 
(iii) of the definition, from which we may deduce ((@ A x’) Fz, L). Hence by 
condition (i) ¢ is provably equivalent (in the context Z, y) to the disjunction of 
all the members of Ag . It now follows easily that if some member x of A$, 
satisfies (x kz, 0) then we have (@ Fz w), and if none of them do so we have 
((PA 0) Fz L), whence ((6A W) kz L). o 


We note that, in the context of 3.4.13, T is (equivalent to) a coherent theory 
iff the sets Az of 3.4.13(iii) may be taken to be finite for all 7. Also, T is complete 
iff (it is consistent and) A} may be taken to be the singleton {T}. 


Example 3.4.14 As an application of 3.4.13, we construct an example of a 
connected atomic topos with no points, as promised in C5.2.14(a). The topos will 
be constructed as Set/T], where T is a complete theory satisfying the conditions 
of 3.4.13 but having no models in Set: the theory in question was first studied 
by J. Malitz [793] (see also [777]). We take a signature © having two sorts A and 
B, a constant ag: 1 —> A, three binary relation symbols # — A, A, < => A,A 
and €> A,B (as usual, we write these in infix notation), and a family of 
unary relation symbols R; — B, i € I, where J is a set whose cardinality is 
Strictly greater than that of the continuum. For convenience, we shall also use 
the notation x for the transitive closure of <, i.e. 


((z%y) ss VV (Azi,..-,2n)((x@ < 21) A (21 < 22) A A (Zn < v))) 
néeN 


(the case n = 0 of this disjunction being interpreted as (x < y)). 
The first group of axioms states that, in any model M, the interpretations 
of # and < make MA into a decidable complete binary tree with root Map: 


(T Fey ((£ = y) V (z #y))) 
(aga) Fe L) 
(T Fe ((a9 = z) V (ao X2))) 
(((y < £) A (z < x)) Fr,y,z (Y = 2)) 
(T Fe (Ay, z)((z < y) A (£ < z) A^ (y #z))) 
(((£ x y) A (£ < z) A (x£ < w)) Fryzw ((y = z) V (z = w) v (w = y))) 
((1Xy)Fr y (1 #y)). 
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The next group says that, for each element b of MB, the set {a € MA | 
(a,b) € [€]m} is a (full) branch of the tree MA, and that distinct elements 
of MB yield distinct branches: 


(T Fa (ao E€ u)) 
(((x € u) A (y < £)) Fu,z,y (y € u)) 
(E Eu) Fra GELA WEu) 
(a E€ u) Ay € u)) Fu,z,y ((£ = y) V (£ Xy) V (y<z))) 
(T Fuw ((u = v) v (Ax, y, z)((£ < y) A (z < 2) A (y # 2) A^ (y E€ u) A (z € v)))). 


Here the variables u and v are of sort B, and z,y,z are of sort A. Finally, we 
write down axioms saying that M B is partitioned into the sets M R;, and that 
each of these is dense in the set of all branches of M A (topologized as the Cantor 
set): 
(T Fu Vier Ri(u)) 
((Ri(u)ARj(u))Fut) (#3) 
(TF, (du)((2 € u) A Ri(u))) (for each i). 


It is clear that T cannot have any models in Set; for in any Set-model of the first 
two groups of axioms M B would have cardinality at most that of the continuum, 
and hence could not be partitioned into an J-indexed family of nonempty subsets. 
On the other hand, if J were a countably infinite set, then we could construct 
a model of T; and since the geometric propositional theory of bijections J — N 
is consistent (cf. the remarks after C1.2.9), we can construct a non-degenerate 
Boolean Set-topos in which T has a model; so in particular it is consistent. 

To verify that T has an atomic system in the sense of 3.4.13, we shall intro- 
duce a number of derived predicates. Let C,(x) (n € N) denote the predicate 
asserting that x sits at ‘level n’ in the binary tree, i.e. (for n > 2) the formula 
(Syi,---,Yn—1)((a0 < y1) A (y1 ~ Y2) A+++ A (Yn-1 x 2)). Similarly, let Dn (£, y) 
denote the assertion that x and y have a common predecessor at level n in the 
tree, but not at level n + 1; let E,,(z,u) denote the assertion that the level-n 
member of the branch u is a predecessor of x (or equal to x) but its level-(n + 1) 
member is not, and let F(u, v) be the assertion that the branches u and v have 
a common member at level n but not at level n + 1. It is easily verified that, 
from the axioms given above, we can deduce things like 


(T Faw (u =v) v V Fa(u,v))) 


nEN 


and that the disjunction on the right-hand side of this sequent is provably dis- 
joint. Now, given a context £1, £2,..-, Zr, U1, U2,..., Us over © (with the z; of 
sort A and the ux of sort B), we define a pattern in this context to be a finite 
conjunction of formulae which includes 


(i) for each j < r, just one of the formulae C,,(x;); 


(ii) for each pair (j1, j2), just one of the formulae D,,(z,,,25,); 
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(iii) for each k < s, just one of the formulae R;(ux); 


(iv) for each pair (kı, k2), either the formula (ux, = uz, ) or one of the formulae 
Fy, (uk,, Uka); and 


(v) for each pair (j,k), just one of the formulae E, (zj, ux). 


Clearly, in any context with more than one variable there will be many possible 
patterns which are inconsistent; but since we allowed inconsistent formulae to 
appear in the members of an atomic system, this does not matter. We define 
Azz to be the set of all possible patterns in the context Z, wu: the verification that 
this is indeed an atomic system is long but entirely straightforward. We note also 
that the only possible pattern in the empty context is the empty conjunction T; 
hence Set[T] is not only atomic but connected. 


Suggestions for further reading: Blass & Stéedrov [133], Bunge & Reyes [206], 
Makkai [777], Sčedrov [1085]. 


D3.5 Conceptual completeness 


The classical completeness theorem says that if something is true in all models 
of a theory, then it should be provable in the theory. But there is a stonger 
notion of completeness, which asserts that if something exists (in a ‘natural’ 
way) in all models of a theory, then ‘the theory should have a name for it’. The 
functional completeness result 3.1.3 is an example of this type of completeness. 
More generally, we should like to prove that if every model M comes equipped 
with some particular subobject Rm — M(Aj,...,An) (where Aj,...,An are 
sorts of the appropriate signature), and every homomorphism of models preserves 
this relation, then there should exist some formula-in-context z’.¢ (where z;: Ai 
for each i) whose interpretation in each model M is precisely Rm. (To take a 
simple example: the notion of semilattice may be defined — as a commutative 
idempotent monoid — in a way which makes no mention of a partial ordering; but 
from the fact that every semilattice possesses a canonical partial order, and every 
semilattice homomorphism is order-preserving, we should expect the ordering to 
be definable by a formula-in-context over the signature for monoids — and indeed 
it is, by the formula x, y.(mary = x), where m is the monoid operation.) 

If we interpret ‘every model M’ as meaning ‘every model in a category of 
the appropriate kind’, then this result, like the ordinary completeness result, 
becomes an easy consequence of the existence of the syntactic category; for we 
saw in 1.4.4(iv) that every subobject of an object of the form My(Aj,...,An) = 
{Z.T} is the interpretation in My of some formula-in-context Z.¢. But, if we 
wish to deduce definability results merely from the existence of relations on 
T-models in Set, then we have some further work to do. The following proof 
may remind the reader of the proof which we gave for 3.4.9(i): indeed, the latter 
may be seen as a special case of it. 
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Theorem 3.5.1 (Definability Theorem for coherent logic) Let T be a 
coherent theory over a signature %, let A,---Ay be a string of sorts of &, and 
suppose we are given, for every Set-model M of T, a distinguished subset Rm 
of M(A,,..., An) in such a way that each T-model homomorphism h: M —> N 
in Set maps Rm into Ry. Then there exists a coherent formula-in-contezt x. 
over ©, where xi: A; for each i, such that |Z.¢]u = Rm for each M. 


Proof First we extend © to a new signature ©, by adding a new relation 
symbol R — A; --- Án; then we may make each T-model M in Set into a £1- 
structure by interpreting the new symbol as Rm, and we define T; to be the set 
of all coherent sequents over ©; which are satisfied in all such structures. Clearly, 
Tı includes all the axioms of T. Next, we define F to be the set of all coherent 
formulae y over ©, in the particular context Z, for which the sequent (y +z R(Z)) 
is in Tı. We need to show that there is some y € I’ for which (R(z) Hz 7) is also 
in Ty. 

Suppose not; then for each y we can find a T,-model M and a string of 


elements m; € MA; such that (mj,...,m,) belongs to Rm but not to the 
interpretation of Z.y in M. Now let us enlarge ©; to Leg by adding a string 
of new constants c,...,Cn of sorts Ai,...,An, and let Tz be the theory over 


“2 consisting of the axioms of Tı plus all sequents (7[é/z] Fi} L) with y € F, 
and the single sequent (T Hi R(é)). We claim that T2 is consistent; for any 
derivation of (T Hp) L) from it would involve only finitely many of the axioms 
(y[€/z] F} 1), and since finite disjunctions of coherent formulae are coherent, 
the first sentence of this paragraph assures us that we can find a model for this 
subtheory of T2. 

Let Mo be a model of T2. Now let £; be the signature obtained from £2 by 
adding a new constant d,,: A for each m € MoA, as A ranges over all the sorts 
of ©; we make Mo into a Y3-structure in the obvious way. Let A be the set of 
all atomic sentences over X3 which are true in Mo, and consider the theory T3 
over 43 which consists of the axioms of T1, all sequents (T H} ô) with ô € A, 
and the sequent (R(c) Fi} L). 

It is easy to see that a model of T3 is essentially the same as a model N of Tı, 
together with a homomorphism h: My —> N (which sends an element m € MoA 
to the interpretation of dm in N), such that the n-tuple (h(mj),...,h(mn)) 
(where m; denotes the interpretation of c; in Mo) does not lie in Ry. But since 
(™m,...,™Mn) does lie in Rm, the hypothesis of the theorem says that no such N 
can exist. So the theory T3 is inconsistent; that is, (T -p 1) is derivable from 
it. As before, it must in fact be derivable from a finite subset of T3; let 51,..., Ôk 
be the members of A for which (T H 6;) is in this finite set, let 6 denote their 
conjunction, and let 6 be the formula obtained from ô on replacing each constant 
dm by a variable ym of the appropriate sort (but leaving the constants c;, if they 
occur). Since the dm do not occur in R(é) or in any of the axioms of Tı, it is 
easy to see that we can in fact derive a contradiction from 


Tı U{(R(Eb L), (T A (49)5)} . 


D3.5 CONCEPTUAL COMPLETENESS 933 


This means that the sequent ((Jy¥)¢ Fz R(Z)), where ¢ is obtained from 6 on 
replacing the constants c; by variables x;, is satisfied in all T,-models; thus 
Z. (Əy) belongs to the set I defined earlier. But, by construction, the n-tuple 
(m1,...,Mn) belongs to the interpretation of this formula in Mo, whereas it 
does not belong to the interpretation of any member of I’. This contradiction 
completes the proof of the theorem. o 


Remarks 3.5.2 (a) The hypothesis, in the statement of 3.5.1, that we are 
given a choice of Rm for every T-model M in Set can be relaxed: it is sufficient 
to have such a choice for every model in some given class M of models, and 
to know that these choices are compatible with all homomorphisms between 
models in M — provided M has suitable closure properties to ensure that the 
model N which we construct in the course of the proof of 3.5.1 belongs to it. (For 
example, it would be sufficient to suppose that M is a cosieve in T-Mod(Set).). 
We shall then arrive at a coherent formula-in-context 7. whose interpretation 
in every model in M is our given subset, although we have no control over its 
interpretation in other models of T — and, in general, it will not be unique up to 
T-provable equivalence (whereas, by classical completeness, it will be unique if 
we specify its interpretation in every T-model). 

(b) The definability theorem is not true for regular or cartesian logic, for an 
obvious reason: if we take Rm (for each model M) to be the union of two subsets 
defined by regular formulae-in-context, then it will be preserved by arbitrary 
homomorphisms, but will not in general be definable by a single regular formula 
(cf. 3.3.11). Nevertheless, we shall see later in this section that the conceptual 
completeness theorem, which for coherent logic may be viewed as a generalization 
of the result just proved, does hold for these two fragments of first-order logic. 
We should also mention that there is a ‘classical definability theorem’ for full 
(finitary) first-order logic, generally associated with the name of E. W. Beth, in 
which the hypothesis on the chosen subsets Rp is simply that they are preserved 
by arbitrary isomorphisms of T-models in Set: see (446, 6.6.4 and 10.5.1]. 


One possible way of regarding Theorem 3.5.1 is as follows. Using the notation 
of the proof, we have an interpretation of T in T4, that is a functor J (preserving 
the appropriate structure) from the syntactic category of T to that of Tı. Compo- 
sition with I yields a functor I* : T;-Mod(Set) — T-Mod(Set): of course, this is 
just the functor which ‘forgets’ the interpretation of the extra relation symbol R. 
This functor is part of an equivalence of categories; observe that each T-model 
has a unique extension to a T,-model, since the ‘naturality’ hypothesis ensures 
that there is a unique choice for Rm in every M (the identity morphism on M is 
a T-model homomorphism!). And the conclusion of the theorem says (roughly) 
that J itself is an equivalence of categories (or at least a weak equivalence, as we 
defined it in Section A1.1), since the ‘new’ object {Z.R(Z)} of Cr, is isomorphic 
to one in the image of I. (Incidentally, since everything we shall do in this sec- 
tion depends on the classical completeness theorem, and therefore requires the 
axiom of choice — or at least the Boolean Prime Ideal Theorem, cf. 1.5.11 — we 
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shall not worry too much about the difference between ‘equivalence’ and ‘weak 
equivalence’. ) 

It would thus be of interest to have a theorem of the following form: ‘Let 
C and D be small categories with structure (cartesian, regular or coherent, 
according to taste), and I: C — D a structure-preserving functor. If the functor 
I*: R(D, Set) — K (C, Set) induced by composition with J is a weak equivalence 
(where R denotes the meta-2-category of cartesian/regular/coherent categories), 
then J is a weak equivalence’. Such a result is known as a conceptual completeness 
theorem for cartesian/regular/coherent logic. 

For cartesian logic, the theorem is indeed true in the form just stated, and 
even in a stronger form: 


Lemma 3.5.3 
(i) Let C and D be small cartesian categories such that Cart (C, Set) ~ 
Cart (D, Set). Then C and D are equivalent. 


(ii) Let S and T be cartesian theories such that S-Mod(Set) ~ T-Mod(Set). 
Then S and T are Morita-equivalent. 


Proof (i) We recall from C4.2.2(v) and B3.2.5 that, for a small cartesian cate- 
gory C, the category Cart (C, Set) may be identified with the free filtered-colimit 
completion Ind-C°P of C°P; moreover, C°P is fully embedded in this category 
by the Yoneda embedding, and its objects may be recovered as the finitely- 
presentable objects of Ind-C°P by C4.2.2(ii) (note that, since C is cartesian, it 
is Cauchy-complete). So any equivalence Ind-C°? ~ Ind-D°? must restrict to an 
equivalence C ~ D. 

(ii) The second statement of the lemma is a simple translation of the first 
into the language of first-order theories, by 1.4.7 and 1.4.8. o 


Remark 3.5.4 For coherent logic, the conceptual completeness theorem can- 
not be true in this simple-minded form: there can be equivalences S-Mod (Set) ~ 
T-Mod(Set) which are not induced by any functor between the syntactic cate- 
gories of S and T. For a counterexample, consider the propositional theory Tg 
of prime filters of a Boolean algebra B, defined as in 1.1.7(m). Since the prime 
filters of a Boolean algebra are all maximal (cf. 4.6.16 below), the category of 
Set-models of Tg is (equivalent to) the discrete category whose objects are the 
prime filters. So an equivalence between two such categories, for Boolean alge- 
bras Bı and B2, would simply say that their spectra have the same cardinality; 
whereas, by 3.3.14, Morita equivalence of the theories would say that the spectra 
of Bı and B2 are homeomorphic (and hence, by Stone duality, that Bı and B2 
are isomorphic). It is easy to give examples of pairs of Boolean algebras satisfying 
the first condition but not the second. 


Indeed, for regular and coherent logic, the conceptual completeness theorem 
does not even hold in the form given before 3.5.3. For if C is any non-effective 
small regular category, then the embedding of C in its effectivization is not a weak 
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equivalence, but it induces an equivalence Reg (C, Set) ~ Reg (Eff(C), Set) by 
A3.3.10, since Set is effective regular. Similarly for the embedding of any small 
coherent category C in its pretopos completion Eff(Pos(C)). Thus the best we 
can hope for is a conceptual completeness theorem in which we restrict the 
categories C and D to be effective regular in the first case, and pretoposes in 
the second. (This is of course an important part of the reason why we defined 
Morita equivalence for regular and coherent theories in the way in which we did.) 
However, we note that this does not affect the definability result from which 
we started in this section: for the processes of effectivizing a regular category, 
and of positivizing a coherent category, do not add any new subobjects of the 
objects of the original category. (In the terminology to be introduced below, the 
embedding of a regular category in its effectivization, or of a coherent category 
in its positivization, is full on subobjects.) 

In what follows, we shall (as in Section D3.3) deal almost exclusively with 
the coherent case; but the theorem is also true in the regular case, and may be 
proved by very similar methods. We shall discuss the changes which need to be 
made for the regular case in 3.5.12 below. In fact we may break the theorem 
down into a series of lemmas, each of which derives properties of the functor J 
from ‘dual’ properties of I*. For the first of these, the assumption that we are 
dealing with pretoposes rather than arbitrary coherent categories is not needed: 


Lemma 3.5.5 Let C and D be small coherent categories, and I:C ~D a 
coherent functor. If the functor I*: Coh (D,Set) — Coh(C,Set) induced by 
composition with I is essentially surjective on objects, then I is conservative. 


Proof Let Tc and Tp denote the (coherent) theories of coherent func- 
tors defined on C and D, as constructed after 1.4.11. Let m: A’ — A be a 
monomorphism in C such that J(m) is an isomorphism. Then the sequent 


(T Fe (Ay)(m(y) = z)) 


(where x and y are variables of sorts A and A’ respectively) is satisfied in any 
Tc-model which is the image under ]* of a Tp-model. But the hypothesis of the 
lemma says that every Tc-model in Set is isomorphic to such a model; so by 
the completeness theorem 1.5.10(ii) it is provable in Tc, and hence satisfied in 
the model which corresponds to the identity functor on C. In other words, m 
is a cover, and hence an isomorphism, in C. So it follows from A1.2.4 that J is 
conservative. o 


The next lemma is really the heart of the proof of conceptual completeness 
(as may be seen from the fact that the definability theorem is used in proving it). 
To state it, we need a new definition: we say that a functor I: C — D (which is 
assumed to preserve monomorphisms) is full on subobjects if, given any object A 
of C and a subobject B — IA in D, there exists A’ — A in C such that JA’ = B 
in Subp(J A). 


936 CLASSIFYING TOPOSES D3.5 


Lemma 3.5.6 Let C and D be small coherent categories, and I1:C ~ Da 
coherent functor. If the induced functor I*: Coh (D, Set) — Coh (C, Set) is full, 
then I is full on subobjects. If in addition I* is essentially surjective on objects, 
then I is full. 


Proof Suppose given a monomorphism B — JA in D. As before, we let Te 
and Tp denote the theories of coherent functors defined on C and D. If I* is 
not essentially surjective on objects, then not every Tc-model is of the form 
I*(N), where N is a Tp-model; but, as we remarked in 3.5.2(a), this does not 
matter. For any model M which is of this form, we may choose a Tp-model 
N with I*(N) = M and then define Ry C MA to be (the image of) the 
composite NB — NIA = MA; then the hypothesis of the lemma ensures that 
this choice of subobjects is preserved by arbitrary Tce-model homomorphisms. 
So by the definability theorem 3.5.1 there is a coherent formula-in-context z. 
(where x: A) such that, if A’ —> A denotes the interpretation of this formula 
in the Tce-model corresponding to the identity functor on C, then JA’ — JA 
and B — JA are mapped to isomorphic subobjects by every coherent functor 
D — Set. An easy application of classical completeness then shows that we must 
have IA’ © B in Sub(JA). 

For the second assertion, we have simply to note that if a cartesian func- 
tor I is both conservative and full on subobjects then it must be full. For if 
g: IA — IB is any morphism in D, its graph is a subobject of I(A x B), and 
so must be isomorphic to JA’ for some (d,e): A’ — A x B in C. But then the 
conservativity of J implies that d is an isomorphism, so this subobject is the 
graph of a morphism f = ed~! in C, which clearly satisfies If = g. O 


For the next stage of the proof, we again need a new definition. Let us say 
that a coherent functor I: C —> D is finitely covering if, for each object B of D, 
we can find a finite family of objects (Ai,..., An) of C, subobjects C; — IA; in 
D, and morphisms C; — B in D the union of whose images is the whole of B. 


Lemma 3.5.7 Let C and D be small coherent categories, and I:C > D a 
coherent functor such that I*: Coh (D, Set) — Coh (C, Set) is faithful. Then I 
is finitely covering. 


Proof Let B be an object of D, and consider the set X of pairs (A, ø) where 
A is an object of C and ¢ is a coherent formula over the signature of Tp, having 
free variables x and y of sorts JA and B respectively, for which it is provable in 
Tp that ‘¢ denotes a partial map from JA to B’; that is, the sequent 


(OA gly’/y]) Fey (y= y')) 


is provable in Tp. Add a new constant c of sort B to the signature of Tp, and 
consider the theory T, over this enlarged signature whose axioms are those of Tp 
plus all axioms of the form (g/c/y] +, +) where (A,¢) € X. If I is not finitely 
covering at the object B, then it is easy to see that each finite subset of T, has 
a model, and hence T; is consistent. 
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Consider a particular model of Tı; we regard it as a pair (M, mo) where M 
is a coherent functor D — Set and mp is the particular element of MB which 
interprets the constant c. Our aim now is to construct another coherent functor 
N:D — Set and two natural transformations a, 3: M 3 N which agree at all 
objects of the form JA but satisfy ag(mo) 4 Ba(mo); this will contradict the 
hypothesis that J* is faithful, and so complete the proof. Much as in the proof of 
3.5.1, we construct the triple (N,a, 3) as a model of a suitable coherent theory 
which we can show to be consistent. 

First we enlarge our signature again, by adding two new constants d,, and 
€m, both of sort C, for each element m € MC (and each object C of D). Let A 
be the set of all sequents (T H} 6), where ô is an atomic sentence involving the 
d,, but not the e,,, which are satisfied in M when each d,, is interpreted as m, 
and let E be the set of similar sequents involving the em but not the dm. Let T 
be the set of all sequents (T Fi} (dm = €m)) where the sort of dm and em is in 
the image of J. Then it is clear that a model of the theory 


To =TpUTUAUE 


is essentially the same thing as a coherent functor N: D — Set equipped with 
two natural transformations M = N (sending m to the interpretations of dm 
and em respectively) which agree on objects of the form JA. It thus suffices to 
show that we may consistently add the sequent ((dm, = emo) Fg 1) to this 
theory. 

Suppose some finite subset T3 of T2 U {((dmo = emo) Fy) L)} is inconsistent. 
By adding more axioms to T3 if necessary, we may assume that it contains 
exactly the corresponding members of A and of E; also that it contains the 
sequent (T Fi) (dm = €m)) whenever dm is mentioned in one of the members of 
A which appear in T3 and has sort in the image of I. Let ô be the conjunction of 
the atomic sentences ð; such that (T H} 6;) appears in T3; then we may replace 
all these sequents by a single sequent (T H} 6), and similarly we may replace 
the sequents in E which appear by a single sequent (T F 1) €). Moreover, we may 
write 6 = old, d2, ding /#, Y, z| and € = @[é), €2, €m,/Z, Y, z], where ¢ is a formula 
in which we have substituted free variables for the constants appearing in ô, 
and we have separated the latter into three groups: those whose sorts are in the 
image of J, those other than dm, whose sorts are not in the image of J, and dmo 
itself. But now the failure of T3 to have a model tells us that every Tp-model 
must satisfy the sequent 


(ODA A (39) Gl2'/2]) Ezz (2 = 2’) 5 


in other words, it is provable in Tp that the formula (39) denotes a partial map 
IA — B, where A is the product (in C) of the sorts of the variables in 7. So the 
pair (A, (S¥)¢) belongs to the set X that we constructed at the beginning of the 
proof — which contradicts the fact that we chose (M, mo) to be a model of T}. 
g 
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To complete the proof of the conceptual completeness theorem, we need 


Lemma 3.5.8 Let C be a pretopos, D a coherent category, and I:C ~Da 
coherent functor which is full on subobjects, faithful and finitely covering. Then 
I is essentially surjective on objects. 


Proof Given any object B of D, we have a finite covering family of partial 
maps (IA; — B|1<j <n). Since J is full on subobjects, the domain of each 
member of this family may be taken to be of the form J Aj for some Ai, — A; in C; 
then we may form the disjoint coproduct A = U7?_, Aj in C, and since I preserves 
disjoint coproducts we have a cover TA — B in D. Let S > IAx IA = 1(Ax A) 
be the kernel-pair of this cover; then we may again write S = IR for some 
R — A x A in C, and since 7 is faithful and hence conservative by A1.4.9, it 
is easy to see that R ‘inherits’ the property of being an equivalence relation 
from S. Let A —> A be a cover in C whose kernel-pair is R = A; then [A —> TA 
and JA — B are both coequalizers of S = IA in D, and hence JA & B. go 


Putting everything together, we now have 


Theorem 3.5.9 (Conceptual Completeness for coherent logic) 
(i) Let C and D be small pretoposes, and I: C —> D a coherent functor such 
that I*: Coh (D, Set) + Coh (C, Set) is a weak equivalence. Then I is a 
weak equivalence. 


(ii) Let S and T be coherent theories, and suppose we are given an interpretation 


of S in T for which the induced functor T-Mod(Set) — S-Mod(Set) is a 
weak equivalence. Then S and T are Morita-equivalent. 


Proof The first statement is immediate from combining the last four lemmas. 
The second is simply its translation into the language of coherent theories. C 


We may also state the theorem in a purely topos-theoretic form, as follows: 


Corollary 3.5.10 Let E and F be coherent toposes, and let f: F + € be 
a geometric morphism such that f* maps coherent objects of E to coherent 
objects of F, and such that composition with f induces a weak equivalence 
Top/Set (Set, F) — Top/Set (Set, €). Then f is an equivalence. 


Proof Since f* preserves coherent objects, it restricts to a coherent functor 
between the pretoposes of coherent objects of € and F; the second hypothesis 
on f implies that this functor satisfies the hypothesis of 3.5.9(i). (a 


As an application of conceptual completeness (in fact of 3.5.7), we show that 
the situation encountered in Example 3.1.15 cannot occur amongst coherent 
toposes. 


Corollary 3.5.11 
(i) Let E be a coherent topos such that the category Top/Set (Set,€) is a 
preorder. Then E is localic over Set. 
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(ii) Let T be a coherent theory such that the category T-Mod(Set) is a preorder. 
Then T is Morita-equivalent to a propositional theory. 


Proof The two statements are clearly equivalent; we shall prove the first. 
Suppose E ~ Sh(C, T) where C is a small pretopos and T is its coherent coverage; 
then, as we saw in 3.3.15, we may identify its localic reflection £ (over Set) 
with Sh(Subc(1),7’) where T’ is the coherent coverage on Subc(1). Moreover, 
the hyperconnected geometric morphism h: E€ — L may be identified with that 
induced by the inclusion functor I: Sube(1) — C, as we saw in C2.3.22. Now I 
is clearly conservative and full on subobjects; and the functor Coh (C, Set) — 
Cobh (Subc(1), Set) which it induces is trivially faithful, since its domain is a 
preorder. So by 3.5.7 I is finitely covering. It is now easy to see that J induces a 
(weak) equivalence from the pretopos completion of Subc(1) to C; hence A is an 
equivalence, and € is localic. E 


Remark 3.5.12 As we mentioned earlier, the definability theorem 3.5.1 
does not hold (in the form in which we stated it) for regular logic. Nev- 
ertheless, the conceptual completeness theorem does hold: if C and D are 
effective regular categories, and I:C +D is a regular functor such that 
I*: Reg (D, Set) — Reg (C, Set) is a weak equivalence, then I is a weak equiva- 
lence. The key ingredient which we need, in order to adapt the proof just given 
for coherent logic to the regular case, is Lemma 3.3.11. For example, in the 
proof of 3.5.6, the definability theorem yields, for a subobject m: B — IA in D, 
a coherent formula-in-context z.¢ (where x is of sort A) over the signature of 
Tc, such that the regular formula z.(Sy)(my = 2) is provably equivalent in Tp 
to (the translation via J* of) x.ġ. But we may write ¢ as a finite disjunction 
Viz: $i of regular formulae, by 1.3.8(ii); so in the case when Tp is a regular 
theory we may conclude from 3.3.11 that x.(ay)(my = z) is actually equivalent 
to one of the x.¢;. Similar modifications need to be made to the proof of 3.5.7 
(in which we replace the finite covering family of partial maps by a single partial 
map); we omit the details. 


Remark 3.5.13 We should also mention that a substantially different proof of 
the conceptual completeness theorem has been given by A. M. Pitts [972]. Pitts’s 
proof is more categorical in spirit, and derives the result from an ‘interpolation 
property’ for cocomma squares in the 2-category of pretoposes. In [974], Pitts 
extended this result to the 2-category of Heyting pretoposes (that is, pretoposes 
which are Heyting categories) and obtained a conceptual completeness theorem 
for full first-order logic. Unfortunately, the amount of machinery which has to 
be developed on the way to Pitts’s proof is such as to place it beyond the scope 
of this book. 


Suggestions for further reading: Ballard & Boshuck [43], Butz & 
Moerdijk [216], Makkai [781], Makkai & Reyes [790], Pitts [972-974], 
Zawadowski [1260]. 


D4 
HIGHER-ORDER LOGIC 


D4.1 Interpreting higher-order logic in a topos 


We now turn from first-order logic to higher-order logic: the principal extra 
feature of this logic, compared with that which we studied in Chapter D1, is 
that we have variables running over types whose intended interpretation is the 
set of all subsets of (the set denoted by) some other type, or the set of all 
functions between two such types — and we can therefore quantify over elements 
of such ‘higher types’. The extra feature of our formal languages which makes 
this possible is the presence of type constructors, which build new types from old. 
In fact, as we remarked after Definition 1.1.1, the first of these type constructors 
(the product type constructor) was implicitly present already in our account of 
first-order logic, in that our function symbols and relation symbols had types 
assigned to them which were finite strings of sorts, and which were formally 
interpreted as products of those sorts. Now that we need other type constructors, 
it makes sense to add the product type constructor explicitly to our definition 
of a signature. We shall find it convenient to maintain the distinction between 
sorts which are the basic types given to us, along with the primitive function 
symbols and relation symbols, as part of the specification of our signature, and 
the types of the signature which may be either sorts or ‘compound types’ built 
up using type constructors. 


Definition 4.1.1 A signature © for higher-order logic is defined by specifying 
a set L-Sort of sorts, a set Ł-Fun of function symbols and a set -Rel of relation 
symbols as in 1.1.1, together with a number of type constructors chosen from the 
following list, which recursively define the set ©-Typ of types: 

(i) Basic types: Each sort is a type. 


(ii) Product types: There is a distinguished type 1 (the empty product); and if 
A and B are types, then A x B is a type. 


(iii) Function types: If A and B are types, then [A— B] is a type. 
(iv) Power types: If A is a type, then PA is a type. 
(v) List types: If A is a type, then LA is a type. 


To each function symbol f € &-Fun we assign an ordered pair (A, B) of elements 
of &-Typ; as before, we write f: A — B to indicate that f has the pair of types 
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(A, B). Similarly, each relation symbol R € S-Rel has a type in Ł-Typ, and we 
write R — A to indicate that R has type A. 


An alternative approach (which is used in [682], for example) is to assume 
given the entire set ©-Typ as part of the primitive data, together with binary 
operations (A,B + A x B), (A,B +> [A— B]) and so on, on this set. Clearly, 
Definition 4.1.1 is more restrictive, in that we require ©-Typ to be freely gen- 
erated by the type constructors from the subset /-Sort of basic types; however, 
there are occasions when it is useful to be able to argue by induction over the 
‘complexity’ of types, and for this purpose the recursive definition of types which 
we have given is essential. 

If our signature has the type constructors (ii) and (iv), we shall often write Q 
for the particular type P1. Similarly, if it has constructors (ii) and (v), we write 
N (the type of natural numbers) for L1. Sometimes we may wish to consider 
signatures which have the type N but not the general list type constructor. 
From now on, all our signatures will contain the product type constructor; thus 
we have restricted all our primitive function and relation symbols to be unary. 
(If our signature also contains function types, then we can further restrict our 
function symbols to be constants (function symbols of type (1, B) for some B), 
since a function symbol f : A — B can be replaced by f: 1 — [A— B]. And if © 
has power types, then we can replace all primitive relation symbols by constants 
as well: a relation R — A becomes a constant "R^: 1 — PA.) 

Associated with each type constructor, we have term constructors enabling us 
to build terms of the various compound types. The term constructor for power 
types in fact builds terms from formulae, so over a signature which contains 
power types we have to define terms and formulae by a simultaneous recursion; 
nevertheless, we shall present the recursive clauses of the definition for terms 
and for formulae separately in the two definitions which follow. Another feature 
which was not present in first-order logic is that some of the term constructors 
have the effect of binding variables, and so we have to keep track of the set F V(t) 
of free variables in a term t, as well as the free variables in a formula. 


Definition 4.1.2 Let © be a higher-order signature. The collection of terms 
over È is defined recursively, together with the type of each term (we write t: A 
to indicate that the term ¢ has type A) and the (finite) set F V(t) of free variables 
of each term t, by the following clauses: 


(i) We have a stock of variables x: A for each type A; FV(z) = {zx}. 

(ii) If f: A — B is a function symbol of © and t: A, then f(t): B; FV(f(t)) = 
F V(t). (Since we have now restricted ourselves to unary function symbols, 
we often write simply ft rather than f(t).) 


(iii) (a) We have a distinguished term >: 1; FV(*) = 0. 
(b) If s: A and t: B, then (s,t): A x B; FV((s,t)) = FV(s) UF V(E). 
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(iv) Ift: A x B, then fst(t): A and snd(t): B; both of these terms have the 
same free variables as t. 


(v) If ¢ : B and z is a variable of type A, then Ar: A.t : [A> B]; 
FV(Az: A.t) = FV(t) \ {x}. (If we do not need to specify the type of 
the variable z, then we simply write Axv.t for Ax: A.t.) 

(vi) If s: [A— B] and t: A, then app(s,t): B; FV(app(s,t)) = FV(s) U FV (t). 
(We often abbreviate app(s,t) to s(t), or even to st, but for formal 
purposes it is useful to have a notation which distinguishes between appli- 
cation of a function symbol to a term, as in (ii), and application of a term 
whose type is a function type.) 


(vii) If ¢ is a formula and z is a variable of type A, then {x: A | ¢}: PA; 
FV({x: A | ¢}) = FV(¢) \ {x}. (Again, we may abbreviate this to {x | ¢} 
if it is not necessary to specify the type of z.) 


(viii) (a) For each type A, there is a distinguished term [] of type LA; 
FV((]) = 0. 

(b) If s: A and t: LA, then cons(s,t): LA; FV(cons(s,t)) = FV(s) U 
FV(t). (Intuitively, cons(s, t) is the list obtained by appending s to 
the list t. In the particular case A = 1, we normally write succ(t) for 
cons(x, t) and zero for [].) 


(ix) The remaining term constructor associated with list types, the iterator, 
is more complicated. Suppose we have variables z, y of types A, B respec- 
tively (here A and B may be the same type, but z and y must be distinct), 
and suppose given terms s,t,u of types A, A, LB respectively, such that 
x and y are not free in either t or u. Then iter, „(s,t,u) is a term of type 
A, in which all occurrences of x and y in s are bound, but the other free 
variables of s, t and u remain free. (The intuition behind this definition 
is that A denotes a set of ‘values’ and B a set of ‘instructions’: the term s 
(which is implicitly assumed to involve the variables x and y, even though 
it need not do so in practice) is thought of as a ‘machine’ which, given 
an input value z and an instruction y, produces an output value from 
them. Then iterz,,(s,t,u) denotes the value at which we arrive, starting 
from the input value t, if we make the machine follow the ‘program’ (list 
of instructions) u.) In the particular case B = 1 (so that LB = N), we 
can simplify this constructor by omitting the variable y: thus we simply 
write iter,(s,t,u) in this case. 


Of course, clauses (v) and (vi) of the definition apply only if we have function 
types in E, clause (vii) only if we have power types, and clauses (viii) and (ix) 
only if we have list types. Note that, except for power types, the clauses come in 
pairs consisting of one or more ‘constructors’ which build terms of a compound 
type from terms of simpler types (clauses (iii), (v), (viii)) and one or more 
‘destructors’ which build terms of simpler types from terms of the compound 
type (clauses (iv), (vi) and (ix)). The reason why we have no ‘destructor’ for 
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power types is that, just as the constructor for power types builds terms out of 
formulae rather than terms, the corresponding destructor builds formulae out of 
terms of a power type ~ it thus appears as clause (iii) of the next definition. 


Definition 4.1.3 The atomic formulae over © are of three kinds: 


(i) If R — A is a relation symbol and t: A, then R(t) is an atomic formula; 
FV(R(t)) = FV(t). 
(ii) If s and t are terms of the same type A, then (s =, t) is an atomic formula; 
FV(s =4 t) = FV(s) UF V(t). 
(iii) If s: A and t: PA for some A, then (s €4 t) is an atomic formula; 
FV(s €4 t) = FV(s) UFV(t). 


The compound formulae over © are built up from the atomic formulae exactly 
as in clauses (iii)—(x) of Definition 1.1.3. 


We normally omit the type subscripts from =4 and €,4 whenever it is safe 
to do so. 

In order to define the notion of /-structure in a category C, we need to know 
that C has the categorical structure required to interpret the type constructors 
present in ©. Thus, if © contains product types, we require C to have finite 
products; if © contains function types, we require C to be cartesian closed; if © 
contains power types, we require C to have power objects (i.e. to be a topos), 
and if © contains list types, then we require C to have list objects as defined in 
A2.5.15. (In fact if C is not cartesian closed, then we need to strengthen the defi- 
nition of list object along the lines suggested in A2.5.3: we say that (LA,0,4,s,) 
is a list object over A if, given morphisms b: Č — B and t: A x B xC >B, 
there is a unique f: LA x C — B making 


0a X1 sax l1 
1 x C — > LA xC mm AxLAxC 


72 f (1 x f,73) 


C S Ba AxBxC 
commute. If C is cartesian closed, then this version of the definition may easily be 
deduced from the special case C = 1 which we gave as the definition in A2.5.15, 
just as we deduced A2.5.2 from A2.5.1 for natural number objects.) We shall 
summarize the above restrictions on C by saying ‘C is a suitable category for X’. 


Definition 4.1.4 Let © be a higher-order signature, and C a suitable category 
for ©. A /-structure M in C is defined by specifying a function =-Sort — ob C, 
which we denote A +> MA and which we extend to a function ©-Typ — ob C ‘in 
the obvious way’; i.e. we define M1 to be the terminal object 1 of C, M(A x B) 
to be MA x MB, M[A-— B] to be the exponential MB™”4, M(PA) to be the 
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power object P(M A) and M(LA) to be the list object L(M A). In addition, M 
is required to specify a morphism Mf: MA — MB for each function symbol 
f: A— B of £, and a subobject MR — MA for each relation symbol R — A 
of È. i 


Note that, in contrast to Definition 1.2.1, we have not defined the notion of 
a homomorphism of X-structures. The reason is that the power object functor 
is contravariant, and the exponential functor (A, B) ++ B4@ is contravariant in 
its first variable; so if we are given a family of morphisms MA — NA for each 
sort A, we will not in general obtain morphisms MA — NA for each type A. 
Nevertheless, we can define a notion of isomorphism of S-structures in an obvious 
way; and these are the only morphisms of higher-order structures which we shall 
need to consider in practice. 

The notions of context, of term-in-context, of formula-in-contezt and of 
sequent are all defined just as they were in the first-order case. 


Definition 4.1.5 If M is a /-structure in a suitable category C (and in addi- 
tion C has the structure required to interpret the propositional connectives and 
quantifiers involved), we assign to each term-in-context z.t of type B an inter- 
pretation [Z.t]m: MA) x--- x MA, —> MB (where Aj,..., An is the string of 
types of the variables in the context 7), and to each formula-in-context 7.¢ an 
interpretation [Z.ġ]m — MA, x --- x M An, by the following clauses (in which 


we omit the subscript M’s, and write A for the product type A; x =: x An): 
(i) If t is a variable z;, then [T.t] is the projection 7;. 
(ii) If t is f(s) (where s: C, say), then [z.t] is the composite 


[z.s] Mf 
MA ——> MC —— MB. 


(iii) If t is x, then |Z.t] is the unique morphism MA — 1. 
(iv) If B = B, x Bo and t is (r,s), then [Z.t] is 


Z.ri,|z.s 
MA Meh est) MB, x M B2. 


(v) If t is fst(s) where u: B x C, then |Z. ¢] is 


MA ——> MB x MC — MB 
(and similarly if t is snd(s)). 
(vi) If B = [C— D] and t is Az.s (where we assume for convenience - by 
renaming the bound variable if necessary — that z does not appear in the 
context T), then [Z.t] is the exponential transpose of 


Iz, z.s] 
MA x MC ———> MD. 
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(vii) If t is app(r,s) where r: [C — B], then [Z.t] is 


T.r, rage 
ma OD pene l= Hi 


(viii) If B = PC and t is {z: C | ġ} (where, once again, we assume that z does 
not appear in 7), then [z.t] is the name of the relation 


[Z,z.¢] >> MAx MC. 
(ix) If B = LC and t is [], then [z. t] is 


OMC 
MA ———> 1 —— L(MC). 


(x) Similarly, if B = LC and t is cons(r, s), then [z.t] is 


Mma EAEN yox LIMO) —!S LMC). 


(xi) If ¢ is iter, .(r,s,u) where y: B and z: C, then [Z.t] is the composite 


([Z-u], 1) f 
MASS (MC) XMAS, 


where f is the unique morphism making 


omc X 1 smc X1 
1x MA—“2" L(MC)x MA <“ MC x L(MC) x MA 


T2 f (1 x f,73) 


[z.s] [z, y, Z-r] 
MA ——— MB <c MC x MBx MA 


commute. 


(xii) If ¢ is the atomic formula (s €p t), then [7%.¢] — MA is the subobject 
obtained by pulling back the universal relation €y;7g—> MB x P(MB) 
along 


wa ESE De x P(MB), 


(xii-xxiii) The interpretations of the other atomic formulae, and of all 
compound formulae, are defined exactly as in clauses (i)-(x) of 
Definition 1.2.6. 


Of course, we say a /-structure M satisfies a sequent (@ kz w) if [¢.¢] < 
[Z. y] in Sub(M A); and we say M is a model for a theory T over © (that is, a 
set of sequents over È) if it satisfies all the sequents in T. 
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As in the first-order case, we have a Substitution Property for the terms and 
formulae of our higher-order languages, which is proved in the same way as 1.2.4 
and 1.2.7. 


Lemma 4.1.6 Suppose M is a ¥-structure in a suitable category C (and that 
the first-order structure of C, to the extent that it is used in the interpretation of 
the terms and formulae mentioned below, is stable under pullback). Let 7.t be a 
term-in-context (respectively, let y.p be a formula-in-context) over the signature 
X, let ï be a string of terms of the same length and type as the string y, and let 
X be a context suitable for all the terms in &. Then [Z.t[5/y\] is the composite 


M(A, paran ee a el) aie xx B,) IS Me 


(respectively, [Z.¢[&/y]] is the pullback of [¥.¢] — M(B, x--- x Bm) along the 
first factor of the above composite), where Ai, B, and C are the types of zi, y; 
and t respectively. 0 


We next record the relationships between the term constructors and destruc- 
tors introduced in 4.1.2 and 4.1.3, which express in the formal language the fact 
that they are associated with certain bijections in the categorical structure. We 
say a sequent is valid in a (suitable) category C if it is satisfied by every structure 
(for the appropriate signature) in C. 


Lemma 4.1.7 
(i) In any category with finite products, the sequents (T Hu (w = *)), 


(T Hry (fst((£,y)) = x£)), (T Fay (snd((z,y)) =y)) and 
(T Fz ((fst(z), snd(z)) = z)) 


are valid. (Here w and z are variables of types 1 and A x B respectively; 
the types of x and y are arbitrary.) 


(ii) In any cartesian closed category, the sequents 
(T Faz (app(Ay.t,z) = t[z/y])) and (T Fu (Ay.app(u,y) = u)) 


are valid. (Here t is a term of arbitrary type whose free variables are in 
the string x,y, and u is a variable of function type.) 


(iii) In any topos, the sequents 
((z €a {y: A| o}) dea. O[z/y]) and (Thy ({z: A] xz €4 w} = w)) 


are valid. (Here @ is any formula with free variables in the string Z,y, and 
w is a variable of type PA.) 
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(iv) In any category with finite products and list objects, the three sequents 
(T Fz (terz y(s, w, []) = w)), 


(T Hzuw,w (ters (s, w,cons(u, v)) = sfiter, (s, w, v), u/zx,y])) 


and (T F, (iterz,,(cons(y,z),[],z) = z)) are valid. (In the first two of 
these, s is a term whose free variables are in the string z, y, Z, and in the 
third x,y,z are variables of types LA, A, LA respectively.) 


Proof These are all straightforward when we translate them into assertions 
about morphisms in categories via the clauses of 4.1.5. For example, the two 
sequents in (ii) assert that transpositions in the two directions across the expo- 
nential adjunction are inverse to each other, and (iii) similarly encodes the 
bijection between morphisms A — PB in a topos and subobjects of A x B. 
The first two sequents of (iv) are just the commutativity of the two cells in the 
diagram of 4.1.5(xi), and the third one follows from the fact that the unique 
morphism LA — LA commuting with o, and sz, is the identity. o 


As in Section D1.3, we may associate with any suitable category C a canonical 
signature Sic, with one sort 'A' for each object A of C, one function symbol 
rf: CA "B? for each morphism f: A — B of C, and one relation symbol 
FR? =" A" for each monomorphism R — A in C. (Strictly speaking, we should 
require C to be small here, but in practice we shall use this notation informally 
for non-small categories as well. Provided C has the categorical structure needed 
to interpret the type constructors of Uc, we then have a canonical Xe-structure 
in C (to which we do not bother to give a name).) 

Note, incidentally, that Xe has many more types than C has objects, since 
"A'x' Bis different from "A x B”, and so on, although they are interpreted as 
the same object of C. Informally, we shall tend to identify two types of Uc if they 
‘become equal on removing the  ”’s’, and thus if (for example) f: PA — BY is 
a morphism of C we shall feel free to use "f? as a function symbol of type 
PTA — [-C7—" BY, rather than "PA? — "BON, if it seems appropriate to 
do so. Similarly, we shall feel free to commit such abuses of notation as writing 
{(x,y): Ax B | ¢}, where ¢ is a formula having free variables x and y of types 
A and B, for the term of type P(A? x "B") (or of type "P(A x B)*) which 
should properly be written as {w:"A'x'B" | [fst(w), snd(w)/z,y]}. For the 
formal mechanism needed to make this precise, see 4.2.7 below. 

Of course, we say that C satisfies a higher-order sequent ø over Uc, and write 
C — ø, if the nameless Nc-structure in C satisfies ø. (In the particular case — 
which in fact suffices when we have full first-order logic available, as we remarked 
after 1.1.5 — that ø is of the form (T Hi $), we shall simply say that C satisfies 
the sentence ¢.) Thus we may use the higher-order language over Uc to reason 
about objects and morphisms of C, as we did in the first-order case in 1.3.13 
and 1.3.14. In the particular case when C is a topos, we shall often refer to the 
higher-order language over Uc as the internal language of C. 
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As an example of the uses to which the canonical signature may be put, we 
prove a long-promised result, the ‘Cantor diagonal argument’ for toposes. 


Proposition 4.1.8 
(i) If a topos E contains an object A such that PA is a subquotient of 
A (cf. A4.6.1), then E is degenerate (i.e. equivalent to the terminal 
category 1). 


(ii) If there exists a monomorphism A^ >—> A in a Boolean topos €, then A= 1. 


Proof (i) Note first that if we are given morphisms 


PA z Dos ore 


expressing PA as a subquotient of A, then on applying the covariant and 
contravariant power-object functors we obtain 


Pg Af 
PPA 2. PBZ PA 
3g Pf 


expressing PPA as a retract of PA, by A2.2.5 and A2.3.6(iii); so we might as well 
start from the assumption that we have morphisms i: PA — A and r: A > PA 
with ri = 1p,. Now consider the closed term 


{r: A| A(z Eea Tr (x))} 


of type PA; let us denote this term by c (for Cantor). Is the sentence ¢ = 
(Ti (c) €4 c) satisfied in £? If so, then we have E H a(i Yc) Ea Tr (Ti (c))); 
but we also have € } ("r("i(c)) = ¢), and so we deduce that E H (¢ + 74), 
whence E H (ġ F L), i.e. E H 7¢. But from ~o we can similarly deduce —7¢, and 
hence L, so E | 1; but the latter is exactly the statement that € is degenerate. 

(ii) The hypothesis clearly implies that we have a morphism 1 — A, since 
there exists a morphism 14: 1 — Aĉ; so we need only prove that A — 1 is monic, 
i.e. that the sentence (Vz, y: A)(z = y) is satisfied in £. But since E is Boolean, 
it satisfies 


((Wz,y: A)(x = y) V (Az, y: A)>(z = y)) ; 


let U — 1 and V — 1 be the interpretations of the two halves of this disjunction. 
Interpreting the second half, we see that there is an epimorphism B —> V and a 
monomorphism B*(2) — B*(A) in E/B. But B*(2) is the subobject classifier of 
the latter topos, and so we obtain a monomorphism P(B*(A)) = B*(2)8 (4) — 
B*(A)® (^) & B*(A4)  B*(A). Thus by part (i) E/B is degenerate, so B = 0; 
hence V = 0 and U = 1, as required. o 
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The assumption of Booleanness in part (ii) of 4.1.8 cannot be omitted. We 
shall see in 4.2.16 below that there exist cartesian closed categories C (which 
may be taken to be small) containing nontrivial examples of objects A such 
that A = A^; and by A1.5.6(i) these examples are preserved by the Yoneda 
embedding C — [C°?, Set]. The following example is also of interest: 


Example 4.1.9 There exists a non-degenerate (non-Boolean) topos E with a 
natural number object N, such that NY is a subquotient of N. We construct € 
as the classifying topos Set[T] of a propositional geometric theory T, namely the 
theory of ‘partial surjections from N to NN’, which is (the case A = NN of) the 
intersection of the two theories considered in C1.2.8 and C1.2.9. Explicitly, we 
take a signature with primitive propositions p(n, f) indexed by pairs consisting 
of a natural number n and a function f: N — N, with the axioms 


((p(n, f) A p(n, g)) F L) 


whenever f # g, and 
(TEV {p(n, f) | 2 EN} 


for each f € NN. We shall also write To for the subtheory of T having only the 
first group of axioms, i.e. the theory of partial maps N — NN. 

It is easy to see that the classifying topos of To may be identified with 
the functor category [P°?,Set], where P is the poset of finite conjunctions 
Av, p(ni, fi) which are consistent (i.e. satisfy n; # nj; whenever i # j), ordered 
by entailment. Equivalently, we may think of P as the poset of partial func- 
tions N — N^ with finite domain, ordered by the relation of extension, as in 
A2.1.11(g). Moreover, the coverage on this poset generated by the second group 
of axioms is exactly that considered in A2.1.11(g): explicitly, for each f: N — N, 
each finite partial function ¢: N — N” is covered by the set of all ¢’ which extend 
@ and have f in their image. We note that these covers are all inhabited, so 
E = Set|T] ~ Sh(P,T) is a dense subtopos of [P°?,Set] ~ Set/To], and in 
particular non-degenerate. Moreover, the generic T-model expresses y* (N) as a 
subquotient of y*(N) in E, where y: E — Set is the unique geometric morphism; 
and we know that y*(N) is the natural number object of E, by A4.1.14. But we 
also know that the exponential N” in [P°P, Set] is simply the constant functor 
with value NN, by A1.5.6(ii); and this constant functor is a sheaf by A2.1.11(g), 
and is therefore y*(N%). Moreover, the inclusion Sh(P,T) — [P°?,Set] is a 
cartesian closed functor by A4.2.9, so this object must also be the exponential 
NN as computed in the subtopos. (Alternatively, we could argue using C3.3.11(c) 
that € is a locally connected topos, and so y* preserves exponentials.) 


We remark that the effective topos (which we shall study in Section F2.2) 
provides another example of a topos in which NY is a subquotient of N; thus 
the main interest of 4.1.9 is in the fact that this phenomenon can occur in a 
Grothendieck topos. Note also that if we tried to replace the theory of partial 
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surjections by the theory of injections NN — N, as in C1.2.9 (that is, if we 
added the axioms (T + V{p(n, f) | f € NN} for each n € N), we would not 
obtain a locally connected topos; and in fact the exponential NY in this topos 
would be bigger than 7*(N) (and so would not occur as a subobject of N). 
Intuitively, this is because the presence of a generic monomorphism f: NN — N 
allows us to define ‘new’ functions N — N which were not present in our original 
topos of sets (for example, the function whose value at n is f applied to the 
constant function with value n), whereas the partial surjection which is present 
in the topos of 4.1.9 does not allow us to define any new total functions. Similar 
remarks apply to the theory of surjections N — NN, considered in C1.2.8. In fact 
there can never be an epimorphism N —> NY in a non-degenerate topos, by the 
standard argument used to prove the existence of non-recursive functions: given 
any morphism g: N > N”, the composite 


(9, 1) 
Ves Ne Sy 
transposes to yield a morphism 1 > N~“ disjoint from the image of g. 
As another example of the uses of the canonical signature, we prove the 
Knaster—Tarski fixed-point theorem and use it to discuss the Cantor—Bernstein 
theorem (fulfilling a promise made in B2.3.4). 


Lemma 4.1.10 (Knaster—Tarski Theorem) Let A be a complete internal 
poset in a topos, and suppose f: A— A is an order-preserving map. Then f has 
a fixed point; that is, there exists a: 1 — A such that fa =a. 


Proof Consider the subobject Q — A defined by the term 
{z: A] (2,"f'(x)) ET Ar}, 


where Ay > A x A is the order-relation on A. (Elements of Q are commonly 
called pre-fized points of f.) Let a: 1 — A be the composite 
rQ" V 
1 ———_> PA ——> A, 
where V is the join map of B2.3.9(ii). Then it is easy to verify successively that 


fa is an upper bound for Q, that a is a prefixed point (i.e. it factors through 
Q — A), that fa is a prefixed point, and finally that a is a fixed point. a 


Of course, the fixed point constructed in the proof of 4.1.10 is the unique 
largest fixed point of f. We may also construct the unique smallest fixed point, 
by reversing the order relation wherever it appears; and indeed it may be shown 
that the subobject of fixed points (i.e. the equalizer of f and 14) is itself a 
complete poset in the ordering it inherits from A. 


Corollary 4.1.11 (Cantor—Bernstein Theorem) Suppose given monomor- 
phisms f: A= B and g: B> A in a Boolean topos. Then there exists an 
isomorphism h: A — B. 
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Proof Consider the composite 


pA ope Spee A 
This is clearly order-preserving since Jf and dg are order-preserving and ~ is 
order-reversing, so by 4.1.10 it has a fixed point a9: 1 —> PA. If we write Ap — A 
for the subobject named by ag, and A, — A for its complement, then it is 
clear that f maps Ap isomorphically onto a subobject Bo of B, and g maps 
the complement Bı of Bo isomorphically onto A,. So we obtain our required 
isomorphism h by combining the restriction of f to Ap with the inverse of the 
restriction of g to Bı (and recalling that A is the coproduct of Ap and A). O 


The assumption of Booleanness is not necessary for the conclusion of 4.1.11 
to hold. For example, in a topos of the form [C, Sets] where C is a finite category, 
every object is Dedekind-finite (that is, every monic endomorphism of an object 
is an isomorphism), and so the Cantor—Bernstein theorem holds vacuously in 
the sense that the monomorphisms f and g in the statement must already be 
isomorphisms. On the other hand, the theorem does not hold in all toposes: 


Lemma 4.1.12 Let E be a topos with a natural number object N. If the 
Cantor-Bernstein theorem holds in E, then every subobject of an object of E 
is an N -indexed union of complemented subobjects. 


Proof Let A’ — A be an arbitrary subobject in E€. Consider the objects 
B = N x A and C = BIA’: we clearly have a monomorphism n: B =C, 
but we also have a monomorphism C — B induced by s x 1: B —> B and the 
composite of A’ — A with o x 1: A —> B (cf. A2.5.8). However, if we have an 
isomorphism B — C (or even an epimorphism), then the pullback of v2: A’ — C 
along it is a complemented subobject of N x A (that is, an N-indexed family of 
complemented subobjects of A) whose union is A’. o 


Since it is easy to find toposes containing subobjects which are not countable 
unions of complemented ones (for example, the Sierpiński topos [2, Set]), we see 
that the hypothesis of Booleanness cannot be omitted entirely from 4.1.11. In 
fact B. Banaschewski and G. Brümmer [55] have shown that, if we strengthen 
the conclusion of the Cantor—Bernstein theorem by requiring that he < (fe Ug°) 
in the poset of relations A % B (cf. A3.1.3), then the theorem holds in a topos 
E with a natural number object iff € is Boolean. 


Suggestions for further reading: Banaschewski & Brümmer [55], Kock & 
Reyes [640], Lambek [667], Lambek & Scott [682]. 


D4.2 )-Calculus and cartesian closed categories 


We now illustrate the general framework which we set up in the previous sec- 
tion by describing the equivalence between theories in the typed A-calculus and 
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(small) cartesian closed categories. Historically, the A-calculus was first intro- 
duced (by A. Church [241]) as an untyped ‘theory of functions’ which it was 
hoped would provide a foundation for mathematics: the idea was that every 
term in the calculus should denote a function defined on the whole universe of 
mathematical discourse, so that it could be applied to any other term (or indeed 
to itself) to yield a meaningful result. For a detailed account of the untyped 
A-calculus and its history, see [67]. We shall encounter it in Chapter F2 (and see 
also 4.2.15 below), but for the present we shall content ourselves with describing 
the ‘simply typed’ A-calculus, whose aims are much more modest. (Actually, 
what we describe is not even the simply typed )-calculus as originally conceived: 
the latter did not include product-types, but it was soon realized that they 
could be consistently added - if you have function-types, then the product-type 
A x B is implicitly present, because [A x B-—>C] is canonically isomorphic to 
[A—[B—C]] for any C - and from the categorical point of view it is certainly 
more natural to have the products present from the outset.) 

A A-signature, then, is a signature © which has the type constructors (i), 
(ii) and (iii) of Definition 4.1.1; in addition it may have primitive function sym- 
bols, but the set =-Rel of primitive relation symbols is required to be empty. 
The A-terms over © are those which can be constructed using rules (i)-(vi) of 
Definition 4.1.2; however, the only formulae which we allow in the A-calculus are 
atomic equalities (s =4 t), where s and t are terms of the same type A. Thus 
the A-calculus is ‘logic-free’ in the sense that it contains no logical connectives 
or quantifiers. 


Remark 4.2.1 From another point of view, that of the so-called Curry- 
Howard 1somorphism or ‘propositions-as-types paradigm’, what has happened 
to the logic in the A-calculus is that it has been exported from the formulae into 
the type structure. From this viewpoint, we think of the types over a A-signature 
as ‘propositions’ (the sorts being ‘primitive propositions’ and the binary oper- 
ations x and [—] corresponding to the logical operations of conjunction and 
implication), and the function symbols of © become the ‘axioms’. A A-term- 
in-context T.t is then thought of as a ‘derivation’ of the type A of t from the 
‘hypotheses’ B,,...,B, which are the types of 71,...,2%,; and the term con- 
structors of 4.1.2(i)-(vi) correspond to operations on derivations. The equalities 
which, from the other point of view, are the axioms of our A-theory, become 
assertions that two derivations are ‘essentially the same’. 


A A-theory over © is a set of formulae-in-context, i.e. expressions of the form 
E.(s =, t), where Z is a context containing all the variables occurring free in s$ 
or in t. If we wished to maintain our tradition of axiomatizing theories by means 
of sequents, we should have to write these in the form (T Fz (s =, t)), but since 
all the sequents we ever consider in the A-calculus are of this form we might as 
well omit the T and the H. We shall also, as usual, omit the type subscripts from 
equality signs whenever we can safely do so; and we shall also feel free to omit 
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the contexts when (as is almost always the case here) they are the canonical ones 
for the formulae which follow. 


Definition 4.2.2 The rules of inference of the typed A-calculus are as follows: 
as usual, the following clauses have the standing hypothesis that the contexts 
appearing in them contain all the free variables in the terms which follow, and 
each rule is supposed to apply at all appropriate types. 


(a) The only structural rule (cf. 1.3.1(a)) is the substitution rule 


Z.(s =t) 
¥.(s[r/a] = t[r/z}) 
where r is any string of terms of the same length and type as 7, and ¥ is 


any context including all the free variables occurring in 7. (Note that, as in 
1.3.1(a), this rule includes the weakening rule, but not its converse.) 


(b) The equality rules consist of the axiom z.(x = x), the rules 


z,y.(x=y) and 2,y,z.(c=y) 2£,y,2.(y =z) 
T, y. (y = T) T, Y, Zz. (£ = 2) 
and the rule 
T. (sı = ti) T. (Sn = tn) 


£.(r[8/y] = r[t/9)) 


where 7 is a context containing all the free variables in r. (Note that, since 
we no longer have finite conjunction in our language, we have had to include 
the symmetry and transitivity of equality as rules of inference, rather than 
deducing them as in 1.3.1(6).) 


(c) The rules for product types are the axioms z.(z =, *) where z has type 1, 
and 


z,y.(fst((z,y)) =4 z), 2,y.(snd((z,y)) =B y) and 
z.((fst(z), snd(z)) =4xB 2) 


where x,y,z have types A, B, A x B respectively. 


Cc 
Ne” 


The rules for function types are the two axioms Z.(app(Ay.s,t) =B s{t/y]) 
(the @-rule) and £.(Ay.app(t, y) =,448] t) where y is a variable of type A 
not occurring free in t (the 7-rule), plus the rule 


T, y. (s =B t) 
£E.(Ay.s =(A+B] Ay .t) 


which ‘strengthens’ the third equality rule by eliminating the bound variable 
y: A from the context of its conclusion. 
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If © is a A-signature, then we may consider -structures (as in 4.1.4) in 
any cartesian closed category C; and, given such a structure M, we may use 
clauses (i)—(vii) of Definition 4.1.5 to interpret the A-terms-in-context over © as 
morphisms of C. Of course, we say an equation-in-context 7.(s = t) is satisfied in 
M if the interpretations [%.s],4 and [Z.t] are equal, and we say M is a model 
of a A-theory T if it satisfies all the axioms of T. We have the usual Soundness 
Theorem: 


Proposition 4.2.3 If an equation-in-context £.(s=t) is derivable in a 
A-theory T, then it is satisfied in all models of T in cartesian closed categories. 


Proof As usual, we simply have to check that each of the rules of 4.2.2 pre- 
serves validity, i.e. that if its hypotheses are all satisfied in some b-structure 
M then so is its conclusion. Most of the axioms were already commented on in 
4.1.7; the remaining axioms and rules are equally straightforward. o 


To obtain a converse to 4.2.3, we have to construct a syntactic category for 
an arbitrary A-theory T. This is actually very much easier than the construction 
in Section D1.3: because of the logic-free nature of the A-calculus, there is much 
less that needs to be constructed. In fact we take the objects of our category Cr 
to be simply the types of the signature ©, and morphisms A — B are equiva- 
lence classes [x.t] of terms-in-context, where z: A and t: B, and the equivalence 
relation is T-provable equality (i.e. [z.s] = [x.t] iff 2.(s = t) is provable in T). 
(Of course, we also identify a-equivalent terms-in-context, including the identi- 
fication [x.t] = [y.t[y/z]].) The identity morphism A — A is simply [z.z], and 
composition is given by substitution: the composite of 


[x.t] 5 [y.s] 5 


is [zx . s[t/y]]. The substitution and equality rules easily imply that this is indepen- 
dent of the choice of the representatives s and t. To verify that it is associative, we 
need to observe that r[s/z][t/y] is actually the same term as r[s{t/y]/z] provided 
the variable y does not occur free in r. 


Lemma 4.2.4 The category Cr just defined is cartesian closed. 


Proof The terminal object of Cr is the type 1; for any object A, the unique 
morphism A — 1 is [x.*]. Similarly, the product of A and B is A x B; the 
product projections are [z.fst(z)] and [z.snd(z)] (where z: A x B), and the mor- 
phism C — A x B induced by [w.s]: C — A and [w.t]: C > B is [w.(s,t)]. 
The exponential B4 is [A—B]; the evaluation map [A> B] x A > B is 
[w.app(fst(w),snd(w))], and given any morphism [z.t]: C x A > B, its trans- 
pose C > [A> B] is [w.Azx.t[(w, x) /z]]. The verification that all of this works 
is Straightforward from the rules in 4.2.2. O 
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We should perhaps re-emphasize at this point that in the statement of 
Lemma 4.2.4, and indeed throughout this section, the phrase ‘cartesian closed 
category’ means a category having finite products and exponentials, but not 
necessarily any other finite limits (since the logic-free nature of the A-calculus 
means that we do not have any means of constructing such limits syntactically). 
In the terminology introduced in Section A1.5, we are here dealing with ‘improp- 
erly’ cartesian closed categories, in contrast to what we have been doing almost 
everywhere else in this book. 


Proposition 4.2.5 The category Cr contains a %-structure My which satis- 
fies exactly those equations-in-context that are provable in T. Moreover, for any 
cartesian closed category D, there is a bijection between natural isomorphism 
classes of cartesian closed functors Cr — D and isomorphism classes of T-models 
in D, which is induced in one direction by the mapping F > F(Mr). 


Proof Again, this is all straightforward verification. The structure Mr sends 
each base sort to itself, and each primitive function symbol f: A > B to |x. f(x)], 
where x: A. An easy induction then shows that for any term-in-context x.t, we 
have [z.t]m = [z.t]. (Of course, the language contains terms whose contexts 
have more than one variable, but we can reduce to the latter by observing that 
we always have [z, y.t] = [w.t[fst(w), snd(w)/z, y]] where w is a variable of the 
appropriate product type.) Hence the equations-in-context satisfied in Mr are 
exactly those provable in T. Given a T-model N in an arbitrary cartesian closed 
category D, the corresponding functor Fy : Cr > D sends A to NA, and [z.¢] to 
[x .t]v; it is clear that Fy is a cartesian closed functor, and that Fy(Mr) = N. 
In the opposite direction, since any cartesian closed functor F:Cr —> D must 
preserve interpretations of arbitrary A-terms-in-context, it is easily seen to be 
naturally isomorphic to Fy where N = F(Mr). o 


From 4.2.5, we may deduce various forms of the completeness theorem for 
A-theories: for example, 


Corollary 4.2.6 Let T be a \-theory, and ¢ an equation-in-context over the 
signature of T. If @ is satisfied in all \-models in toposes of the form |C, Set], 
then it is provable in T. 


Proof Combine 4.2.5 with the fact (A1.5.6(i)) that the Yoneda embedding 
Cr — [C;?, Set] is a cartesian closed functor (and faithful, so that it reflects the 
validity of equations). o 


Remark 4.2.7 However, we do not have a ‘classical completeness theorem’ 
for A-theories in terms of models in Set, or even in Boolean toposes. Consider 
the theory over the signature with one base sort A and two function symbols 
r: A — [A— A] and i: [A—A] — A, with one axiom asserting that ri is the 
identity. In any model M of this theory in a Boolean topos, we must have 
MA =S 1 by 4.1.8(ii), so the equation-in-context z, y.(z =, y) will be satisfied. 


956 HIGHER-ORDER LOGIC D4.2 


But in 4.2.16 below, as we remarked after 4.1.8, we shall encounter cartesian 
closed categories containing objects A such that A^ is a retract of A and A > 1 
is not monic; so this equation is not provable in the theory. 


In the converse direction to 4.2.5, we have 


Example 4.2.8 For any small cartesian closed category C, there is a A-theory 
Tc such that, for any cartesian closed D, isomorphism classes of Te-models in D 
correspond bijectively to natural isomorphism classes of cartesian closed functors 
C — D. To construct Te, we take a basic sort "A? for each object A of C, anda 
function symbol T f?: "A? —'"B? for each morphism f: A — B in C. (The use 
of the notation TA? is inescapable here, since we need to distinguish between 
the types "A x B? and TA? x TB?, and so on.) In order to ensure that the 
‘formal’ product and function types in © coincide with the ‘actual’ products and 
exponentials in C, we need a further collection of function symbols, all of which 
will be denoted by ¢ (since they ‘represent’ identity morphisms; strictly speaking, 
they should be decorated with appropriate subscripts to distinguish them from 
one another, but in practice they are unlikely to be confused). Explicitly, we have 
a function symbol +: 1 — "17 (where the ‘1’ on the right is the terminal object of 
C, and that on the left is the empty product type); for each pair of objects (A, B), 
we have :"A™?x"B1?—"A x B7, and we also have t: ["A7>"B] — "B41. 
The theory Te has the following axioms (where we omit the contexts, since they 
are all canonical): 


(a) for each object A, ("14 ‘(r) =r ,7 z£), and for each commutative triangle 


(OCF (e)) =r Th (2); 
(b) the axiom (4(*) =r ,1 x), and for each pair of objects (A, B), the three axioms 
("71 (e(w)) =r 47 fst(w)), (“m2 (e(w)) =r pn snd(w)) and 
(m (2), 12 "(2))) =r 4y B 2) 


where w and z have types" A’x "B? and" A x B? respectively, and 7, and 
T2 are the product projections in C; 


(c) for each pair of objects (A, B), the axioms 
(Fev(u((E(w),2))) =" p7 app(w,z)) and (i(Ax.Tev(e((z,2)))) =F gam 2) 


where w, z, z have types ["A'’—" B1], TA? and " B4" respectively. 
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It is easy to see that, for any cartesian closed category D, a d-structure in 
D which satisfies the axioms in (a) is (up to isomorphism) ‘the same thing 
as’ a functor F': C — D (together with some chosen morphisms lp — F(le), 
FA x FB = F(A x B) and FBF^ — F(B4), about which the axioms in (a) 
make no assertions). Moreover, it satisfies the axioms in (b) iff the first two types 
of chosen morphisms are inverse to the canonical comparison maps (equivalently, 
F preserves finite products); and, given that this happens, it satisfies the axioms 
in (c) iff the chosen morphisms of the third type are inverse to the exponential 
comparison maps 04,5: F(B4) — FB¥4 (equivalently, F is a cartesian closed 
functor). 


Remarks 4.2.9 (a) If we start from an arbitrary small cartesian closed cate- 
gory C, construct the theory T¢ as in 4.2.8, and then build the syntactic category 
of the latter, we will arrive at a category equivalent to C. (It is not isomorphic 
to C, since it has many more objects; but from every object of the syntactic cat- 
egory we have a canonical isomorphism to one of the form "A! where A € ob C, 
namely an appropriate composite of (morphisms denoted by) ¢’s.) Similarly, if 
we start from a A-theory T, build its syntactic category Cr and then construct a 
new theory from the latter as in 4.2.8, the new theory will be ‘Morita-equivalent’ 
to the old in the sense that there will be natural bijections between the isomor- 
phism classes of their models in arbitrary cartesian closed categories. Thus we 
have shown that (up to equivalence) small cartesian closed categories are ‘the 
same thing as’ Morita-equivalence classes of -theortes. 

(b) However, we may also construct a A-theory from something much less 
than a cartesian closed category. Starting from a directed graph G (as defined 
before 2.1.1), we may form the signature whose sorts are the vertices of G and 
whose function symbols are the arrows of G; if we write T(G) for the empty 
theory over this signature, then T(G)-models in a cartesian closed category D 
are the same thing as graph morphisms G — D (cf. 2.1.4(a)). Alternatively, if 
G is actually a (small) category, then we may take the axioms of 4.2.8(a) to 
obtain a theory whose models are functors defined on G. Thus the construction 
of the syntactic category Cr in this case yields a left (pseudo-)adjoint for the 
forgetful functor from (small) cartesian closed categories to small categories. 
(To make this statement 2-categorically respectable, we must here regard Cat 
and the category of small cartesian closed categories as being made into ‘locally 
grouplike’ 2-categories: that is, their only 2-cells are natural isomorphisms. This 
is because, as we observed after 4.1.4, a non-invertible natural transformation 
between functors G <3 C will not give rise to a natural transformation between 
the corresponding cartesian closed functors Croo) = D.) 


An important operation on A-theories is that of adding a new function symbol 
to the signature (without adding any new axioms to the theory ~ though, from 
the point of view described in 4.2.1, this operation is precisely that of adding 
a new axiom to the logical system represented by the type structure). Since 
the effect of adding a function symbol f: A — B is precisely the same as that 
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of adding f: 1 > B4, we might as well restrict ourselves to the case when the 
domain of the function symbol is 1. What effect does this have on the syntactic 
category of our theory? 


Lemma 4.2.10 Let T be a d-theory over a signature ©, and let Ta] denote 
the same theory over the signature obtained by adjoining a new function symbol 
a:1 — A tod (where A is a particular type of ©). Then the syntactic category 
Cia] is equivalent to the full subcategory (Cr) a of the slice category Cr/A whose 
objects are those of the form A*B, B € ob Cr (cf. A1.5.4). 


Proof Clearly, Cyjq) has the same objects as Cr; and morphisms B > C 
in Cyjq) are provable-equality-classes of terms-in-context y.t over the enlarged 
signature J [a], where y: B and t: C. But we may identify such terms-in-context 
with terms-in-context zo, y.t’ over ©, where qo is a particular variable of type A 
(which is distinct from y even if A = B, and which not allowed to become bound), 
by the simple device of replacing the constant a by the variable xro wherever it 
appears in t. Since the axioms of T make no mention of a, it is clear that this 
does not affect the relation of provable equality. Thus we have established a 
bijection between morphisms B — C in Cria} and morphisms A x B — C in Cr- 
equivalently, morphisms A*B — A*C in Cy/A. It is straightforward to verify 
that this bijection respects composition, and so yields an isomorphism between 
Cra] and the Kleisli category of the comonad A x (—) on Cr; but, as we saw in 
A1.5.4, the latter is equivalent to the full subcategory of C/A described in the 
statement. o 


In view of 4.2.9(a), we may interpret 4.2.10 as saying that, given a small 
cartesian closed category C and an object A of C, the category Ca of A1.5.4 is 
the free cartesian-closed-category-with-an-element-of-A generated by C; that is, 
given any cartesian closed functor F: C — D and any morphism a: 1 —> FA in 
D, there is a unique extension (up to canonical isomorphism) of F to a cartesian 
closed functor F: C4 — D which sends the diagonal map A —> A x A (regarded 
as a morphism 1 — A*A in Ca) to a. In fact it is not hard to prove this directly, 
if we identify C4 with the Kleisli category of A x (—): we have F(B) = F(B) 
for each object B and, given a morphism f: B — C in C4 (that is, a morphism 
Ax B—C in C), its image under F is the composite 


axl F(f) 
FB 1 x FB ——> FA x FB — > FC. 


Of course, it requires verification that this is a (cartesian closed) functor; but 
that is straightforward. 

Lemma 4.2.10 also has an interpretation in terms of the propositions-as-types 
paradigm of 4.2.1, as a ‘functional completeness’ result: it says that if you add a 
new proof a of A to the deduction system represented by (the type structure of) 
a given A-theory, then any deduction of C from B in the new system is uniquely 
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representable as a ‘polynomial in a’ whose ‘coefficients’ are deductions in the old 
system. For more details of this point of view, see [682]. 

In Section A2.5 we studied the notion of natural number object, and we saw 
(cf. A2.5.3) that the notion makes sense in any cartesian closed category. It is 
therefore of interest to consider what happens if we add the type N to the typed 
A-calculus. Unfortunately, because of the logic-free nature of the A-calculus, we 
cannot fully express the idea that N is a natural number object in it: we can 
assert the existence of recursively defined morphisms, by means of the term 
constructor iter, of 4.1.2(ix), but we cannot assert their uniqueness simply by 
writing down equations. However, by strengthening the notion of iterator, we 
can repair at least some of the damage. 


Definition 4.2.11 (a) By a AN-signature, we mean a )-signature containing 
the distinguished type N, and having the term-constructors zero and succ defined 
in 4.1.2(viii), plus a term-constructor reCy n (the recursor) which operates as 
follows: given variables x: A (where A is arbitrary) and n: N, plus terms s: A, 
t: A and u: N such that neither x nor n is free in t or u, recy n(s,t, u) is a term 
of type A, in which all occurrences of x and n are bound. 

(b) The rules of inference of the AN-calculus are those of 4.2.2, plus the 
axioms y, Z. (recy n(s, y, Zero) = y), 


M, Y, Z. (rece,n(s, y, succ(m)) = s[recz .n(s,y,m), m/z, n}) 


and m. (recy n(succ(x), zero, m) =m). (These correspond to the special case 
A = 1 of the sequents of 4.1.7(iv): in the first two, Z is a string containing all the 
free variables of s other than z or n.) 


Essentially, the recursor allows the construction of terms defined by recur- 
sion as in A2.5.2, and not just as in A2.5.1. The reason why we need this is 
that the proof of existence of the recursively defined morphism in (the first part 
of) A2.5.2 used both the existence and uniqueness clauses of A2.5.1. The follow- 
ing proposition is now entirely straightforward, and we omit the details of the 
proof. 


Proposition 4.2.12 If T is a theory in the AN-calculus, then the syntactic 
category Cr (constructed as in 4.2.4) is cartesian closed and contains a weak 
natural number object, that is an object N equipped with morphisms o: 1— N 
and s: N — N which satisfies the ‘existence’ part of the conclusion of A2.5.2, 
but not the ‘uniqueness’ part. B 


Using the recursor, we may convert the commutative diagrams of A2.5.4 into 
linguistic definitions of the arithmetic operations on N: for example, we define 
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addition, the predecessor map and truncated subtraction by 


plus(z, y) = recz »(succ(z),z,y) , 
pred(r) = recz w(w, zero, x) , and 
tsub(x, y) = rec, w(pred(z), z, y) . 


However, since we lack the uniqueness clause of A2.5.2, we cannot prove that 
these operations satisfy the usual laws of arithmetic. We may of course add 
axioms to any AN-theory to assert that particular laws hold; we shall be par- 
ticularly interested in two of them, namely z,y.(plus(z,y) = plus(y,z)) and 
x, y.(tsub(plus(z, y), y) = 2). 

The point of these two equations is that they tell us that the type N comes 
equipped with a Mal’cev operation, that is a ternary operation m(z,y,z) = 
tsub(plus(z, z), y) satisfying the equations m(z,y,y) = x and m(z,z,y) = y. 
(Malcev operations play an important rôle in universal algebra; see [1114], for 
example.) In our present context, we shall say that a type A over a AN-signature 
is a Mal’cev type (relative to a particular \N-theory T) if there exists a term m of 
type A, having three free variables x, y, z of type A, for which the equations-in- 
context x, z.(m[x/y] = z) and x, z.(m[z/y] = x) are derivable in T. Thus, if the 
two equations at the end of the last paragraph are derivable, N is a Mal’cev type. 

The reason for being interested in Mal’cev types is that, if A is such a type, 
then any term representing a function N — A has a recursive definition. Specif- 
ically, suppose t = t(n) is a term of type A involving a variable n of type N, and 
let s = s(x,7) be a term of type A with variables z,n of types A, N respectively. 
(We are departing from our usual custom of not displaying the free variables in 
a term, in order to improve the readability of the expressions involving multiple 
substitutions which follow.) By the Uniqueness Principle for the pair (s,t), we 
mean the assertion that, if the equation-in-context n.(t(succ(n)) = s(t(n), n)) is 
derivable in our theory, then the equation n.(t(n) = rec, p(s(x, p), t(zero), n)) is 
derivable. 

Now suppose A is a Mal’cev type; let m(z,y,z) be a Mal’cev term for it. 
Given s and t as above, we define a new term 8(z,n) by 


8(z,n) = m(t(succ(7)), s(t(nr), n), s(x, n)) . 


Lemma 4.2.13 Let A be a Mal’cev type over a AN-signature. Then the 
Uniqueness Principle holds for all pairs (s,t) iff the equation-in-context 


n.(t(n) = recr p(s(x, p), t(zero), n)) 


is derivable for all pairs (s, t). 
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Proof First suppose the Uniqueness Principle holds for the pair (S,t). Since 
m is a Mal’cev term, we can derive 


n.(8(t(n),n) = t(succ(n))) 


by substituting t(n) for z in the definition of s. So the conclusion of the Unique- 
ness Principle is derivable; but that is exactly the equation-in-context displayed 
in the statement of the lemma. Conversely, suppose that this equation is deriv- 
able, and that we can also derive n. (t(succ(n)) = s(t(n),n)). Again using the fact 
that u is a Mal’cev term, we derive z,n.(S(z,n) = s(z,n)), so we may substitute 
s for § in the equation displayed in the statement, to deduce the conclusion of 
the Uniqueness Principle for (s, t). 0 


Corollary 4.2.14 Let T be a AN-theory in which the equations-in-contezt 
z,y.(plus(z, y) = plus(y,z)) and z, y.(tsub(plus(z, y), y) = z), and all equations 
of the form n.(t(n) = recy »(S(z, p), t(zero),n)) where the type of s and t is a 
Mal’cev type, are derivable. Then the full subcategory D of Cr whose objects 
are the Mal’cev types relative to T is a cartesian closed category with a natural 
number object. 


Proof D is closed under finite products in Cr, because if m(z,y, z) 
and m’(z’,y’,z’) are Mal’cev terms for types A and A’ respectively, then 
(m(fst(z), fst(y), fst(z)), m’(snd(x), snd(y),snd(z))) is a Mal’cev term for A x A’. 
Similarly, it is an exponential ideal, since if A has a Mal’cev term m(z, y, z) we 
may define one for [B — A], namely Aw.m(app(z, w), app(y, w), app(z, w)) where 
w is of type B. So it is cartesian closed; and, as we have already noted, the two 
arithmetical equations in the statement ensure that it contains the weak natural 
number object N of Cr. But then 4.2.13 ensures that N is actually a natural 
number object in D. E 


However, we cannot obtain every (small) cartesian closed category with a 
natural number object in this way, since it is not true in general that every 
object in such a category admits a Mal’cev operation. 

We conclude this section by considering how small a cartesian closed category 
C can be. Clearly, if it has only one object, then that object must be terminal, 
and the category is not very interesting. Can it have just two objects? If so, 
then one of them is the terminal object 1, and the other (A, say) must satisfy 
either A x A = A = A^ or Ax A = A and A^ = 1. The second possibility 
leads to the conclusion that A is subterminal, and hence that C is a two-element 
Heyting algebra, which is again uninteresting; but the first possibility can lead 
to interesting examples. We note that if A = A^ then 1 is a retract of A (since 
we always have a morphism 1,4: 1 — A“), and so we might as well forget about 
the object 1 for the moment, since we can recover it when necessary by splitting 
an idempotent (cf. A1.1.8). Thus we are reduced to considering a monoid M (the 
monoid of endomorphisms of A) equipped with the following structure. 
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Definition 4.2.15 By a C-monoid we mean a monoid M equipped with three 
distinguished elements p, q and e, a binary operation (—,—) and a unary oper- 
ation (—)* satisfying the following equations (in which we denote the monoid 
operation by juxtaposition): 


(i) p(z,y) = z and q(z,y) = y for all x and y. 
(ii) (pz,qz) = z for all z. 
(iii) e(a*p,q) = x and (e(yp,q))* = y for all z and y. 


The idea is that p and q are the product projections A = Ax A= A, (-,-) 
is the binary operation which converts a pair of morphisms A 3 A into a single 
morphism A — A x A = A, (—)* converts a morphism A = A x A — A into 
its transpose A — A“ = A, and e is the evaluation map A = A^ x A > A. 
Since the theory of C-monoids is algebraic, we may clearly construct nontrivial 
examples. In particular, given a C-monoid M, we may construct the ‘polynomial 
C-monoid’ M [a] freely generated by M together with one extra element a. The 
functional completeness result 4.2.10 can be used to give an explicit description 
of the latter: every element of M |a] is of the form e(z(aq)*, 1) for a unique x € M. 


Lemma 4.2.16 Let M be a C-monoid. Then the Cauchy completion M[E] of 
M (cf. A1.1.9) is cartesian closed. 


Proof The terminal object is taken to be q*. To verify that this is indeed 
idempotent in M, we need the following calculation: 


y*z = (e(y* xp, q))* 

(e(y*p(zp, q), a(x, q)))* 
= (e(y*p, g)(zp, q))* 

= (y(xzp,q))* , 


in which all steps except the third follow directly from the axioms of 4.2.15, 
and the third is an easy exercise. Substituting q for y, we obtain g*z = q*, so 
in particular q* is idempotent. But this calculation also tells us that q* is the 
unique morphism u —> q* in M [E], for any idempotent u. 

The product of two idempotents u and v is (up, vg), with projections given by 
up and vq; if x: w > u and y: w — v are morphisms of M[E], then the induced 
morphism w — u x v is given by (x,y). Similarly, the exponential v” is defined 
to be (ve(p, ug))*; the evaluation map v” x u — v is ve(p, ug), and the transpose 
of x: w xX u — v is simply x*: w > v”. The verification that all of this works is 
straightforward. o 


Of course, M itself appears as the monoid of endomorphisms of the idempo- 
tent 1 in M[Č]; this object satisfies 1 = 1 x 1 = 1! (though it is not the terminal 
object, unfortunately for the notation!), and so the full subcategory of M [Č] on 
this object and the terminal object q* is also cartesian closed. 

C-monoids correspond to the untyped à-calculus, which is defined just as in 
4.2.2 except that we omit all the type information (and all references to the 
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particular type 1 and the corresponding constant *). That is, we have a stock of 
variables, plus a possibly empty set of constants, which we combine into terms 
using the term constructors fst, snd, (—, —), Az .— and app, and axioms and rules 
of inference as in 4.2.2, except for the first axiom of 4.2.2(c). (Once again, this is 
not the untyped A-calculus as originally conceived by Church, since the latter did 
not have the term constructors fst, snd or (—,—); Lambek and Scott [682] call 
what we have just described the extended »-calculus.) Given a theory T in this 
calculus, we may construct its syntactic C-monoid My, whose elements are the 
provable-equality-classes of closed terms, with multiplication defined by [s]{t] = 
[Ax.app(s, app(t, z))] and identity element [Ar.z]. The C-monoid structure is 
defined by setting p = [Az.fst(z)], q = [Az .snd(x)], e = [Az . app(fst(£), snd(x))], 
and so on. For more information, we refer the reader to [682]. 


Suggestions for further reading: Barendregt [67], Crole [266], Lambek[668, 
673], Lambek & Scott [682], Scott [1104]. 


D4.3 Toposes as type theories 


We are now ready to describe the theories which correspond to (small) toposes in 
the same way that -theories correspond to cartesian closed categories. Normally 
these would be called ‘higher-order intuitionistic type theories’, or some similar 
phrase; however, taking our cue from the name ‘)-theory’, we shall simply call 
them 1-theories (the letter 7 standing for ‘topos’ or ‘type’, according to taste). 

A T-signature, then, is one which has the type constructors for finite products 
and power types, together with the term constructors of 4.1.2(iii), (iv) and (vii). 
For the moment, we shall not require the signature to have list types, but we 
shall put them in later as we did with the type N in Section D4.2. Since toposes 
are cartesian closed, we could if we wished put in function type constructors as 
well, but since we do not need them we shall keep the logic as simple as possible 
by leaving them out. In contrast to the \-calculus, we allow primitive relation 
symbols as well as function symbols in a 7-signature (though in practice we shall 
rarely use the former); and we have the full range of (finitary) logical connectives 
and quantifiers, as specified in 1.1.3(iii)-(x), available in our language, as well 
as all three types of atomic formulae described in 4.1.3. 


Definition 4.3.1 The rules of inference of the 7-calculus are all the first-order 
rules of 1.3.1(a)-(g), the product type axioms of 4.2.2(c), and the following 
axioms for power types: (T +, (w = {x: A | x €, w})) for w a variable of type 
PA, and ((z €4 {y: A | ¢}) 4kz,z o[z/y]) for ¢ any formula with free variables 
in the string Z, y. 


Remark 4.3.2 There is a good deal of redundancy in this presentation of the 
T-calculus: in fact we could eliminate all the logical connectives and quantifiers as 
Primitives of our language except for = and V. To see this let Q, as usual, denote 
the type P1; if t is a term of type Q, we shall abbreviate the formula (* €, t) 
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to t. (Informally, we think of terms of type 2 as being the same thing as formu- 
lae, since the axioms for power types ensure that we have a bijection between 
provable-equivalence classes of the latter and provable-equality classes of the for- 
mer; in the other direction, a formula ¢ is mapped to the term ¢ = {z: 1 | $}, 
where z is a variable not occurring in œ.) 

We now assert that the sentence (Vp: Q)p will do as a substitute for the 
constant L: to see this, we have to show that the sequent (L Hz ¢) of 1.3.1(d) is 
derivable, for an arbitrary ¢. But we have the following derivation: 


((vp)p Fij (Yp)p) 
((Vp)P Fp P) ((z € {y | o}) Fee lz/y)) 


((Vp)p Fz * € {y | o})) (Œœ € {y] o) Fe of*/y]) 
((Vp)p Fz ġ) 


where y and z are variables of type 1 not occurring in ¢. A similar argument 
shows that (Yp: Q)(p=>p) will do for T, that (¢ V y) may be replaced by 


(Vp: 2)((b=>p) > ((¥ =p) =>P)) 


(where p is a variable of type Q not appearing in ¢ or w), and that (¢ A w) may 
similarly be replaced by 


(Vp: Q)((b= (b= p)) =P) . 


(We remark that we have seen the above tricks for deriving finite disjunction 
from implication and universal quantification over Q before, in A1.6.4. However, 
the ones for finite conjunction did not appear there, since we were working in a 
context where we already knew that finite intersections of subobjects existed.) 
Finally, we may replace the existential quantification (dz: A)d by 


(Vp: Q)((Va: A)(¢ =P) =D) . 


This does not exhaust the possible simplifications of 4.3.1. We may also 
eliminate the term-constructors fst and snd from our language, since a subterm 
of the form fst(t), where t: A x B, may be replaced by a new variable z: A 
satisfying the formula (Jy: B)(t = (x, y)). We may even eliminate equality as a 
primitive symbol, replacing it by the ‘Leibnizian’ definition that (s =, t) means 
(Vw: PA)((s €4 w) & (t €4 w)), where & is defined in the usual way from > 
and A, and w is a variable not occurring in s or t. However, such ‘reductionist’ 
tricks run counter to the actual practice of mathematics: a bit of redundancy in 
one’s list of primitives rarely does any harm. 


By a 7-theory over a signature ©, we as usual mean a set T of sequents over 
2, to be thought of as the (non-logical) axioms of the theory. (However, since 
we have full first-order logic available in the t-calculus, we could alternatively 
axiomatize our theories by sets of sentences, replacing a sequent (@ Fz 7) by the 
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sentence (YZ)(ġ = Ņ). In practice, if a sequent of the form (T Fy, ¢) is derivable 
in a T-theory T, we shall tend to say simply that the sentence ¢ is derivable 
in T.) And we have the usual notion of a model for a 7-theory T in a topos £, 
namely a structure M for its signature (defined as in 4.1.4) which satisfies all 
the axioms of T. The Soundness Theorem is yet again straightforward: 


Proposition 4.3.3 Let T be a T-theory over a signature È. If a sequent o is 
derivable in T (using the rules specified in 4.3.1), then it is satisfied in any model 
of T in a topos. 


Proof As usual, we have to verify soundness of each of the individual axioms 
and rules of inference. But the soundness of the first-order axioms (in arbitrary 
Heyting categories) was verified in 1.3.2, and that of the axioms for product and 
power types in 4.1.7(i) and (iii). o 


To obtain a converse to 4.3.3, we seek as usual to construct a syntactic 
category Er from an arbitrary 7-theory T, and to prove that it is a topos (and 
that it contains a conservative model of T). With the extra logical structure now 
at our disposal, we may somewhat simplify the construction of this category, 
as compared with the first-order syntactic categories which we constructed in 
Section D1.4. In the first place, we note that, for the objects of our category, 
it suffices to consider (a-equivalence classes of) formulae-in-context x.o where 
the context is a single variable x; for, in any “-structure, the interpretation 
of z,y.@ will coincide with (or at least be canonically isomorphic to) that of 
z.@lfst(z), snd(z)/x, y] where z is a variable whose type is the product of those 
of x and y. (Similarly, a sentence ¢ in the empty context may be replaced by z.@, 
where z is a variable of type 1.) But then we may also exploit the correspondence 
between (provable-equivalence classes of) formulae in a context with one variable 
x of type A, and (provable-equality classes of) closed terms of type PA (which 
sends z.ġ to {x: A | o} and a closed term a to z.(z €4 a)), and thus regard 
the objects of our syntactic category as a-equivalence classes of closed terms 
whose type is a power type PA for some A. In this way, we may eliminate the 
contexts altogether from our description of Er. (In truth, this is only a very 
minor notational change: the object of the syntactic category which was denoted 
{x.@} in Section D1.4 will now be denoted {x | ¢}.) It will be convenient to 
have a notation for (the object of Er corresponding to) the particular closed 
term {x: A | T}, where A is a type of ©; we shall denote it by A. 

If a and b are closed terms of types PA and PB respectively, then a morphism 
a — bin Er will be represented by a closed term f of type P(A x B), satisfying 
the usual functionality axioms (suitably translated into our new terminology): 
in other words, such that the sequents 


(((t,y) € f) Fry ((x € a) A (y E b))), 
((x € a) Fe (Ay) ( (x,y) € f)) , and 
((((£,y) € fP) A(z, y) E F) Fru YY =y’)) 
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are provable in T. Two such terms f and f’ will of course represent the same 
morphism iff the sentence (f = f’) is provable in T; as usual, we shall write |f] for 
the provable-equality class of a closed term f. The composite of two morphisms 
[f]: @ — b and [g]: b  c is the provable-equality class of the closed term 


{2: Ax C | (3y: B)(((fst(z),¥) € F) A ((y,snd(z)) € 9))} - 


Just as in the first-order case, it is straightforward to verify that this composition 
is well-defined and associative, and that the identity morphism on an object a 
is given by [{z: A x A | ((fst(z) = snd(z)) A (fst(z) € a))}]; so we have 


Lemma 4.3.4 Er is a category. oO 


It is similarly straightforward to establish 
Lemma 4.3.5 Er has finite limits. 


Proof The terminal object may be taken to be I (or {x: 1 | (z = *)}, if you 
prefer); given an arbitrary object a: PA, it is easy to see that a term f: P(A x1) 
represents a morphism a — T iff it is provably equal to {z | fst(z) € a}. We 
define the product of two objects a: PA and b: PB to be the closed term 
{z: A x B | ((fst(z) € a) A (snd(z) € b))}; the first product projection a x b >a 
is represented by the term 


{w: (A x B) x A | ((fst(w) € a x b) A (fst(fst(w)) = snd(w)))} , 


with a similar expression for the second projection. Given morphisms [f]: c — a 
and [g]:c— b, where c: PC, the induced morphism ([f],[g]): ca x b is 
represented by the term 


{w: C x (A x B) | (((fst(w), fst(snd(w))) € f) A ((fst(w), snd(snd(w))) € g))} - 


Given two morphisms [f], [g]: a = b, the domain (e, say) of their equalizer is the 
object 


{x: A | (dy: B)(((x,y) E€ f) A ((z,y) € 9))} , 


and the monomorphism e > a is represented by {z: A x A | ((fst(z) = snd(z)) ^ 
(fst(z) € e))}. (As in 1.4.3, we note that the construction of equalizers depends on 
the choice of representatives f and g, not just on their provable-equality classes; 
but we could eliminate these choices by defining objects as well as morphisms to 
be provable-equality classes of terms.) As usual, the verification that all of this 
works is tedious but straightforward. 
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Lemma 4.3.6 Let [f]: @ — b be a morphism of Er. 


(i) [J] is an isomorphism iff the sequents 


((y € b) Fy (Az)((z,y) E f)) and 
((((z,y) E€ f) A ((2',y) € f)) Frs’ y (£ = 2’)) 


are provable in T. 


(ii) [f] is a monomorphism iff the second sequent above is provable in T. 


Proof (i) If the sequents above are provable, then the term f° = {z: B x A | 
((snd(z), fst(z)) € f)} is easily seen to represent a morphism b — a, which 
is inverse to f. Conversely, if [f][g] and [g][f] are both identity morphisms in 
Er then g is provably equal to f°, and so f must satisfy the sequents in the 
statement. 

(ii) Following the constructions of 4.3.5, we see that (the domain of) the 
kernel-pair of [f] is {z: A x A | (Ay: B)(((fst(z),y) € f) A ((snd(z),y) € fD}, 
and the second sequent is equivalent to saying that the terms representing the 
two projections from this object to a are provably equal. o 


Unsurprisingly, it is also possible to show that a morphism [f] is epic in Er 
iff the first of the two sequents of 4.3.6(i) is provable; but we shall not need this. 


Corollary 4.3.7 Leta: PA be an object of Ey. Then any subobject of a is 
isomorphic to one of the form |i]: a’ — a, where a’: PA is a term such that 
((z € a’) Fz (£ € a)) is provable in T and i is the term 


{z: Ax A | ((fst(z) = snd(z)) A (fst(z) € a’))} . 


Moreover, for two such subobjects [i]: a’ — a and {i'|: a” — a, we have a’ < a” 
in Sub(A) iff the sequent ((x € a’) Fz (x E€ a”)) is provable in T. 


Proof First, we note from 4.3.6(ii) that any morphism of the specified form is 
monic. Given an arbitrary monomorphism [f]: b — a, let a’ be the term {z: A | 
(Sy: B)((y, x) € f)}. Then it is straightforward to verify that f also represents 
a morphism b — a’ which is a factorization of |f] through [i], and that this 
factorization is an isomorphism. The final assertion is similarly straightforward, 
since any factorization of [i] through [i’] must be represented by the termi. O 


Proposition 4.3.8 Er is a topos. 


Proof After 4.3.5, it suffices (by A2.3.4) to show that Er has power objects. 
Given an object a : PA, we define its power object Pa to be the term 
{w: PA | (Vr: A)((z € w)= (z € a))}, of type PPA. The membership relation 
Ea is defined to be {z: PA x A | ((fst(z) € Pa) A (snd(z) € fst(z)))}, made into a 
Subobject of Pa x a as in 4.3.7. Given an arbitrary subobject c — b x a, which 
we may assume to be represented as in 4.3.7 (so that c is a closed term of type 
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P(B x A)), we define its name b — Pa to be the morphism represented by the 
term 


{z: Bx PA] ((fst(z) € b) A (snd(z) = {x: A | ((fst(z), 2) € c)}))}. 


It is straightforward to verify that this does represent a morphism, and that the 
latter is the unique morphism b — Pa for which there is a pullback square 


e ———— > Ea 


bx a —> Paxa 
in Er. o 


Remark 4.3.9 As we observed for the first-order case in 1.4.15, we could have 
been still more economical in our construction of Er, by exploiting the theory of 
allegories developed in Chapter A3 - that is, by constructing it as the category 
of maps of a suitable power allegory. For this approach, we begin by defining 
an allegory Ar, whose objects are simply the types of ©, and whose morphisms 
A% B are provable-equality classes of closed terms r of type P(A x B), ordered 
by [r] < [s] iff ((z € r) Fz (z € s)) is provable in T. The composite of two 
morphisms fr]: A + B and [s]: B + C is the provable-equality class of 


{z: A x C | (Jy: B)(((fst(z), y) € r) A ((y, snd(z)) € s))}, 
and the opposite of [r] is the class of 
{z: Bx A | ((snd(z), fst(z)) € r)} . 


It is straightforward to verify that Ar is an allegory, and in fact a division allegory 
in the sense of A3.4.1: given [r]: A + B and [s]: A + C, we define [s]/[r]: B + C 
to be the provable-equality class of 


{z: Bx C | (Wax: A)(((a, fst(z)) € r) => ((z, snd(z)) € s))}. 


Indeed, Ar is a power allegory in the sense of A3.4.5: the power object of an 
object A is none other than the power type PA, and the canonical morphism 
€4 PA A is the provable-equality class of {z | (snd(z) € fst(z))}. Ar is not 
in general tabular; but it is pre-tabular, since the hom-poset Ar (A, B) has a 
top element [{z: A x B | T}], which is tabulated by the product projections 
Ax B — A and A x B — B (we leave to the reader the task of defining these 
and verifying that they are maps). Its hom-posets also have bottom elements 
(the bottom element of Ar (A,B) being [{z: A x B | 1}]), so by A3.4.9 we 
obtain a topos by splitting the set K of cores in Ar and then cutting down to 
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the category of maps of Ar[Ñ]. And it is not hard to verify that, if we carry 
through the above constructions, we arrive at a category equivalent to Er as 
defined earlier in this section. 


Next, we define the canonical /-structure Mr in Er. For a base sort A, we 
define MrA to be A. If A is a compound type, it is not true that Mr.A is simply 
A, but an easy induction shows that it is always of the form {x: A | ¢} for some 
formula ¢ such that (T F» ¢) is provable (in the empty theory over ©), and 
hence it is canonically isomorphic to A. If f: A — B is a function symbol of ¥, 
we define Myf to be [{z: A x B | (f(fst(z)) = snd(z))}] (which is easily checked 
to be a morphism MA — MB in Er); similarly, if R — A is a relation symbol, 


MrR is {x: A | R(x)}, made into a subobject of M A as in 4.3.7. 


Lemma 4.3.10 
(i) If x.t is any term-in-contezt over ©, (where the context has just one vari- 
able x: A, and t: B, say), then [z.t] is the morphism MA > MB 
represented by the term {z: A x B | (snd(z) = t[fst(z)/z])}. 


(ii) Ifa.¢ is any formula-in-context over © (again, having a context with just 
one variable x: A), then [x.¢] m is the subobject of MA corresponding (as 
in 4.3.7) to the term {x: A | } of type PA. 


(iii) A sequent (blz Y) over È is satisfied in My iff it is provable in T. 


Proof (i) and (ii) are proved by a straightforward (simultaneous) induction. 
For (iii), we note that the axioms for product types enable us to replace any 
sequent (¢@ Fz Y) by a sequent (¢’ +, Y’) whose context has a single variable, 
such that the former is provable in T iff the latter is. The result is then immediate 
from (ii) and the last assertion of 4.3.7. E 


From (iii), we immediately deduce that M is a conservative model of T. Hence 
we have 


Corollary 4.3.11 (Completeness Theorem for r-calculus) Let T be a 
T-theory, and o a sequent over the signature of T. Ifo is satisfied in all models 
of T in toposes, then it is provable in T. C] 


Remark 4.3.12 Given a (finitary) first-order theory T, we may of course 
regard it as a 7-theory whose axioms all happen to lie in the first-order part of 
the language over its signature. Thus 4.3.11 almost contains an alternative proof 
of the result, already noted in 3.1.18, that validity in models in toposes is suffi- 
cient to determine provability in T. (‘Almost’ because the notion of provability 
has changed, in that we have added the axioms for power types: of course, we 
should not expect these to allow us to derive any new first-order conclusions 
from first-order axioms, but it is not a priori obvious that they do not.) 


Given an arbitrary topos F and a r-theory T, we write T-Mod(F) for the 
groupoid of T-models in F and isomorphisms between them. (The reason for 
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restricting to isomorphisms was explained after 4.1.4.) And we write Log for the 
2-category of toposes, logical functors and natural isomorphisms between them. 
Any logical functor F — G induces a functor T-Mod(F) — T-Mod(G); this is 
not quite trivial, because in our definition of a U-structure M we required that 
we should have equalities (not just isomorphisms) M(A x B) = MA x MB 
and M(PA) = P(MA), and a logical functor need not strictly preserve these. 
However, since our collection of types was built up recursively from the base 
sorts, we may define FM by setting F M(A) equal to F(M A) for each base sort 
A, and then for each compound type we have a canonical isomorphism from 
F(M A) to FM(A), across which we may transport the rest of the structure. 
Now we have 


Proposition 4.3.13 For any topos F and any T-theory T, the assignment F —> 
F(Mr) is part of an equivalence of categories 


Log (Er, F) ~ T-Mod(F) . 


Proof We construct a functor in the opposite direction, as follows. Given a 
T-model N in F, we define a functor Fy : Ey —> F by sending an object a of Er to 
the subobject of NA named by [a], : 1 — N(PA) = P(NA), and a morphism 
[f]: a — b to the morphism F(a) > Fy (b) whose graph is the object named by 
[ fl. (The soundness theorem 4.3.3 ensures that the latter is indeed the graph of 
a morphism Fy (a) — Fp (b), and that (up to isomorphism) it is independent of 
the choice of the representative f.) The verification that Fy is a logical functor is 
immediate from the definition of the topos structure on Er in 4.3.5 and 4.3.8 (and 
the soundness theorem, again), and the verification that the assignment N +> Fn 
is functorial is equally straightforward. Since any logical functor preserves the 
interpretations of arbitrary terms and formulae over ©, it is clear that if F is any 
such functor Er — F then we have isomorphisms F(a) = Fpqm)(a) for all objects 
a of Er, which form a natural isomorphism F = Fy where N = F(Mr). Similarly, 
if N is any T-model in F, we have a canonical isomorphism N & Fy(Mr). O 


Examples 4.3.14 (a) Proposition 4.3.13 can be thought of as saying that Er 
is the topos ‘freely generated’ by the T-model Mr. Even in the case when T is 
the empty theory over the ‘empty signature’, this is a nontrivial assertion: it says 
that the 2-category Log has an initial object (in the 2-categorical sense). (The 
‘empty signature’ is not literally empty: although it has no base sorts (and no 
primitive function symbols, or primitive relations apart from membership and 
equality), it has many types such as 1, P1, PP1, P1 x P1 and so on; thus its 
collection of terms and formulae is already quite a rich one.) The initial object 
of Log is often called simply the free topos; we shall devote much of Chapter F3 
to further study of it (and its close relatives). 

(b) Slightly more generally, if G is any small directed graph, we may construct 
the free topos generated by G, in the sense that logical functors from it to 4 
topos F correspond to diagrams of shape G in F. To do this, we simply take the 
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empty r-theory over the signature whose base sorts are the vertices of G, with 
the arrows of G taken as function symbols. 

(c) More generally still, given any finitary sketch S (in the sense of 2.1.2(d)), 
we may construct a topos Es such that logical functors Es — F correspond to 
S-models in F. (This is not quite as dramatic as it seems, because we lose most of 
the categorical structure of -Mod (F): we can ‘see’ only the isomorphisms in this 
category as natural isomorphisms between logical functors. However, following 
on from 2.4.12(c), we note that there exists a topos € for which there are logical 
functors € — Set with arbitrarily large finite groups of natural automorphisms, 
but none with infinite automorphism groups; thus the theory of logical functors 
defined on E cannot be Morita-equivalent to a finitary first-order theory.) It is 
easy to see how we may write down the conditions that certain finite diagrams 
commute, or that certain finite cones are limits, as sequents over the signa- 
ture described in (b) (enriched with suitable additional function symbols ¢ as in 
4.2.9; cf. 4.3.15 below). To say that certain finite cocones are colimits, we exploit 
the monadicity of the contravariant power-object functor P: FP — F (A2.2.7), 
which implies that a (finite) diagram in F is a colimit iff its image under P is a 
limit. 

(d) Suppose given a r-theory T over a signature X, and let A be a partic- 
ular type of ©. As with A-theories (cf. 4.2.11), we often wish to consider the 
theory Tja] obtained by adding a new function symbol a: 1 — A to X, and no 
new axioms. Thanks to the ‘fundamental theorem of topos theory’ (A2.3.2), we 
may identify the syntactic category of Tja] (up to equivalence) without doing 
any hard work: Erfa) is equivalent to the slice category Ey /A. For, given a logical 
functor F: E — F and a morphism b: 1 — F(B) in F, we obtain a logical functor 


F/B b* 
E/B ——> F/F(B) ——— F ; 


and conversely, given G: E/B — F, we have the composite 


E ae E/B EAN F 
and the morphism G(A): 1 — G(B*B), where A is the diagonal map of B 
regarded as a morphism 1g — B*B in €/B. It is easy to see that these two 
constructions are inverse to each other up to natural isomorphism; so, applying 
the result when € = Er and B = A = MrA, we deduce that logical functors 
defined on Er/A correspond to models of T[a]. 

(e) A very similar argument shows that, if T’ is obtained from T by adding a 
single sentence ¢ as an additional axiom (and leaving the signature unchanged), 
then Ey may be identified with Er/[¢] 1. In particular, we note that the free 
topos of (a) contains a subterminal object (the interpretation of the sentence 
(Vp: 2)(~ V 7p)), such that the corresponding slice category is initial (in the 
2-categorical sense) in the 2-category BooLog of Boolean toposes and logical 
functors. In fact the initial object of this 2-category is not hard to identify 
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explicitly; it is equivalent to Sets, since if B is a Boolean topos then the unique 
coherent functor Set; — B which we constructed in A1.4.7 is actually logical. 
(However, things become more interesting once we add a natural number object; 
cf. 4.3.19(d) below.) 


Proposition 4.3.13 is one half of the assertion that 7-theories are ‘the same 
thing as’ small toposes, in the same sense that A-theories are the same thing as 
small cartesian closed categories. For the other half, we need 


Proposition 4.3.15 Given a small topos E, there exists a r-theory Te such 
that for any topos F we have 


Log (E, F) ~ Te-Mod(F) , 
naturally in F. 


Proof The construction is very similar to that of 4.2.9, and we shall not 
give it in great detail. For our signature, we take (i) a basic sort "A? for 
each object A of E; (ii) a function symbol "f7: A7 — "B7 for each mor- 
phism f: A — B in €; (iii) a function symbol ¿: 1 +17 and function symbols 
t:'A'x'!B'—° Ax B` for each pair of objects (A, B); and (iv) a function 
symbol ¢: P A” —'" PA” for each object A. The axioms are (the sequents cor- 
responding to) the equations taken as axioms in 4.2.9(a) and (b), plus the 
following: 


(c) for each monomorphism m: A’ >> A in E with classifying map ¢: A >Q, 
the two axioms (("“m"(y) ="m(y’)) Fy y (y = y’)) and 


(To (a) =" T (e(*))) Fe (Gy) m? (y) = 2); 
(d) for each object A, the axioms 
(x € w) AF rw (Tev (( (E(w), 2))) = "T (e(*)))) 
where x and w are variables of types "A7 and P" A” respectively, and 


(T Fu (u = e({a: A | (Tev (e((u, £))) = "T (e(*)))}))) 


where u is of type "PA". (Here ev denotes the evaluation map PA x A >Q 
in E.) 


Now we saw in 4.2.9 that a model M (in a topos Ff, say) for the axioms in groups 
(a) and (b) is essentially the same thing as a functor F: E > F preserving finite 
products; such a model satisfies the axioms in (c) iff the corresponding functor 
F preserves arbitrary pullbacks of T: 1 — Q (which, by a remark that we made 
following A1.6.1, is equivalent to saying that it preserves all finite limits); and 
finally M satisfies the axioms in (d) iff the interpretation in M of i yields a 
two-sided inverse for the canonical comparison map ¢4: F(PA) — P(FA) for 
each A, i.e. iff F is a logical functor. 
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As we did with A-theories in 4.2.13, we now wish to extend our notion of 
T-theory by adding list types (or at least the particular list type N = L1) to 
our signature. (Since, as we saw in A2.5.17, any topos with a natural number 
object has list objects — and since any logical functor between toposes with 
natural number objects preserves the natural number object by A2.2.10(i) and 
A2.5.6(ii), and hence also preserves list objects — it does not make any essential 
difference which we do.) 


Definition 4.3.16 (a) By a 7L-signature, we mean a higher-order signature © 
having the type constructors of 4.1.1(i), (ii), (iv) and (v), and the term construc- 
tors of 4.1.2(i)—-(iv) and (vii)—(ix), plus the three types of atomic formulae of 4.1.3 
and the full range of (finitary) logical connectives and quantifiers. A 7L-theory 
is a set of sequents (or sentences, if you prefer) over such a signature. 

(b) The rules of the 7L-calculus are those of the 7-calculus as given in 4.3.1, 
plus the following rules for list types: the axioms 


(T eer (iterz y(s, w, [D = w)) and 
(T Keuu,w (iters,y(s,w, cons(u,v)) = sfiterz y (s, w, v), u/x,y})) , 


where Z is any context containing the free variables of s other than z and y, and 
‘Peano’s fifth postulate’ 


((([] € 2) A (Vx)(Wy)((y € z) = (cons(x, y) € z))) Fe (Wy)(y € 2)) 
where 2, y, z are of types A, LA, P(LA) respectively. 


If we are merely given the particular list type N in our signature, we shall call 
it a 7N-signature, and speak of 7N-theories and the 7N-calculus. In practice, 
we usually work with the latter; but for many purposes it is no harder to work 
with arbitrary list types. 


Remarks 4.3.17 (a) The Soundness Theorem for the 7L-calculus is as usual 
easy to prove: ifo is provable in a TL-theory T, then it is satisfied in all models 
of T in toposes with natural number objects. For we verified the soundness of 
the first two axioms of 4.3.16(b) in 4.1.7(iv) above; the validity of Peano’s fifth 
postulate, for the particular case of L1 = N, was verified in A2.5.9, and the 
same argument is easily seen to work for general list objects. 

(b) The reader may have been surprised not to see the third sequent of 
4.1.7(iv) (that is, (T H, (iter,,,(cons(y,z),[{],z) = 2))) appearing as one 
of the axioms of the rL-calculus. However, now that we have power types 
available, and hence can formulate Peano’s fifth postulate, we may derive 
this sequent from the other axioms. Specifically, if t denotes the closed term 
{z: LA | (iter; „(cons(y, £), [],z) = z)}, then the first two axioms of 4.3.16(b) 
easily imply the sequents (T  ([] € ¢)) and ((y € t) Fay (cons(z, y) € t)), 
whence by the Peano postulate we deduce (Vy: LA)(y € t). 
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(c) In a similar vein, we may show that (the analogues for list objects of) the 
third and fourth Peano postulates are provable in the rL-calculus; that is, we 
have ((cons(x, y) = []) Fz, +) and 

((cons(x, y) = cons(z’,y’)) Fr,2/yy ((@ = 2") A (y = 9'))) - 

For the first of these, we note that if we substitute cons(z, y) and Í] for the 
variable z in the term iter, Map ii ,2), we obtain T and L respectively (where, 
as in 4.3.2, $ denotes the term {z: 1 | 4} of type 9). So from (cons(z, y) = []) 

we may deduce (T = E); from which we obtain (* € 1) and hence L. Similarly, 
if we substitute cons(x, y) for z in the term iter, „ (v, £”, z) we recover z; so from 
(cons(z, y) = cons(x’, y’)) we may deduce (x = 2’) (in a context which contains a 
third variable x” of type A; but we may then eliminate the latter by the method 


which we noted at the end of 1.3.1(a)). And, if we substitute cons(z,y) for z in 
the term snd(t), where t is the term 


iteru »((cons(v, fst(u)), fst(u)), ([], []), 2) 


of type LA x LA, we recover y. For the derivability of the sequent in (b) above 
enables us to deduce (fst(t) = z), and hence (t[cons(z, y)/z] = (cons(z, y), y)). 


Given a TL-theory T, we construct its syntactic category Er exactly as we 
did for a 7-theory earlier in this section; the proof that it is a topos of course 
remains valid. So we need only prove 


Proposition 4.3.18 Let T be a rL-theory over a signature &. Then, for each 
type A of 1, the object LA of Er is a list object over A, in the sense of A2.5.15. 
In particular, Er has list objects. 


Proof The structure maps 04:1— LA and sa: Ax LA — LA are respec- 
tively represented by the closed terms {z: 1 x LA | (snd(z) = [])} and 


{z: (A x LA) x LA | (snd(z) = cons(fst(fst(z)), snd(fst(z))))} . 


Given an arbitrary object b: PB and morphisms [c]: 1 — b, [t]: A x b — b, we 
first consider the closed term 
g = {w: P(LA x B) | ((Vu)((u € w) > (snd(u) € b))A 
(Sy: B)(((+,y) € e) A (([], y) € w))A 
(Vz: LA)(Vy: B)(Vx: A)(((z,y) € w)> 
(3y': B)(({(z,y),y') € t) A ((cons(z, z), y’) € w))))} 
(informally, g is the set of all relations [w]: LA % b which are ‘compatible with 


the recursion data [c], [¢]’). Then we form the term of type P(L.A x B) which is 
the intersection of all the members of g, that is 


f ={u: LA x B | (vw: P(LA x B))((w € g)= (u € w))} . 


Since the relation {u: LA x B | (snd(u) € b)} clearly belongs to g, it is clear that 
f is (or rather represents) a relation LA % b in Er, and that it is compatible with 
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the recursion data. We claim that it is functional, in the sense defined earlier: 
for this we just have to show that 


{z: LA | (Aly: B)((z,y) € f)} 


satisfies the hypotheses of the fifth Peano postulate (where the unique existential 
quantifier J!y is as usual defined by 


(Aly) = (Ay) (GA (vy')(ely’/y] > (y = y'))) -) 


But this is a straightforward induction, using the definition of compatibility with 
the recursion data and the third and fourth Peano postulates as established in 
4.3.17(c). A similar induction tells us that [f] is the unique morphism LA — b 
satisfying the given recursion data; alternatively, we could argue that if [f’] is 
another such, then (f’ € g) must hold, whence we deduce [f] < [f’] in the 
allegory Rel(€r), and conclude [f] = [f’] by A3.2.3. 

Thus we have verified that every object in Er of the form A has a list object. A 
similar, but more complicated, argument could be given to construct list objects 
for arbitrary objects of the syntactic category; but we do not need to do so, since 
we may conclude in particular that N = [1 is a natural number object in Er, 
and so we may apply A2.5.17 to obtain arbitrary list objects. o 


Remarks 4.3.19 (a) Proposition 4.3.18 implicitly contains a proof of the fact, 
which we mentioned in Section A2.5, that the five Peano postulates suffice to 
characterize the natural number object in a topos; for we did not make use of 
the first two sequents of 4.3.16(b) in the proof, but only of the third and fourth 
Peano postulates which we deduced from them. However, we shall provide a 
more explicit proof of this result in 5.1.2 below. 

(b) The proof of 4.3.10 now extends in the obvious way to show that, for any 
TL-theory T, the canonical =-structure My in Er is a conservative T-model. So 
we obtain the Completeness Theorem for the r L-calculus (i.e. the converse of 
the Soundness Theorem stated in 4.3.17(a)). 

(c) Since, as we observed earlier, any logical functor between toposes preserves 
list objects, it is clear that a logical functor F: E — F (between toposes with 
natural number objects) induces a functor T-Mod(&) — T-Mod(F), for any 
TL-theory T. So we have an immediate extension of 4.3.13, proved in exactly the 
same way: for any TL-theory T and any topos F with a natural number object, 
the functor Log(Er, F) + T-Mod(F) is one half of an equivalence of categories. 

(d) In particular we may, as in 4.3.14(a), consider the empty 7L-theory 
over the empty signature, and thus obtain the free topos with natural number 
object, i.e. the initial object in the full sub-2-category Log, C Log of toposes 
with natural number objects. (Note that, by A2.5.6(ii), this subcategory is a 
cosieve in Log.) And by slicing this topos over a suitable subterminal object 
as in 4.3.14(e), we may obtain the initial object of the 2-category BooLogy 
of Boolean toposes with natural number objects. However, the latter is appre- 
ciably more complicated than the initial object of BooLog: in particular, its 
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unique (up to isomorphism) logical functor to Set is not conservative. For in 
the 7L-calculus (indeed, in the r N-calculus) plus the Law of Excluded Middle, 
we may carry through the proof of Gödel’s Incompleteness Theorem; and if ¢ 
is the unprovable sentence constructed in that proof, then [¢] — 1 is not an 
isomorphism in the free Boolean topos with natural number object (since ¢ is 
unprovable), but it is mapped to an isomorphism in Set. (And if we slice the 
free Boolean topos with natural number object over the complement of [¢], we 
obtain a non-degenerate Boolean topos which admits no logical functor to Set.) 
On the other hand, we shall see in Section F3.3 that the initial object of Log is, 
in a sense, no less complicated than the initial object of Logy. 

(e) Just as in 4.3.15, for any small topos € with natural number object we 
may construct a 7L-theory Te such that T¢e-models in F correspond to logical 
functors E — F. Indeed, we do not need to add any further axioms to the list 
given in the proof of 4.3.15 - though we could if we wished add function sym- 
bols ¢: "LA — L" A` for each type A, together with axioms which say that the 
interpretation of ¢ in a model M is a two-sided inverse for the canonical com- 
parison map F(LA) — L(F A) which exists for any product-preserving functor 
between categories with list objects. We leave the formulation of these axioms 
to the reader. 


Suggestions for further reading: Bell [95], Boileau & Joyal [138], 
Fourman [358], Lambek & Scott [682], Lavendhomme & Lucas [688]. 


D4.4 Predicative type theories 


In this section (which may easily be skipped by readers who are anxious to 
press on with the study of applications of higher-order logic in toposes), we shall 
attempt to describe the relationship between the ideas of the three preceding 
sections and the predicative type theories (often associated with the name of 
P. Martin-Lof, cf. [800]) which are studied by many logicians as a framework for 
constructive mathematics. 

We have seen from an early stage in this book that topos theory is inherently 
non-classical, in that relatively few toposes satisfy the Law of Excluded Middle 
(cf. 1.3.3); and we have (particularly in Parts B and C) freely used the word 
‘constructive’ to describe the style of argument which it is appropriate to use 
when working over a base topos S which may not be Boolean. However, for many 
of those who adopt a constructivist view of mathematics, the notion of topos 
(or the equivalent notion of r-theory which we described in the last section) 
is unacceptable, because of the presence of power objects (or power types, as 
the case may be), and the freedom which they give one to use impredicative 
definitions. 

To explain the meaning of the word ‘impredicative’ in this context, it is help- 
ful to consider a specific example: one which will be fresh in the mind of anyone 
who has just read Section D4.3 occurs in the proof of Proposition 4.3.18. Here we 
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wished to construct a particular morphism [f]: LA — b in our syntactic category 
Er; and we did so by considering the intersection of all relations LA % b with 
a particular ‘compatibility property’. But [f] itself, once we have constructed 
it, turns out to be such a relation; in other words, to ‘construct’ [f], we have 
quantified over a set which we could not have constructed (in the strict sense of 
specifying all its members) without constructing [f] itself. (A similar example 
occurs in our proof of the Knaster—Tarski theorem 4.1.10, where we obtained a 
fixed point of f: A — A as the join of a set of ‘pre-fixed points’ which included 
the fixed point we were trying to construct.) Although this type of argument 
is commonplace in the everyday practice of classical mathematicians (it occurs 
almost everywhere that closure operations are to be found), it is anathema to a 
strict constructivist. Such people therefore reject the existence of the ‘set’ which 
we denoted by the term g in the proof of 4.3.18; and since they do not wish 
to jettison the Separation Principle that sets may be ‘separated out’ from pre- 
existing sets by forming the subset of elements satisfying a given formula, they 
are bound to reject the existence of the power type P(LA x B) from which g was 
separated out by this principle. (For another reason why impredicative power 
types may be thought ‘too powerful’ by constructivists, see 4.5.8 below.) 

The use of function types (or of list types) does not lead to the same problems 
of impredicativity; so, if we are searching for a predicative type theory to serve 
as a foundation for mathematics, a natural starting-point might be the (typed) 
AN-calculus of Section D4.2. However, this is clearly too weak, since it does 
not contain any of the first-order logical connectives and quantifiers. There was 
a good reason for this omission: namely that, as we saw in Section A1.5, a 
cartesian closed category (even if properly cartesian closed) need not have many 
of the other ‘set-like’ properties studied in Chapter Al — in particular, it need 
not be regular, and so need not support a sound interpretation even of the 
regular fragment of first-order logic. But we also saw the remedy for this failing, 
in A1.5.13: a category which is locally cartesian closed is a Heyting category 
(and thus supports a sound interpretation of full intuitionistic first-order logic), 
provided it is also cocartesian. So we might reasonably seek a type calculus 
which corresponds to locally cartesian closed categories, in the same way that 
A-calculus and r-calculus correspond to cartesian closed categories and toposes 
respectively. In honour of P. Martin-Lof, who was the first to propose such a type 
calculus in [799], this calculus is often referred to as Martin-Lof type theory; for 
brevity, we shall call it the pA-calculus. 

In this book, we do not have space even to begin to describe the large body of 
constructive mathematics which has been developed within the zA-calculus. But, 
since we have had a good deal to say about locally cartesian closed categories 
in the book, it seems appropriate to give at least a sketch of how this notion 
may be translated into type-theoretic terms; and that is our goal in the present 
section. There are many different ways of presenting the yA-calculus; the one we 
have chosen is designed to emphasize its connection with locally cartesian closed 
categories. However, one feature which they all share, and which differentiates 
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them from the other calculi we have studied in this chapter, is the fact that the 
collection of types is no longer ‘fixed once for all’: in addition to ‘constant types’, 
we also have ‘variable types’ indexed by variables which run over other types, 
and we shall be able to form ‘dependent sums and products’ of such types. 


Example 4.4.1 To explain these notions, the following simple example (bor- 
rowed from [490]) may be helpful. Suppose we wish to define a type whose 
individuals are dates (of the Christian era, as usually recorded): our first idea 
might be that this is simply a product type D x M x Y (day, month, year), 
where the individuals of type D are the integers from 1 to 31, those of type M 
are the integers from 1 to 12, and those of type Y are arbitrary positive integers. 
But this product type would contain individuals such as 31.9.2001 and 29.2.2002 
(in the usual notation), which do not represent actual dates; in order to exclude 
them, we need to think of D as a variable type D(m, y), depending on variables 
m and y of types M and Y, and the type of dates is then the dependent sum 
Ly:y Um:mD(m,y). Similarly, the day on which a monthly payment falls due 
might naively be thought of as an individual of the function type [M x Y —> D], 
but it really belongs to the dependent product Iy:y Im:m D(m, y). (Indeed, the 
type of dates might itself be dependent on variables of other types, such as the 
country under consideration: for example, 10.9.1752 is a valid date in France, 
but not in England.) 


Our language thus has a new feature, namely type declarations which are 
‘sequents’ of the form 
(C+ (A € &-Typ)) 


where T is a context and A is some expression involving free variables z1,..., En 
which appear in I’. The type constructors which we used in earlier sections of this 
chapter become ‘rules of inference’ for manipulating such type declarations; for 
example, the presence of binary product types, as in 4.1.1(ii), would correspond 


to the rule 
(T H (A € d-Typ)) (ft (Be =-Typ)) 


(TF (Ax B € X-Typ)). 


Moreover, the notion of context becomes more complicated than it has been 
hitherto (which explains our change of notation from Z to T): a context is now a 
finite sequence (x1: A1), (£2: A2), - -. , (En: An) of distinct typed variables, with 
the proviso that for each i < n the type declaration 


(2) z Aı), ees (£i—1: Aj_1) H (A; E Ł-Typ)) 


is derivable in our system. (So in particular (A, € X-Typ) must be derivable in 
the empty context [].) 

If T is a context such that (T + (A € 2-Typ)) is derivable, we shall often 
express this by saying that A is a type in the context T, or that T is a suitable 
contezt for A. In addition to type declarations, our language will contain term 
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declarations of the form (T F (t: A)), where A is a type in the context I, and 
we Shall have rules of inference for these too. (The primitive term declarations 
take the place of function symbols in our language: a function symbol f: A — B 
becomes a term declaration ((z: A) + (f(x): B)). For simplicity, we shall not 
allow any primitive relation symbols other than equality, since we know from 
1.4.9 that we can eliminate them in favour of function symbols.) 


Definition 4.4.2 A yu -signature is specified by sets of basic type declarations 
(T F (A e &-Typ)), and primitive term declarations (T + (t: A)) from which we 
may derive further declarations by the following rules: 


(a) the Weakening Rule 
(CF 8) 
(AF @) 
whenever T is a sub-context of A (that is, A contains all the variables in F - 
possibly not in the same order, but at least in an order that makes it a legit- 
imate context), and 0 denotes either an expression of the form (A € =-Typ) 
or one of the form (t: A); 


(b) the Variable Declaration Rule 
(T E- (A € &-Typ)) 


(Œ, (tn41: A) F (tn41: A)) 


where 2,41 is a variable not appearing in I; 
(c) the Substitution Rule 


(T, (y: B)F 6) (TF (t: B)) 
(E F @lt/y])) 


where, once again, 0 may be either a type declaration or a term declaration — 
but in the latter case the substitution of t for y must be made in both the 
term and the type; 


(d) the Dependent Sum Rules 


(T, (x: A) (B € Ł-Typ)) 


(CF (@Œz:4B € X-Typ)) 


(here all occurrences of z in the expression £,- 4 B — which we often abbreviate 
to Ł,B, if the type of z is clear from the context — are taken to be bound), 
and 


(TE (s: A)) (T, (z: A)F (B € d-Typ)) (TF (t: Bls/z])) 
PE ((s,t): Eza B 


plus the rules that from (F F (t: Ea 4B)) we may infer (T F- (fst(t): A)) and 
(T F- (snd(t): B[fst(t)/z])); 
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(e) the Dependent Product Rules 


(T, (a: A) (B € &-Typ)), (T, (x: A) F (t: B)) 
rF (z:4B € &-Typ)) (CF (Ag: A.t: Ilz:aB)) 


and 
(Tt (s:l-4B)) (CF (t: A)) 
(T H (app(s, t): Blt/x 


where, once again, the occurrences of x in the expression I} 4B (and in 
Az: A.t) are taken to be bound; and 


(f) the Unit Type Rules which are simply the axioms ([] + (1 € £-Typ)) and 
({] F (*: 1)). 

The above rules define the basic notion of a signature for the A-calculus. How- 

ever, for practical purposes, we shall wish to add the formation of equalizers to 

the type structure in the form of Equality Types, for which we have the following 

rules: 


(g) Given two term declarations (I, (x: A) (s: B)) and (I, (x: A) F (t: B)), we 
may infer the type declaration (T + (Eq,(s,t) € £-Typ)) (where all occur- 
rences of the variable x in Eq,(s,t) are bound) and the term declaration 


(T, (y: Eq,(s,t)) F (e(y): A)). 


If we are given a pA-signature with these extra rules, we shall call it a AE- 
signature. 


The rules for dependent sums and dependent products are strongly reminis- 
cent of the term constructors for product types and function types, which we 
introduced in 4.1.2. This is of course no accident, since product and function 
types are a special case of dependent sums and products: if the type B does not 
in fact depend on the variable x: A, then ©, 8B will play the rôle of the product 
type Ax B, and II, B will play the rôle of [A — B]. Thus we do not need to include 
ordinary (binary) product and function types explicitly in Definition 4.4.2; but 
we do still need to include the unit type 1. 

Note, incidentally, that 4.4.2(c) allows us to substitute a term only for the last 
variable in our context (though the limited freedom to permute contexts, allowed 
to us by (a), to some extent compensates for this). The reason is the obvious one 
that if we substituted for an earlier variable, on which some of the later types in 
the context were dependent, the removal of that variable from the context would 
leave something which could not be made into a valid context. For the same 
reason, dependent sums and products may be indexed only by the last variable 
in a context. 

Before going on to consider the formulae of the pA-calculus, we next explain 
how the types and terms are to be interpreted in a locally cartesian closed cat- 
egory C. The basic idea is that a context F = (zı : Ai),...,(%n: An) will be 
interpreted as a composable string MA, > MA,_; > --- ~ MA) —> MA, of 
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morphisms of C. (We shall write MT for the domain M An of the first morphism 
in the sequence; if F is the empty context, we interpret MT as the terminal 
object 1 of C.) If B is a type whose existence is derivable in I’, then it will be 
interpreted as a further morphism M B — MY; and any term declaration (t: B) 
in the context I will be interpreted as a section of MB — MT. (Properly, we 
ought to use some such notation as M(I.B) for the interpretation of a type B 
in a context I, since the interpretation will definitely depend on T as well as B; 
but this notation is really too cumbersome for practical use.) Formally, we do 
this as follows: 


Definition 4.4.3 Let © be a pA-signature, and C a locally cartesian closed 
category. A structure M for X in C assigns to each context I a composable 
string of morphisms as above, to each type declaration (T + (B € Ł-Typ)) a 
morphism MB — MT, and to each term declaration (T + (t: B)) a morphism 
MY — MB such that the composite MT — MB — MT is the identity, as 
follows: the interpretations of the basic type declarations and primitive term 
declarations are assigned arbitrarily. The interpretations of contexts are defined 
inductively: if A =T, (an41: An+1) and we have already interpreted the context 
I and the type declaration (F  (A,+1 € U-Typ)), then the interpretation of A is 
the string M An+ı — MA, —> --- —> MA, obtained by concatenating these two 
interpretations. Derived type and term declarations are interpreted according to 
the following rules: 

(a) For the weakening rule, we deal with two sub-cases: first, when F is an 
initial segment of A, and secondly when A is obtained from I by permuting its 
terms. In the first case, say F = (4): Aj),...,(@,: An) and A = (zı: Aj), 
...,(m: Am) where m > n, then the composite of the morphisms which lie 
in the interpretation of A but not in that of [ is a morphism MA — MT 
in C, and we obtain the interpretation of a type or term declaration 0 in the 
context A simply by pulling back its interpretation in the context T along this 
morphism. To deal with the case when A is obtained by permuting I, it suffices 
to consider that when A is obtained by transposing two adjacent members of 
T, say (a; : Ai) and (£i+ı : Aiz1), since such transpositions generate the full 
permutation group. Now, since F and A are both valid contexts, the types A; 
and A,,, must both exist in the context © = (z1: Aj),...,(%i-1: Ai—1); so their 
type declarations in this context are interpreted by morphisms MA; — MA,_\ 
and MAj;,; — MA,_), and the interpretation of the type declaration of A;+ı 
in the context ©,(2z;: A;) is the pullback of the latter along the former, i.e. the 
projection MA; X mA, _, MAj41 > MA,. Similarly, the interpretation of the type 
declaration of A; in the context O, (zi+1: Ai+1) is the projection M/A,41 <ara,_, 
MA; — MAj4,. But the domains of these two projections are (canonically) 
isomorphic, and this isomorphism now ‘propagates up’ the interpretations of the 
rest of IT and A to yield a canonical isomorphism MA = MT. So we obtain 
the interpretation of 0 in the new context A from its interpretation in I by 
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composing with this isomorphism (or its inverse, depending on whether @ is a 
type or a term declaration). 

(b) For the variable declaration rule, if we have interpreted the type decla- 
ration of An+1ı in the context F as a morphism MA,4; > MT = M An, then 
its interpretation in the extended context I,(£n+1 : An+i) is the projection 
M Ån+1 Xm a, MAnii > MAn4i, and we interpret the variable declaration 
(2n41: An+1i) in this context as the diagonal map. 

(c) The substitution rule is simply interpreted by pullback: if the term decla- 
ration (t: B) has been interpreted in the context F as a morphism MT — MB, 
then we obtain the interpretation of 6[t/y] in this context from that of 0 in the 
extended context I, (y: B) by pulling back along this morphism. (Note that, if 
the type and/or term appearing in 0 does not actually involve the variable y - 
that is, if it already exists in the context F — we shall arrive back at its original 
interpretation in that context if we add a ‘dummy variable’ y and then substitute 
a term for it.) 

(d-e) The dependent sum and product types ©,.4B and I],.4B are of course 
interpreted (in a context T) by applying the left and right adjoints to pullback 
along MA — MT to the interpretation of B in the extended context I, (x: A). 
For the terms associated with these types, we proceed as follows: 

(i) The interpretation of ((s,t): &,B) in the context T is the top composite 
in the diagram 


MT {5 M(B[s/2]) —> MB 


ur=— “Ss MA 


where Ms and Mt are the interpretations of s and t in the context I, and the 
square is a pullback. (Note that the interpretation of ©,B in the context I is 
simply the composite MB — M A — MT, so this makes sense.) 

(ii) Given the interpretation of a term (t: EB) in L, the interpretation of 
(fst(t): A) is the composite of this morphism with the morphism MB — MA 
which interprets the type B in the context I, (x: A), and the interpretation of 
(snd(t) : Blfst(t)/z]) is its factorization through the pullback which is used to 
define the interpretation of B[fst(t)/z] in F. 

(iii) The interpretation of (Ar.t : IB) in the context I is obtained by 
regarding the interpretation of (t: B) as a morphism f*(1ur) = laa > MB in 
C/M A (where f is the name of the morphism M A — MT interpreting the type 
A), and transposing it across the adjunction (f* + II f) 

(iv) Given the interpretations Ms, Mt of terms (s : Il,B) and (t: A) in 
the context T, we interpret (app(s, t): B[t/z]) as the pullback along Mt of the 
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composite 


f*(Ms) 
MA ——> f* MII,B —> MB 


in C/M A, where f, as before, is the name of the morphism MA — MT and the 
second factor above is the counit of (f* A IT,). 

(f) The unit type 1 is of course interpreted in the empty context as the 
terminal object of C, and the term (*: 1) is interpreted as the identity morphism 
on this object. 

(g) If E has equality types as in 4.4.2(g), then MEq,(s,t) is interpreted in a 
context Į as the composite E — MA — MT, where E — MA is the equalizer 
of the morphisms MA = MB interpreting s and t, and the term e(y): A is 
interpreted in the context I, (y: Eq,(s,t)) as the morphism E —> E xyr MA 
which is the graph of E — MA regarded as a morphism in C/MT. 


Now we may turn to considering the formulae of the A-calculus. Of course, 
our atomic formulae are simply equalities-in-context [.(s =, t), where s and t 
are terms such that the term declarations (F - (s : A)) and (F F (t: A)) 
are derivable in X. We shall distinguish between the full A-calculus, in which 
we allow ourselves the full range of (finitary) first-order logical connectives and 
quantifiers from 1.1.3 — but with the restriction that we may quantify only over 
the last variable in a context (though we are of course free to permute the 
variables in a context, provided it remains a valid context) — and the cartesian 
A-calculus in which we allow only cartesian logic as defined in 1.3.4. If @ and w 
are formulae, we now write the sequent asserting that w is a logical consequence 
of ¢ in the context I as (T, + y) — though if ¢ is the constant T, as it very 
often will be, we omit it from this notation. 


Definition 4.4.4 Given a structure M for a pA-signature © in a locally carte- 
sian closed category C, we assign to each cartesian formula-in-context ['.¢ an 
interpretation [F.ġ]m — MT in a straightforward way; and if C is also a 
Heyting category, then we may extend this interpretation to arbitrary first-order 
formulae-in-context. Explicitly, an atomic formula I’.(s =, t) is interpreted as 
the equalizer of the morphisms MT 3 M A interpreting s and t in the context I, 
the first-order connnectives are interpreted exactly as in 1.2.6, and quantifiers 
in almost exactly the same way — the only difference being that the morphism 
of C along which we apply the left and right adjoints to pullback is no longer 
necessarily a product projection. We say that a sequent (T, H w) is satisfied 
in a structure M if [F.ġ]m < [[.v]m in Sub(MT). And if T is a wA-theory 
(that is, a set of sequents) over ©, we say M is a model of T if it satisfies all the 
sequents in T. 


Definition 4.4.5 The rules of inference of the full (resp. cartesian) A-calculus 
are all the finitary first-order rules of 1.3.1(a-g) (resp. the rules of regular logic) 
with substitution restricted to the last variable in a context, but with weakening 
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allowed in all cases where it makes sense, plus the following rules for dependent 
sums and products, and the unit type: 


(a) If (T E (t: Xz-4B)), then we may infer (T F (t = (fst(t), snd(t)))). Similarly, 
if (T + (s: A)) and (F F (t: B[s/z])), then (T + (fst((s,t)) = s)) and 
(T F (snd((s, t)) = t)). 

(b) If (T + (t: A)) and (T, (x: A) F (s: B)), then (T F (app(Az.s,t) =Bje/7] 
s(t/z])). And if (T F- (t: Is:4 B)), then (F F- (t = Ax: A.app(t, z))). 

(c) For the unit type, we have the single axiom ((x: 1) + (x = *)). 


If we are working over a wAE-signature, then we adopt the following additional 
rules for equality types: 


(d) We have the axiom 


(T, (y: Eq,(s,t)), (y’: Ea,(s,t)), (e(y) = e(y’)) F (y = 9’) 


and the rule that, from the term declarations (F F- (r: A)), (T, (x: A) F 
(s: B)) and (T, (x: A) (t: B)) plus the equation (F + (s[r/z] = t[r/a])), 
we may infer a new term declaration (I | (fact, (r): Eq,(s,t))) and the 
equation (I H- (r = e(facts (r)))). (We normally omit the subscripts from 
the term fact, ¢(r).) 


It will be noted that the rules of 4.4.5(a—c) are exactly those of 4.2.3(c-d), 
adapted to our new setting. In particular, the proof that the rules of 4.2.3 are 
valid in any cartesian closed category easily generalizes to show that those of 
4.4.5 are valid in any locally cartesian closed category; and since the first-order 
rules are valid in any Heyting category, we obtain the usual Soundness Theorem: 


Proposition 4.4.6 Let T be a cartesian (resp. full) pA-theory over a pà- 
signature X, and let o be a sequent over &. If o is derivable in T, then it is 
satisfied in all models of T in locally cartesian closed categories (resp. locally 
cartesian closed Heyting categories). 


It is clear, too, that 4.4.6 extends straightforwardly to theories over pAE- 
signatures: the first axiom of 4.4.5(d) is valid because equalizers are monic, and 
if we interpret the term fact, :(r) by the unique factorization of Mr through the 
equalizer of Ms and Mt, then the rule is easily seen to be sound. 

The reason for adding equality types to our signature becomes apparent when 
we consider the problem of building a syntactic category for a pA-theory. The 
problem is that, if we proceed along the lines of Section D1.4 (that is, if we 
take the objects of the category to be (the denotations of) formulae-in-context 
(x: A).@), we are faced with the requirement to construct exponentials of the 
form (y: B). yE ® e and (in the absence of power types) we cannot readily 
construct this from the particular case @ = y% = T, which we have already ‘built 
into’ the type structure. So, as in Section D4.2, we must take the objects of our 
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syntactic category to be types, and therefore we need to have equalizers, as well 
as finite products, built into the type structure as well. 

Explicitly, given a cartesian pA-theory T over a wAE-signature ©, the objects 
of our syntactic category Cr will be the types of & (in the empty context). If 
A and B are such types, then morphisms A — B will be represented by terms 
t for which the term declaration ((x: A) FĀ (t: B)) is derivable, two such terms 
t and t’ representing the same morphism if ((x : A) H (t = t’)) is derivable 
in T. Composition is defined by substitution, as in Section D4.2; the identity 
morphisms of Cr are provided by the variable declaration rule of 4.4.2(0). 


Proposition 4.4.7 The category Cr defined above is locally cartesian closed. 


Proof First we show that Cr is properly cartesian closed; except for the con- 
struction of equalizers, this is exactly like the proof of 4.2.5. The terminal object 
is the type 1, and the unique morphism A — 1 is represented by ((z: A) F (*: 1)), 
which we obtain from 4.4.2(f) by weakening. The product of two types A and B 
is ©, 4B (where we use the weakening rule to regard B as a type in the con- 
text (x: A), of course); the product projections are induced by fst and snd 
in the obvious way, and morphisms into the product are induced by the term 
constructor (~,—). The equalizer of two morphisms [((z: A) + (s: B))] and 
[((c: A) (t: B))] is of course given by the equality type Eq,(s,t), together 
with the morphism [((y: Eq,(s,t)) + (e(y): A))]. And the exponential B4 is the 
dependent product I,:4B, with structure induced by app and Az.-— as in the 
proof of 4.2.5. 

Next, we observe that, if A is a particular type in the empty context, we may 
similarly build a category Cr[A] whose objects are types in the context (x: A), 
and (since all the rules of the calculus are ‘insensitive’ to adding an additional 
term to the beginning of a context) the same arguments will show that Cr[A] 
is (properly) cartesian closed. We claim that Cr[A] is equivalent to the slice 
category Cr/A. 

To prove this, observe first that we have a functor ®: Cr[A] — Cr/A which 
sends a type B in the context (x: A) to Xs 4B, equipped with the morphism 
to A represented by the term declaration ((z: Es B) F- (fst(z): A)). A morphism 
B — C in Cr[A], represented by a term declaration ((x: A), (y: B) F (t: C)), is 
mapped to the morphism represented by 


((z: EB) ((fst(z), t[fst(z), snd(z)/z,y]): UzC)) . 


It is easy to check that this is indeed functorial. In the opposite direction, the 
functor Y: Cr/A — Cr[A] sends an object B of Cr equipped with a morphism 
[((y: B) + (t: A))]: B — A to the equality type Eq,(t,2) in the context (x: A); 
and if [r]: [t] — [s] isa morphism of Cr/A, we map it to the morphism represented 
by the term declaration 


((w: Eq,(t,x)) F (facts =(r[e(w)/y]): Eq.(s,2))) - 
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(Note that, as usual, the construction of the functor in this direction depends on 
our ability to choose representatives for the morphisms in Cr, as in 1.4.3.) Now, 
given a type B in the context (x: A), we have an isomorphism between B and 
W(B) represented by the term declarations 


(x: A), (y: B) F (fact((z, y)): Eq, (fst(z), r))) 


and 
((z: A), (w: Eq, (fst(z),7)) F (snd(e(w)): Bla/z])) . 


It is straightforward to verify that this is a natural isomorphism between V® and 
the identity; and the construction of a natural isomorphism ®V = 1 is similar. 
So the claim is established, and hence Cr is locally cartesian closed. D 


Remark 4.4.8 In the same way that we defined the category Cr[A] in the 
proof above, we could of course have defined a category Cr[[] for any con- 
text I, and proved that it is cartesian closed. If F is the context (zı : 
A),.--,)(@n: An), this category may be shown to be equivalent to the slice 
category Cr/Xz, Ai Yea Ad Tae Dono An An 


It is now straightforward as usual to define a X-structure in the category Cr, 
and to prove that it satisfies exactly those cartesian sequents which are derivable 
in T. So we obtain the expected completeness theorem: 


Theorem 4.4.9 Let X be a pAE-signature, T a cartesian pA-theory over ©, 
and o a cartesian sequent over ©. Then o is satisfied in all T-models in locally 
cartesian closed categories iff it is derivable in T. o 


If we wish to put back the full first-order logic, the simplest way of doing 
so is to rely on A1.5.13, and to ensure that our syntactic category has enough 
cocartesian structure (finite coproducts and coequalizers of effective equivalence 
relations) to imply that it is a Heyting category. We may do this by building 
yet more type constructors into our signature: in fact we choose to add dis- 
joint coproducts and coequalizers of arbitrary equivalence relations, so that the 
corresponding class of categories is actually that of locally cartesian closed pre- 
toposes. (The construction of quotients by equivalence relations, though it uses 
the power-set axiom in the traditional set-theoretic approach, is predicative and 
hence acceptable to ‘strict’ constructivists.) We have room here for only a brief 
sketch of how this may be done; for a much more detailed account, we refer the 
reader to [490]. 

For coproducts, we first assume given a particular type 2 (in the empty 
context) with closed terms },t of type 2, for which the sequent ((¢ = ț) +) is 
an axiom (in the empty context). (Note that we do not have to add the empty 
coproduct 0 explicitly, since we can obtain it as the equality type Eq, (t, {) where 
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zx is of type 1.) Then we introduce a new type constructor 


(TF (A € Ł-Typ)) ("+ (B e b-Typ)) 
F, (x: 2) F ((case x of A or B) € &-Typ 


with the convention that this type reduces to A (resp. B) if ¢ (resp. ł) is substi- 
tuted for z, and a similar constructor for terms: given terms s,t of types A, B 
in a context I’, we have a term (case x of s or t) of type (case x of A or B) in 
the context I, (x: 2). The coproduct of A and B in the syntactic category will 
then be ©z.2(case x of A or B). (Of course, we need further term constructors 
and axioms to ensure that this is indeed a coproduct; but we shall not describe 
them in detail here.) 

For coequalizers, our type constructor starts from the assumption that we 
have type declarations (F (A € &-Typ)) and (T F- (R € &-Typ)), plus two 
terms s,t: A in the context I’, (y: R) for which the sequents asserting that R 3 A 
is an equivalence relation (in the category Cr[I']) are derivable. Then we may infer 
a new type declaration (T H (A/R € &-Typ)) and a term declaration (F, (x: A) F 
(q(x): A/R)) satisfying appropriate axioms. To ensure that [q(z)|: A > A/R 
really is a coequalizer in Cr[I'], we need a further term constructor which, given 
a term r of type C in the context F, (x: A) for which the sequent 


(E, (y: R) F (r[t/z] = r[s/z])) 


is derivable, produces a new term cofact, ¢(r, w) in which the occurrences of x in 
R are bound, but which has a new free variable w of type A/R. We leave to the 
reader the task of formulating the axioms which say that [q(x)] is a coequalizer 
and ((s], [t]) is its kernel-pair. 

On adding these type and term constructors to the (full) wAE-calculus, we 
arrive at a logical calculus which we might call the wAF-calculus (F for ‘full 
first-order logic’), which is interpretable in arbitrary cartesian closed pretoposes 
and which satisfies a completeness theorem for interpretations in such categories. 
Moreover, unlike the 7-calculus which we studied in the previous section, it is 
acceptable to strict constructivists. However, as will be clear from what we 
have done in this section, that acceptability has been won at the cost of a very 
considerable increase in the complexity of the calculus itself, and so we shall not 
pursue it further. 


Suggestions for further reading: Beeson [90], Jacobs [490], Maietti [772], 
Maietti & Valentini [774], Martin-Lof [799, 800], Sambin [1079, 1080], 
Seely [1107]. 


D4.5 Axioms of choice and booleanness 


For the remaining three sections of this chapter, we turn from the formal business 
of interpreting higher-order logic in categories such as toposes to its exploita- 
tion: that is, to the consideration of certain properties of toposes that are best 
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studied by making use of the higher-order internal logic of toposes. (This theme 
will continue in Chapter D5, where we look again at natural number objects, 
and at various notions of finiteness in toposes.) We begin by considering the rôle 
of the axiom of choice in topos theory. 

It is well known that the classical axiom of choice is equivalent to the state- 
ment that every surjective map is a split epimorphism in the category Set. 
Accordingly, the assertion ‘every epimorphism splits’ is often taken as the correct 
formulation of the axiom of choice for a general topos. However, this statement 
is really the conjunction of two conditions, one of which holds for Set even if we 
do not assume the axiom of choice. In a general topos, it is frequently useful to 
consider these two conditions separately. 


Definition 4.5.1 
(a) We say that supports split in a topos E, or that E satisfies (SS), if every 
subterminal object is projective in €; equivalently, if for every object A the 
epic part of the image factorization of A — 1 is split epic. 


(b) We say that an object A of a topos £ is internally projective if T14: €/A — E 
is a regular functor (equivalently, preserves epimorphisms). And we say E 
satisfies the internal aziom of choice (IC) if every object of £ is internally 
projective. 


Examples 4.5.2 (a) If € is the topos [a°?, Set] where a is an ordinal (regarded 
as a well-ordered set), then it is easy to see that every nonzero subterminal object 
is representable, and hence projective. So E satisfes (SS); but if a > 1 then £ is 
not Boolean, from which it follows by 4.5.8 below that it cannot satisfy (IC). 

(b) If G is a group, then the topos [G, Set] satisfies (IC) provided Set does; for 
the forgetful functor [G, Set] — Set is logical, and so commutes with the functors 
II 4. However, if G has more than one element then [G, Set] does not satisfy (SS), 
since the epimorphism G — 1 (where G acts on itself by left translations) fails 
to split. 

(c) If G is a topological group, then the topos Cont(G) of continuous G-sets 
may fail to satisfy (IC), even if we assume it in Set. For example, if we take 
G = Z with the topology generated by the filter of subgroups of finite index, 
A = N with trivial G-action, and B to be the coproduct of the finite cyclic 
G-sets C;, of all possible sizes, then I14(B — A) is the coreflection in Cont(G) 
of the product U,enC, as computed in [G, Set]; but all G-orbits of the latter 
are infinite, so its coreflection is empty and does not map epimorphically to 
1 =Ta(14). 

(d) Let X be a locale. An open sublocale U of X , regarded as a subterminal 
object of Sh(X), is projective iff every open covering of U has a pairwise-disjoint 
refinement. For, if this condition holds and we are given a local homeomorphism 
E — X such that E — U is epic, then we can cover U by opens over which 
E admits a section, and after refining this cover to a pairwise-disjoint one we 
can patch the sections together. Conversely, if U is projective and (Vi |i €T ) 
is an open cover of U, then the disjoint union E of the V; is the domain of a 
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local homeomorphism over X with support U, and a section s: U — E defines 
a pairwise-disjoint cover of U by the sublocales s~!(V;) C V;. Thus the topos 
Sh(X) satisfies (SS) iff every family of open sublocales of X can be refined to a 
pairwise-disjoint family with the same union: such locales are sometimes called 
strongly zero-dimensional. We note that they include all zero-dimensional locales 
X which are ‘second countable’, i.e. such that O(X) is countably generated as 
a complete join-semilattice. For, given a family U of opens in such a locale, we 
may refine it to a countable family V = {V;, V2, V3,...} of clopens with the same 
union; then, if we define Wi = Vi, W2 = V2 NCV,, W3 = V3 NC(V, U V2), ..., 
we obtain a pairwise-disjoint family with the same union as V (and hence as 
U). (On the other hand, Sh(X) satisfies (IC) iff it satisfies (AC), iff O(X) is 
Boolean; cf. 4.5.15 below.) 

(e) We remark that, even for spatial locales X, the condition of second 
countability in the previous paragraph cannot be weakened to separability (the 
possession of a countable dense subset). For a counterexample, consider the 
‘Sorgenfrey plane’, that is the set R? with the topology generated by all ‘half- 
open rectangles’ [a,b) x [c,d) (a,b,c,d € R). This is clearly zero-dimensional, 
since the complements of all such rectangles are also open; and it is separable 
since Q x Q is dense in it. But the family of open sets 


{{a,a+ 1) x [-a,—-a + 1)|a € R} 


has no pairwise-disjoint refinement. For if U were a pairwise-disjoint family of 
open sets, each contained in a member of the above family, then for each € > 0 
the set 


{a € R | [a a + €) x [-a, —a + €) is contained in a member of U} 


would be countable, and hence only countably many points of the line {(a, —a) | 
a € R} could belong to members of U. 


In order to establish the relationship between the conditions of 4.5.1 and the 
axiom of choice, we need 


Lemma 4.5.3 For an object A of a topos E, the following are equivalent: 


(i) A is internally projective. 
(ii) The functor (—)4: E — E preserves epimorphisms. 
(iii) For every epimorphism e: B —> A, there exists C => 1 such that C* (e) is 
split epic. 


Proof (i) => (ii) since (-)4 is the composite ©4A*, and X4 preserves 
epimorphisms. 
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(ii) = (i) because we have a pullback square 


Half) => BA 


fA 
la, A 
1 ——__> A 


for any f: B — A, and pullback along a fixed morphism preserves epimorphisms. 
(i) = (iii): Given e, take C to be IH,4(e), which maps epimorphically to 
T4(14) & 1. The counit map A*(C) — e in €/A can be regarded as a splitting 
of C*(e) in E/C. 
(iii) > (i) since C* commutes with the functors H4 in an appropriate sense, 
and reflects epimorphisms. 0 


Corollary 4.5.4 For a topos E, the following are equivalent: 


(i) Every object of E is projective. 
(ii) Every epimorphism in E is split. 
(iii) E satisfies (SS) and (IC). 
(iv) E satisfies (IC) and its terminal object is projective. 


Proof (i)e (ii) > (iii) > (iv) is trivial. 

For (iv) > (ii), (IC) says that every epimorphism e in E acquires a splitting 
when pulled back along some C — 1, by 4.5.3; but if C — 1 itself has a splitting 
c: 1 > C, we can pull back the splitting of C*e along it to obtain a splitting for e. 

O 


A topos satisfying the equivalent conditions of 4.5.4 will be said to satisfy 
the (external) axiom of choice (AC). 

The usual set-theoretic formulation of the axiom of choice is concerned not 
with projective objects but with choice objects, that is objects A admitting a 
choice function c: P+ A — A, where P+ A is the object of inhabited subobjects 
of A (that is, the image of €4~> PA x A — PA), such that the sentence 
(vw: P+ A)(c(w) € w) is satisfied ~ equivalently, the graph of c factors through 
€4~ P+ AxA. To establish the relationship between choiceness and projectivity, 
we need to consider entire relations: we say a relation ¢: A % B (in the sense of 
Section A3.1) in a regular category is entire if 6°¢ > 14. 


Lemma 4.5.5 An object A of a regular category is projective (with respect 


to covers) iff every entire relation with domain A contains (the graph of) a 
morphism. 


Proof Let ¢: A% B be a relation. If (u: T > A,v: T — B) is a tabulation 
of ġ, then the assertion that @ is entire says precisely that u is a cover. If s is a 
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splitting of it, then the morphism f = vs clearly satisfies fe < ġ. Conversely, if 
A satisfies the condition, let e: B -> A be a cover. Then e°: A % B is an entire 
relation; so we have f: A— B with fe < e° and hence (ef). < eee? = la, 
whence ef = 14 by A3.2.3(ii). O 


Lemma 4.5.6 An object A of a topos is choice iff every entire relation with 
codomain A contains a morphism. 


Proof To say that ¢: B% A is entire means precisely that "°: B— PA 
factors through P+ A — PA. If we are given a choice function c, then the 
morphism f = cd ġo” has the required properties. For the converse, we consider 
€4 itself as a relation P+ A % A; to say that a morphism P+ A — A is contained 
in this relation is exactly to say that it is a choice function. Q 


Thus we see that, in a topos, (AC) is equivalent to the assertion that every 
object is choice. There is also a notion of ‘internal choiceness’: A is internally 
choice if the object of choice functions for A, constructed as an appropriate 
subobject of AP tA, is well-supported. And the above argument, with pullback 
functors C* inserted at the appropriate places, shows that E satisfies (IC) iff 
every object of € is internally choice. Note, however, that A is internally choice 
iff there exists C —> 1 such that C*A is a choice object in E€/C; in fact we can 
take C to be the object of choice functions for A. Thus there is not much gain in 
generality by working with internally rather than externally choice objects, and 
from now on we shall mainly confine ourselves to the latter. 

In any category, a finite coproduct of projective objects is projective. For 
choice objects, the picture is distinctly different: 


Proposition 4.5.7 The following conditions on a topos E are equivalent: 


(i) E is Boolean. 
(ii) Finite coproducts of choice objects are choice. 
(iii) 2 = 1111 is a choice object. 


(iv) 2 is an internally choice object. 
Proof (i) = (ii): Suppose A; and Az are choice objects, and let 


u 


B T — > AIA 


be a tabulation of an entire relation ¢: B % A; I Ag. Let T; = v*(Aı), let By 
be the image of T, — T — B, let B be the complement of B, in Sub(B), 
and let T2 = u*( B2). Then T2 N T; = 0 in Sub(T), so v maps Tz into A2; and 
Ti and T; tabulate entire relations ¢,;: Bı % A, and ¢2: Bz % Ag respectively. 
By assumption, these contain morphisms fı: Bı > A, and fz: B2 > Ag; but 
B = B, Il By, so these combine to yield a morphism f: B — A, I Ae, which is 
contained in the relation tabulated by Tı I T and hence in ¢. 
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(ii) > (iii): 1 is trivially choice, since any entire relation with codomain 1 is 
already a morphism (cf. A3.2.8); so (ii) implies that 2 = 1 H 1 is choice. 

(iii) > (iv) is trivial. 

(iv) > (i): Clearly, if there exists C — 1 such that €/C is Boolean, then € is 
Boolean, since C* reflects the validity of the identity == = 1p. So we may as well 
assume that 2 is (externally) choice. It further suffices to prove that Sub¢(1) is 
Boolean, since the hypothesis that 2 is choice is preserved by B*: E — E/B for 
any B. So let U be a subterminal object in E€, and form the cokernel-pair 1 3 B 
of U — 1. As in the proof of 4.5.5, we may regard the epimorphism 1111 — B as 
an entire relation B % 2, and hence find a splitting for it; so B may be identified 
with a subobject of 2, and is therefore decidable by A1.4.15. But €°? is regular, 
so U > 1 is the equalizer of 1 = B and is therefore a pullback of the diagonal 
B > B x B; hence it is complemented. g 


Remark 4.5.8 In particular, it follows from 4.5.7 that a topos which sat- 
isfies (IC) is necessarily Boolean. This result (which was first proved by 
R. Diaconescu [281]) is sometimes viewed as a defect of topos theory by those 
constructivists who reject the impredicative power-set axiom, as discussed in 
Section D4.4: to them, the fact that the choiceness of 2 (equivalently, by 4.5.13 
below, its well-orderability) implies the Law of Excluded Middle is further evi- 
dence that the existence of the power object P2 allows one ‘too much freedom’ 
to define subobjects of 2 which ‘ought not to exist’ in a truly constructive con- 
text. We prefer to see it as evidence of the remarkable power of the axiom of 
choice, and as a warning that it should not lightly be assumed in mathematical 
arguments. 


What can we say about the class of choice objects in a general topos? With 
the exception of coproducts, it has good closure properties: 


Lemma 4.5.9 In any topos, the class of choice objects is closed under 
subobjects, quotients and finite products. 


Proof For subobjects. suppose we have m: A’ > A where A is choice and 
@: B% A’ is entire; then m.¢: B® A is still entire, and any morphism con- 
tained in it must factor through m. For quotients, we have already observed 
that any quotient of a choice object appears as a subobject. We also observed 
that 1 is choice, in the proof of 4.5.7; for binary products, suppose A, and A» 
are choice and ¢: B % A, x Ag is entire. Then we may first choose a morphism 
fı: B — A, contained in (71)e¢, then verify that 6 (71)°(fi)e is still entire 
and choose a morphism f2: B — Az contained in its composite with (72)., and 
finally verify that (fı, f2): B — A, x Ape is contained in ¢. 


Thus, in any Boolean topos, the full subcategory of choice objects is a sub- 
pretopos, in which every epimorphism splits. However, it is not a topos in general: 
it is well known that, in models of set theory without (AC), there exist sets A 
such that A is choice but PA is not. (On the other hand, it is interesting to 
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note that, in the topos Cont(G) of 4.5.2(c), the internally choice objects are 
exactly the uniformly continuous G-sets as defined in A2.1.7, at least provided 
we assume (AC) in Set; so they do form a topos in this case.) 

In Set, Zermelo’s Theorem tells us that the sets which admit choice functions 
are exactly those which can be well-ordered. The notion of well-ordering is defin- 
able in an arbitrary topos: an ordered object (A, <) is said to be well-ordered 
if there exists a choice function c: Pt A — A such that the sentence asserting 
(Yw: P* A)(c(w) is the least element of w)’ is satisfied. Clearly, a well-orderable 
object is a choice object; in the converse direction, the usual proof of Zermelo’s 
Theorem uses the law of excluded middle, but is otherwise interpretable in the 
logic of a topos, so we know that any choice object in a Boolean topos admits a 
well-ordering. 

Somewhat surprisingly, the same result holds in non-Boolean toposes. In 
order to prove this, we need a clearer picture of what choice objects look like in 
a non-Boolean topos. We shall need to use the notion of widespread subtermi- 
nal object introduced in A1.6.11. Recall that, in any distributive lattice L, an 
element u is called widespread if the lattice {v € L | v > u} is complemented; 
and we say that a subterminal object U in a topos is (internally) widespread if 
A x U — A is widespread in Sub(A) for all objects A - which is equivalent to 
saying that the sentence 


(Vp: Q)(p V (p=>"U")) 


is valid in the internal logic of the topos, and to saying that the closed subtopos of 
E complementary to €/U (cf. A4.5.3) is Boolean. (Note that we have now ceased 
to distinguish notationally between a term t of type Q and the corresponding for- 
mula (* € t) of the internal language, which we denoted t in Section D4.3.) More 
generally, we say a subobject A’ — A is widespread in € if it is widespread as a 
subterminal object of E/A. We may construct a subobject W — Q ‘classifying’ 
the class of widespread subobjects, namely the interpretation of 


{4:9 | (Yp: Q)(pv (p+ q))} - 


W is sometimes called the Higgs object of €, in honour of D. Higgs who was the 
first to investigate it (in the context that we have already met in Section A1.6). 

It is clear that, for any object B of €, the forgetful functor Ug: E/B — E 
reflects the property of being choice, since it preserves entire relations. We shall 
also need 


Lemma 4.5.10 Let U be a subterminal object in a topos E, and A an object 
which is a sheaf for the closed local operator c(U). Then A is choice as an object 
of E iff it is choice as an object of sh.yy(E). 


Proof The assertion that A is a c(U)-sheaf says that U*(A) & ly; so 
U*(P+A) = Pt(1y) & ly, ie. P*A (as constructed in E) is a c(U)-sheaf. 
It follows that the inclusion functor sh,(v)(E) — E preserves the construction of 
P* from which the result follows. o 
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Lemma 4.5.11 Let A be a choice object of a topos E, and suppose there exists 
a morphism a:1— A. Then the classifying map of a:1-— A factors through 


W =Q. 

A 

B 
The two morphisms (14 x a), (a x 14): A 3 Ax A induce a morphism B — 
A x A, which is monic by A1.4.3; so by 4.5.9 B is a choice object. It remains 
choice when we regard it as an object of €/A via the codiagonal map; but as 
an object of this topos, it lives in the closed subtopos sh,(g)(€/A), and in this 
subtopos it is simply the coproduct of two copies of the terminal object. So 4.5.10 


says that 1 H1 is a choice object in sh,q)(E/A), and hence by 4.5.7 this topos 
is Boolean, i.e. a is internally widespread as a subterminal object of €/A. EI 


Proof Form the pushout 


a 
1 ————_> 
a 


A ———-> 


Proposition 4.5.12 In any topos, the Higgs object W is well-orderable; in 
particular, it is a choice object. 


Proof First we note that, if p and q are variables of type W, then the formulae 


(p V (p = q)) and (q V (q = p)) are valid, whence we have (p V q V (p#q)), OF 
equivalently 


(Vp,g:W)((p=T)V(q=T)V(p=Qq)). 


(In other words, W is a torsor over the two-element monoid {1,e | e? = e}; 


cf. B3.2.4(c).) It follows easily that the binary relation on W defined by the 
term 


{(p,q):W x W | ((p=T)V(p=4))} 


is a total ordering (note that it is the opposite of the ordering induced by the 
usual ordering on Q). We claim that it is a well-ordering. 
To prove this, let B — W be a well-supported subobject. Form the pullback 


U >——_> 1 
~ 
B >> W 


then U is widespread as a subobject of B, so we can find a subobject C — 
B which is a complement for B x U in {B’ € Sub(B) | U < B’}. We have 
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oC Uo(B x U) =o0C UU =oB = 1; but oC > U since there exists a morphism 
U — C, so oC = 1. Now we have 


(vq: B)((4 = T)= (T ETB), 
so from (q = T) we may deduce that U = 1 and hence that C = U. Thus 
(vp: C)(Vq: B)X( (4 = T)= (p = T)), 


and combining this with the property of W established in the first paragraph of 
the proof we have 


(Vp: C)(Yq: B\((p = T) v (p =4)), 


that is, (Yp: C)(Vq: B)(p < q). Combined with the fact that C is well-supported, 
this shows that C & 1 and that the image of C — B is the least element of B. 
Applying this argument to the generic well-supported subobject of W in the 
topos €/P+W, we deduce that the sentence of the internal language asserting 
the well-orderedness of W is satisfied. Q 


Theorem 4.5.13 In any topos, the choice objects are exactly the well-orderable 
objects. 


Proof We have already observed that well-orderable objects are choice, so we 
have to prove the converse. Let A be a choice object in a topos €; by working in 
the topos €/aA, where cA is the support of A, we may assume that A is well- 
supported, and hence that it has an element a: 1 — A. By 4.5.11, the classifying 
map of a factors through W — Q, so we may regard (Xa: A —> W) as an object 
of E/W. Also, since 


1 ———_> 1 
Xa 
A ——> W 


is a pullback, Xa lives in the closed subtopos sh,(t)(E/W) by A4.5.3(ii); and it 
is choice as an object of this topos, by 4.5.10. But this topos is Boolean because 
T is widespread in W; so by Zermelo’s proof we may find a well-ordering of Xa 
in this topos, and by an easy extension of 4.5.10 it remains a well-ordering of Xa 
in €/W. 

We are thus reduced to proving that, given a morphism f: A — B in a topos 
E, a well-ordering < g of B and a well-ordering <ș of f in E/B, the ‘lexicographic’ 
ordering of A given by 


(z <a y) dt (ft <s fy) v (fa = fy) A (2 <s y))) 


is a well-ordering. But the well-known set-theoretic proof of this is construc- 
tive, and so valid in a topos: given an inhabited subobject A’ — A, we let 
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B’ = 3,(A’) — B, find the <,-least element b: 1 — B of B’, and then find the 
least element of the pullback 


A" > A’ 


Eea 


for the well-ordering which is the pullback of <; along b. O 


Although, as we have seen, the axiom of choice in a topos is incompatible 
with non-Booleanness, there is a classically equivalent assertion, namely Zorn’s 
Lemma, which can hold in non-Boolean toposes. It should be noted, however, 
that uses of Zorn’s Lemma in mathematics are almost always followed immedi- 
ately by an appeal to the law of excluded middle: having obtained a maximal 
element of some inductive poset, one assumes it does not have the property one 
is seeking and derives a contradiction by constructing a strictly larger element of 
the poset. Thus the validity of Zorn’s Lemma in a non-Boolean topos is perhaps 
less useful than it appears at first sight ~ though we shall see one significant 
application of it in 4.6.15 below. 

Given a poset (A, <) in a topos, we say a subobject A’ — A is a chain 
if it is totally ordered by (the restriction of) <, i.e. if it satisfies (Vz,y: A’) 
((x < y) V (y < x)). And we say (A, <) is inductive if it satisfies 


(Vw: PA)((w is a chain) => (3x: A){zx is an upper bound for w)) . 


Proposition 4.5.14 Assume (AC) holds in Set, and let E be a localic Set- 
topos. If (A, <) is an inductive poset in E, then there exists a: 1 — A which is 
(internally) a maximal element of A, i.e. satisfies (Yx: A)((a < x)= (a =72)). 


Proof Consider the (external!) poset C of chains in A, ordered by inclusion. 
It is clear that C is an inductive poset in Set, since the union of a chain of chains 
is a chain (and €, being defined over Set, has arbitrary unions of subobjects). 
So it has an (externally) maximal element Ap — A, say. Since A is inductive, 
there exists B — 1 in E and a morphism a’: 1g — B*A which is an upper 
bound for B*( Ao); we claim first that a’ factors through B*( Ao), and is therefore 
the greatest element of this subobject. Since B can be covered by subterminal 
objects, it suffices to prove that, for every f: U > B with U subterminal, f*(a’) 
factors through U*( Ag); but if U is subterminal, then A; = Ag UU > A (where 
U maps to A by the transpose of f*(a’)) is easily seen to be a chain, and so must 
equal Ag by maximality. 

Now since greatest elements are unique when they exist, we see that the 
pullbacks of a’ along the two projections B x B = B must be equal; hence a’ 
is itself the pullback along B — 1 of a global element a: 1 — A, which is the 
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greatest element of Ag. We claim that it is (internally) a maximal element of 
A. For, if we are given an object D and a morphism b: 1p — D*A satisfying 
D*a < b, then it suffices as before to prove that b and D*a coincide when pulled 
back along any g: V — D with V subterminal, and we may do this as before by 
constructing a chain Az = Ag UV containing Ag. 0 


Using 4.5.14, we may give a characterization of those Set-toposes which 
satisfy (AC): 


Theorem 4.5.15 Assume (AC) holds in Set. Then a Set-topos (y: E — Set) 
satisfies (AC) iff E is Boolean and y is localic. 


Proof If € satisfies (AC), then it is Boolean by 4.5.8. We must show that 
the subterminal objects of € form a separating set: suppose given a parallel pair 
f,g: A 3 B with f Æ g. The equalizer E — A of f and g is not the whole of 
A, so its complement E’ — A is nonzero; hence by (SS) we can find a nonzero 
subterminal object U and a morphism h: U — A factoring through E’. Now the 
equalizer of fh and gh is h*(E — A) = (0 — U), so fh Æ gh. 

Conversely, suppose € is Boolean and y is localic. Suppose given an epimor- 
phism f: A > B in E; then we may construct an internal poset S — P(A x B) 
whose elements are (graphs of) partial sections of f, i.e. commutative diagrams 


A 


f 


B’ >_> B 


with the partial ordering given by s < s’ iff s’ extends s. It is clear that S is 
inductive, so it has an (external) maximal element, corresponding to a partial 
section so: Bo — A. We must show that Bo is the whole of B. Suppose not; 
then its complement Bı — B is nonzero, and hence so is the pullback A, of 
A along Bı — B. Thus the coprojections A; 3 A, H A; are not equal, and 
so there exists a nonzero subterminal object U and a morphism U — A. But 
now the composite U — A, ~ A — B is monic, and disjoint from By — B; 
so we may extend our section sg to one defined on Bo IU, contradicting the 
maximality of sọ. Thus we have shown that every epimorphism in € is split, 
i.e. E satisfies (AC). O 


We have seen in the proof of 4.5.15 that if a topos is Boolean and satis- 
fies (SS), then its subterminal objects form a separating set. The assumption of 
Booleanness is essential here: if £ is the topos of sets equipped with an idempo- 
tent endomorphism, then € is 2-valued and its terminal object is projective, but 
not a separator. 
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Another application of (SS) allows us to ‘lift’ a weak form of the axiom of 
choice, namely (countable) dependent choices, from Set to at least some non- 
Boolean toposes. We recall that the ariom of dependent choices (DC) says that, 
if we are given a set A and a subset R C A x A which is entire as a relation 
A ® A (i.e. such that (Vz: A)(Sy: A)((z,y) € R) is satisfied), then for any 
a € A there exists f: N— A with f(0) = a and (Vn: N)((f(n), f(n + 1)) € R). 
This axiom plays an important rôle in analysis; we shall meet it again in our 
discussion of the real numbers in a topos (see 4.7.12 below). 


Lemma 4.5.16 Suppose E = Sh(X) is localic over Set and satisfies (SS) (and 
assume (DC) holds in Set). Then € satisfies (DC). 


Proof Suppose given objects A and R satisfying the hypothesis of (DC) inter- 
nally in €. The key point is that, given an open U C X and an element zx of 
A(U), we can find another element y of A(U) such that ((x, y} € R") holds; for 
the object [{y: A | (z, y) € "R"}] has support U by assumption. Thus, starting 
from any element a of A(U), we can (using (DC) in Set) construct an infinite 
sequence of elements of A(U) such that each is R-related to its successor; but 
since the natural number object in € is just the countable copower of 1, this is 
exactly what we need. O 


Suggestions for further reading: Blass [123, 129], Diaconescu [281], Fourman 
& Sčedrov [365], Freyd [375], Penk [953]. 


D4.6 De Morgan’s law and the Gleason cover 


The assumption of Booleanness for a topos is in some ways an uncomfortably 
restrictive one, in that it eliminates most of the examples of toposes of greatest 
interest. However, there are various intermediate possibilities between the logic of 
an arbitrary topos and that of a Boolean topos; in fact there is a whole sequence 
of successively weaker conditions that we can add to the rules of intuitionistic 
propositional logic, corresponding to an increasing sequence of elements of the 
free Heyting algebra on one generator (the nth condition says that the nth 
element in the sequence is equal to T). As they become weaker, these conditions 
become progressively less interesting (although, of course, there are progressively 
more toposes which satisfy them), but the first of them, which has come to be 
known as De Morgan’s law, is of some interest as it arises naturally in both 
the logical and the geometrical aspects of topos theory. We devote the present 
section to studying it. (For information on the weaker conditions in the sequence, 
see [40, 41].) 

In classical propositional logic, the name ‘De Morgan’s laws’ is given to the 
sequents 


(=(P Vg) dE (=pA-q)) and (>(pAq) 4k (=p V ~g)) 
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which express the fact that negation is a lattice anti-isomorphism in a Boolean 
algebra. The first of these two laws is in fact valid in any Heyting algebra: since 
~(—) = ((—) = L) is self-adjoint on the right, it converts joins to meets. But 
the second one fails in general: the right-to-left entailment is still valid, but the 
left-to-right one is not. 


Lemma 4.6.1 Let H be a Heyting algebra. The following conditions are 
equivalent: 


(i) For all p,q € H, -(pAq) = (>p V ~q). 

(ii) For allp € H, (ap V œp) = T. 
(iii) Every ~~-fixed element of H is complemented. 
(iv) The =~~-fized elements of H form a sublattice. 


(v) a4: H => H is a lattice homomorphism. 


Proof (i) => (ii): Substitute ~p for q in (i). Since (p A >p) = L, we deduce the 
identity of (ii). 

(ii) > (iii): (ii) says that s>p has a complement, namely —p. 

(iii) > (iv): A complemented element is always ~—-fixed, so (iii) implies that 
the -—-fixed elements of H coincide with the complemented elements; but the 
latter always form a sublattice. 

(iv) > (v): =~ always preserves L, T and binary meets; and it preserves 
binary joins as a map from H to the lattice H... of s—-fixed elements, since it 
is left adjoint to the inclusion. So (iv) implies that it preserves binary joins as a 
map H — H. 

(v) => (i): If (v) holds, then we have 


= a(>7p A ~g) 

= ~(p A q) 
using the identity ~ = ~~~ and the fact that the first De Morgan law always 
holds. o 


A Heyting algebra satisfying the equivalent conditions of 4.6.1 is sometimes 
called a Stone algebra. We shall say that a topos E satisfies De Morgan’s law if 
its subobject classifier is an internal Stone algebra in €; equivalently, if for every 
subobject A’ — A in E we have nA’ V ——7A’ & A in Sub(A). There are many 
different ways of expressing this condition: the next proposition lists a selection 
of them. 


Proposition 4.6.2 The following conditions on a topos E are equivalent: 


(i) E satisfies De Morgan’s law. 


(ii) Every ~~n-closed subobject in E has a complement. 
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(iii) The subobject 1:1 — Q has a complement. 


(iv) An object of E is separated for the local operator ~~ (cf. A4.5.9) iff it is 
decidable. 


(v) Every 4-sheaf in E is decidable. 
(vi) QU., is decidable in E. 


(vii) The canonical monomorphism (T,1):2=101— Q_., is an isomor- 
phism in E. 


2 is an internally complete poset in E. 


2 is an injective object of E. 


) 
) 
(x) The class of injective objects of E is closed under binary coproducts. 
) 2 is a retract of Q in E. 

) 


2 is a 7--sheaf in E. 


(xiii) The subcategory sh_.(€) is closed under finite coproducts. 


Proof (i) (ii) is immediate from 4.6.1; and (ii) © (iii) because a subobject 
is ——-closed iff it occurs as the negation of something, iff it is a pullback of L. 

(ii) > (iv): By A4.3.6(a), an object A is separated for a local operator j iff 
its diagonal is j-closed. 

(iv) > (v) is trivial; and (v) > (vi) since Q... is a s7-sheaf by A4.3.7. 

(vi) = (vii): If (v) holds, then T: 1 — Q~ has a complement, which must be 
1:1-—Q_. since the latter is the largest subobject of Q disjoint from T. 

(vii) = (ii) since 2 classifies complemented subobjects in E, and Q~~ classifies 
---closed subobjects. 

(vii) = (viii) since Q~~, being reflective in Q, is always an internally complete 
poset in a topos. 

(viii) > (ix): The underlying object of an internally complete poset is always 
injective, since it is a retract of a power object by B2.3.9(iv). 

(ix) (x): Suppose (ix) holds: let A and B be injective objects, and suppose 
we are given a monomorphism m: ALI B — C. By injectivity of 2, we may find 
a commutative triangle 


AUB> sc 


iil 


and hence a coproduct decomposition C & Ci If Cz such that m restricts to 
monomorphisms A — C and B — C2. Then we may split these monomorphisms 
to obtain a splitting for m. Since monomorphisms are stable under pushout in a 
topos, this is sufficient. The converse is immediate since 1 is always injective. 
(ix) < (xi): 2 is always a subobject of Q; and Q is always injective by A2.2.6. 
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(xi) > (xii): Since 2 is always decidable (cf. A1.4.15), it is s7-separated; 
injectivity forces it to be a sheaf for any local operator for which it is separated. 

(xii) <> (xiii): Suppose (xii) holds; let Ay; and Ag be =—-sheaves, and suppose 
we are given a diagram 


ele eee 
m 
C 


with m ——-dense. Then the composite B — A; O A} — 1111 has a unique 
extension along m; so we obtain coproduct decompositions C = C, H Ch and 
B = B UB, such that the morphisms f and m similarly decompose as fı I fz and 
mı U məs. Then we may extend fı uniquely along m, and fz uniquely along m2 
(since mı and mz, being pullbacks of m along the coprojections, are ~~-dense), 
and verify that these combine to yield the unique extension of f along m. Thus 
a binary coproduct of ~—-sheaves is a ——-sheaf; but the empty coproduct 0 is 
always a ——-sheaf. The converse is again immediate since 1 is always a sheaf. 
(xii) > (vii): Since sh~~ (E£) is Boolean, its subobject classifier Q.. is the 
coproduct of two copies of 1 in it; in other words, it is the associated ——-sheaf 
of 2, the unit of the adjunction being the morphism in (vii). So if 2 is itself a 
—+--sheaf, this morphism is an isomorphism. D 


For yet more conditions equivalent to De Morgan’s law in a topos, see 4.6.14 
and 4.7.11 below. 


Examples 4.6.3 (a) Let C be a small category. We recall that the subobject 
classifier in [C°P, Set] is defined by 


(A) = {sieves on A in C} , 


and L: 1 — Q is the natural transformation which picks out the empty sieve on 
each object of C. So 4.6.2(ii) is equivalent to the statement that the collection of 
nonempty sieves forms a subfunctor 1’ of Q, i.e. that the pullback of a nonempty 
sieve along an arbitrary morphism is nonempty. But, as we saw in A2.1.11(h), 
this in turn is equivalent to saying that C satisfies the ‘right Ore condition’ that 
any pair of morphisms with common codomain can be embedded in a commuta- 
tive square. Note in particular that this condition holds if C has pullbacks; thus, 
for example, we may conclude from 3.1.1 that the clasifying topos (over Set) of 
any cartesian theory satisfies De Morgan’s law. (This may be contrasted with 
the very special nature of those coherent theories whose classifying toposes are 
Boolean; cf. 3.4.6. And see also C4.3.2, for a class of Grothendieck toposes which 
always satisfy De Morgan’s law but are almost never Boolean.) 
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(b) For a locale X, we saw in C1.5.10 that Sh(X) satisfies De Morgan’s law 
iff X is extremally disconnected, i.e. iff the closure of every open sublocale of X 
is Open. 

(c) A particular case of (b) which will be of importance later in this section: 
if B is a Boolean algebra (regarded as a coherent category) and C is its coherent 
coverage (cf. A2.1.11(b)), then Sh(B, C) satisfies De Morgan’s law iff B is com- 
plete. Although, as just stated, this looks rather different from (b), it is a special 
case because C2.2.4(b) tells us that Sh(B, C) is equivalent to Sh(X) where X 
is the Stone space of B (see [520]), that is the locale corresponding to the frame 
I(B) of ideals of B. It is well known that this locale is extremally disconnected 
iff B is complete, but we can give a direct proof as follows. Let J be an ideal of 
B; then an element b of B belongs to its negation ~J (in I(B)) iff bAc = 0 for all 
c € J, iff its complement =b is an upper bound for J. So =J is a principal ideal, 
for all ideals J of B, iff every ideal has a least upper bound, iff B is complete. 
But to say that the negation of every ideal is principal implies that J(B) is an 
(external) Stone algebra, since the principal ideals form a sublattice. Conversely, 
if B is complete, we may ‘relativize’ the above argument to every principal ideal 
of B, to deduce that the subobject classifier of Sh(B,C) is an internal Stone 
algebra. 


It will be seen that the key condition in 4.6.2 is (vii), which says that 2 
coincides with Q~; most of the other conditions say either that 2 (resp. the 
class of all decidable objects) has some property which is known to hold for Q.- 
(resp. the class of all -—-sheaves) in any topos, or that Q.. (resp. the class of 
all -+-sheaves) has some property which is known to hold for 2 (resp. the class 
of all decidable objects) in any topos. One further condition of this kind, which 
looks as if it ‘ought to be’ equivalent to those of 4.6.2 but turns out to be slightly 
weaker, is contained in the following lemma. 


Lemma 4.6.4 The following conditions on a topos E are equivalent, and are 
implied by those of 4.6.2: 


(i) 24 is decidable for any object A of E. 


(ii) The decidable objects form an exponential ideal in €. 


Proof Condition (ii) is implied by 4.6.2(iv), since the separated objects for 
any local operator form an exponential ideal by A4.4.3(ii). It is clear that (ii) 
implies (i); for the converse, suppose B is decidable. Then we have a pullback 
diagram of the form 


B———-> 1 
i | 
2 


B x B ———> 
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which is preserved by the functor (—)4 for any A. If 24 is decidable, then 
any morphism 1 = 14 — 24 is a complemented subobject; hence the diagonal 
A: B4 >= BA x B^ is complemented. o 


Example 4.6.5 Let C be a small category satisfying the condition that, for 
every f: A— B in C, there exist g: B — A and h: B — C such that h = hfg. 
Then the topos [C, Set] satisfies the conditions of 4.6.4. For we recall from A1.4.16 
that a functor F: C — Set is decidable in [C, Set] iff F(f) is injective for all f, 
and from A1.5.5 that the exponential G? in [C, Set] is defined by setting GF (A) 
to be the set of natural transformations C (A, —) x F — G. So suppose G is 
decidable, and let a, 8 be two such natural transformations which have the same 
composite with C (k, —) x 1p for some k: A — D; in other words, they agree on 
all pairs of the form (f,x) where f factors through k. Now consider ap(f, <x) 
for an arbitrary f: A— B and z € F(B); choosing g and h as in the condition 
above, we have 


G(h)(ap(f,2)) = ac(hf, F(f)(z)) 
= ac(hf, F(hfg)(z)) 
= G(hf)aa(1a, F(9)(x)) - 


But since G(k) is injective and ap(k, F(kg)(z)) = Gp(k, F(kg)(x)), we have 


aa(la, F(g)(2)) = Ba(la, F(9)(2)) ; 


hence G(h)(ap(f,xz)) = G(h)(Ga(f,z)), and since G(h) is injective we have 
ap(f,rz) = Be(f,z). Thus a = 8; so we have shown that G? (k) is injective for 
an arbitrary morphism k of C, i.e. GF is decidable. 

Now take C to be the three-element monoid {1,a,b} with a? = ba = a and 
b* = ab = b. Since the morphisms of C are all idempotent, it is easy to see that 
C satisfies the condition above; but it does not satisfy the left Ore condition. So 
the topos [C, Set] satisfies the conditions of 4.6.4, but not those of 4.6.2. 


We may also mention a stronger condition than De Morgan’s law: we say 
that a topos satisfies the strong De Morgan’s law if its subobject classifier is 
internally totally ordered, i.e. if the sentence 


(Vp, gq: 2)(((p>a@) V (@=>p)) = T) 


is satisfied. It is easy to see that this implies De Morgan’s law (in the form given 
in 4.6.1(ii)), by substituting —p for q, but the converse is false. For a functor 
category [C°P, Set], it is easily seen to be equivalent to the condition that every 
slice category C/A is strongly connected (i.e., given any two morphisms f,g of 
C with common codomain, either f factors through g or g factors through f) — 
which is clearly stronger than the Ore condition of 4.6.3(a). However, we have 


1004 HIGHER-ORDER LOGIC D4.6 


the following result: 
Lemma 4.6.6 For a topos E, the following are equivalent: 


(i) E satisfies the strong De Morgan’s law. 
(ii) For every object A of E and every local operator j in E/A, sh,(E/A) 
satisfies the strong De Morgan’s law. 


(iii) For every object A of E and every closed local operator c in E/A, sh,(E/A) 
satisfies De Morgan’s law. 


Proof (i) => (ii): Since the strong De Morgan’s law is expressible by a sentence 
in the internal language, it is clearly inherited by slice categories; so we need 
only show that it is inherited by sheaf subtoposes. But, if 7 is a local operator in 
E, then Q; is a sub-poset of 2 and hence totally ordered in £; and the associated 
sheaf functor preserves total orderings, so it is totally ordered in sh,(€). 

(ii) > (iii) is trivial. 

(iii) > (i): If p and q are variables of type Q, then (pA q = L) is easily seen 
to imply ((p>q) V (q=p)) = (=p V ~q) 2 (~p V 7p); so the strong form of De 
Morgan’s law follows from the ordinary version in this particular case. Thus, to 
verify that the strong form holds in £, it is sufficient to know that the ordinary 
form holds in the closed subtopos of E/Q x Q complementary to the subterminal 
object (T, T): 1 — Q x Q which is classified by the meet of the two projection 
maps2 x23 2. o 


Using this result, it is easy to verify that a localic topos Sh(X) satisfies the 
strong De Morgan’s law iff every closed sublocale of X is extremally disconnected. 
(It can be shown directly that this is equivalent to saying that every sublocale of 
X is extremally disconnected, using the fact that any sublocale is an intersection 
of sublocales which are the union of an open and a closed sublocale; cf. C1.1.17.) 

Because of the presence of negation in De Morgan’s law, we cannot expect 
its validity to be preserved or reflected along geometric morphisms, in general. 
However, there are two particular cases of interest: 


Lemma 4.6.7 
(i) If f: F + E is an open surjection, and F satisfies De Morgan’s law, then 
so does E. 


(ii) If E is a retract in Top of a topos satisfying De Morgan’s law, then E 
satisfies De Morgan’s law. 


Proof (i) is immediate from the fact that f* is a conservative Heyting functor. 

(ii) Let r: F — E, i: E — F be geometric morphisms with ri = 1g. Given an 
arbitrary subobject A’ — A in E, we have r*(7A’) < 7(r*(A’)) in Sub(r*(A)), 
since r*(—A’) is disjoint from r*(A’). The functor i* preserves this inequality, but 
we also have 7*(7(r*(A’))) < a(t*r*(A’)) = 3A’ in Sub(A), for a similar reason, 
and hence ~A’ & i*(-(r*(A’))). Since i* preserves complemented subobjects, it 
follows that ~A’ is complemented; so E satisfies De Morgan’s law. 
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We remark that the analogues of both parts of 4.6.7 hold for the strong 
De Morgan’s law of 4.6.6: they may be proved by similar arguments, or 
alternatively deduced from 4.6.6(iii). 

We saw in A4.5.23 that, for any topos €, we can find a (geometric) surjection 
f: F — E with F Boolean. Indeed, we have a canonical construction of such a 
topos F; but the morphism f does not have any particularly good properties 
(apart from being localic). If we make the more modest demand that the ‘covering 
topos’ F should satisfy De Morgan’s law, then we can do rather better: by 
4.6.7(i), we cannot hope for the covering morphism to be an open surjection, 
but it turns out that we can take it to be a proper surjection (indeed, tidy in 
the sense of C3.4.2), and hence a descent morphism. 

Before stating the next result, we need a new definition: we say a localic 
surjection f: F — E is minimal if the only closed subtopos of F which maps 
surjectively to £ is F itself. If f is a closed map in the sense of C3.2.1, so that 
the image of a closed subtopos shu) (F) (where U € Sub ¢(1)) is simply given by 
sh.,(vy)(E), this is equivalent to saying that f,: Sub¢(1) — Sube¢(1) reflects, 
as well as preserving, the bottom element. 


Theorem 4.6.8 For any topos E€, there exists a topos yE satisfying 
De Morgan’s law and a surjective geometric morphism f:yE— E which is 
localic, proper (in the sense of C3.2.12), separated (in the sense of the same 
reference) and minimal in the sense defined above. Moreover, yE is characterized 
up to equivalence in Yop/E by these properties. 


Proof We define yE to be the topos of sheaves on the internal site in E 
obtained by equipping the internal Boolean algebra (22... with its coherent cov- 
erage; equivalently, it is the topos of sheaves on the internal frame [(Q..) of 
ideals of Q~. Since Q.. is a complete Boolean algebra, it follows from 4.6.3(c) 
(which did not make any appeal to the classical logic of Set) that this topos 
indeed satisfies De Morgan’s law. Moreover, [(Q...) is a coherent frame, so it 
is compact (indeed, strongly compact, by C3.4.1(c)); it is also regular, since the 
principal ideals are complemented and form a basis for it, and so Hausdorff by 
C1.2.17(i). Hence yE — € is proper and separated. For minimality, we note that 
if J is an ideal of Q~ (that is, a subterminal object of yE), then f,(J) is simply 
the truth-value [T € J]; so f,(J) = + is equivalent to =(T € J), and hence to 
J = {1} (since any ideal must contain L, and {1} is the pseudocomplement of 
{T} in Q..). 

For the converse, suppose f: F — E is a geometric morphism with the stated 
properties. Since f is localic, it corresponds to an internal locale X in E; we 
must show that O(X) is isomorphic to J(Q...). Since f is proper and separated, 
X is a compact Hausdorff locale, and hence regular by C3.2.10(ii). Moreover, it 
is extremally disconnected since F satisfies De Morgan’s law; so, from the infor- 
mation that every U € O(X) is a union of opens whose closures are contained 
in U, we deduce that every U € O(X) is a union of complemented (and hence 
compact) elements of O(X). Thus O(X) = I(B), where B is the sub-semilattice 
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of compact elements of O(X); and since the compact elements are exactly the 
complemented ones by C3.2.10(i), B is actually a Boolean algebra — and it is 
complete, by 4.6.3(c). Finally, the minimal surjectivity of f tells us, as before, 
that the Heyting pseudocomplement of the singleton {15} in Sub(B) must be 
{05} — equivalently, since complementation is an isomorphism B — B, {1p} 
is the pseudocomplement of {0g}. So the classifying map x: B — Q of {1p} 
factors through Q~; and since we have 


(Yb: B)77((b = 1g) V (b = 0B)) 


(as the union of a subobject and its pseudocomplement is always >—7-dense), 
it is straightforward to verify that the induced factorization B — Q_.. is an 
isomorphism of Boolean algebras. o 


The topos y£ is called the Gleason cover of E in honour of A. M. Gleason [409] 
who was the first to investigate the corresponding problem of covering an arbi- 
trary space by an extremally disconnected one. The construction for toposes was 
first given in [512]. Note that the morphism yE — € is entire in the sense of 
B4.4.21(a); indeed, we could have replaced ‘proper and separated’ by ‘entire’ in 
the characterization above. 

Note also that if € itself satisfies De Morgan’s law, then yE — € is an 
equivalence; this may of course be deduced from the characterization of it in 
4.6.8, but also follows directly from the definition since if € satisfies De Morgan’s 
law we have J(Q...) = [(2) S Q. This suggests that the construction E > yE 
should be a coreflection of some kind; indeed it is, although the category on 
which it is functorial is far from being the whole of Top. 


Lemma 4.6.9 For a geometric morphism f:F > €, the following are 
equivalent: 


(i) f restricts to a geometric morphism sh- (F) > sh.4(E). 
(ii) f* preserves ~7-dense monomorphisms. 


(iii) For any monomorphism A’ — A in E, we have ~n f*(A') S af*(7A’) in 
Subz(f*(A)). 


Proof (i) (ii) is a particular case of A4.3.12. 

(ii) > (iii): If (ii) holds, then f*(A’)V f*(-A’) — f*(A) must be >—-dense for 
any A’ — A; pulling back along ~f*(A’) — f*(A) we deduce that f*(-.A’) — 
1f*(A’) is =>-dense (note that the former is always contained in the latter, since 
it is disjoint from f*(A’)), and taking pseudocomplements we obtain (iii). 

(iii) = (ii): If (iii) holds, then on applying it to a a—-dense subobject A’ — A 
we deduce that +7(f* A’) = af*(0) = f*(A). O 


Geometric morphisms satisfying the conditions of 4.6.9 are commonly called 
skeletal. They include all open maps in the sense of C3.1.7 (since the inverse 
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image of an open map commutes with negation), but also all dense inclu- 
sions since if F is a dense subtopos of E then sh_..(¥) coincides with sh_.(E) 
(cf. A4.5.21). In fact we can characterize skeletal inclusions as follows: 


Lemma 4.6.10 Let f: F — E be an inclusion, corresponding to a local opera- 
tor j on E. Then f is skeletal iff the exterior of j (that is, the open complement 
of the closure of j, cf. A4.5.19(i)) is a a7-closed subterminal object of E. 


Proof First suppose f is skeletal. From the proof of A4.5.19(i), we recall that 
ext(j) is the image of the initial object 0 of F under f,; but 0 is a s—-sheaf in 
F, so ext(j) must be a ——-sheaf in E. Equivalently, by A4.3.8, it is a —7-closed 
subobject of 1. 

Conversely, suppose the condition is satisfied, and consider the pullback 


FS phe) 


F ~sh,(€) ——>€ 


By A4.5.16, F’ is simply the intersection of the two subcategories sh; (£) and 
sh....(€) of £; in particular it contains the object ext(j), since the given condition 
implies that this is a s—-sheaf as well as a 7-sheaf. So F’ contains the initial 
object of F, and is therefore dense; hence it contains the subtopos sh...(F), and 
so f is skeletal. (In fact F’ must coincide with sh_.(7) in this case, since it is 
a subtopos of sh (E) and therefore Boolean by A4.5.22.) o 


Corollary 4.6.11 The class of skeletal inclusions is the smallest class X 
such that 


(i) open inclusions are in X, and 


(ii) given a composable pair of inclusions 


g f 


G ——> F —— eE 


where g is dense, we have fE E iff fg € X. 


Proof The class of skeletal inclusions certainly has these closure properties 
(the second one because, if g is dense, then sh~~(F) and sh..(G) coincide, as 
we recalled earlier). Conversely, suppose f: sh,(€) — € is skeletal, and let U be 
the pseudo-complement of ext(j) in Sube(1). Then ext(o(U)) = =U = ext(j) by 
4.6.10, so the open subtopos sh,iy)(€) and sh;(€) are both dense in the closed 
subtopos sh-,ext(;))(E). Hence f € Ð. o 


1008 HIGHER-ORDER LOGIC D4.6 


For more information about the composition properties of skeletal mor- 
phisms, see [519]. 

It is easy to see that yE — E is always skeletal; for we can represent sh_,..(€) 
as the topos of sheaves for the canonical coverage C on 2.., and the inclu- 
sion Sh¢(Q..,C) — She (Q~, P) (where P is the coherent coverage) is clearly 
dense. Indeed, since 4.6.2(vii) says that the inclusion sh_..(E) = sh>-(y&) —> yE 
is pure in the sense of C3.4.12, we could alternatively have constructed yE 
as (the middle term of) the pure-entire factorization of sh- (£) — E. From 
the latter observation, and the functoriality of the pure—-entire factorization, we 
immediately deduce 


Proposition 4.6.12 Let Yop,, denote the 2-category of toposes, skeletal geo- 
metric morphisms and geometric transformations between them, and DINZop,, 
the full sub-2-category whose objects are toposes satisfying De Morgan’s law. 
Then E+ yE is a functor Lop,, > DMTop,,, right adjoint to the inclusion. 


Proof The ‘1-dimensional’ part of the proof is contained in the discussion 
above; to complete it, we have only to discuss the effect of y on geometric 
transformations. For this we use an idea that we have exploited before in B3.2.14, 
namely that the Sierpiński topos over F (i.e. the diagram category [2, F]) may 
be regarded as the tensor 2 & F in Yop. Now the subobject classifier in [2, F] 
may, by the construction of A2.1.12, be identified with the diagram (V: Qi > 9), 
where Q is the subobject classifier of F and (2; is, as usual, its order-relation; 
and an easy calculation shows that Q~ in (2, F] is simply the identity morphism 
(Q.. — Q..) (where the inclusion into Qi2 7; maps the first Q~, into Q; via 
the diagonal). In particular, we see that if F satisfies De Morgan’s law, so 
that Q.., coincides with 2 in F, then so does [2,7]. Also, for any F, we have 
y[2, F] = [2,7F]; the topos sh~~([2, F]) does not coincide with [2,sh..(F)]| 
(since the latter is not Boolean!) but with sh_.(#), mapping into [2, F] via 
the open inclusion F — [2, F]. Hence the projection [2,7] — F, and the open 
inclusion F — [2, F], are both skeletal; the closed inclusion is not skeletal (so that 
[2,7] cannot be regarded as a tensor 2 & F in the 2-category Zop,,), but if 
we are given any 2-cell a: f — g between morphisms f,g: F 3 E such that g is 
skeletal (in particular, any 2-cell of Lop,, with source F), the induced morphism 
[2, F] — E is skeletal, since its composite with the open inclusion is g. Hence we 
may define y on such 2-cells by lifting the morphism [2,7] — E to a morphism 
[2, yF] — yE. Thus we have shown that y is a 2-functor Yop,, — Yop,,; and 
since the canonical map yE — € is a natural transformation from y to the 
identity, which is an equivalence precisely when E satisfies De Morgan’s law, Y 
is right adjoint to the inclusion DMZop,, — Topp- 


In passing, we note that it is almost immediate from the definition of skeletal 
morphisms that the assignment € +> sh_.(€) defines a right adjoint to the 
inclusion BooLop — Topp, where BooTop is the 2-category of Boolean toposes 
and geometric morphisms. (Of course, any geometric morphism between Boolean 
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toposes is skeletal.) Note also that yE is Boolean only if € itself is Boolean, since 
in this case it must coincide with sh..(€) and hence the latter maps surjectively 
to E. 


Remark 4.6.13 An alternative proof of the 2-functoriality of y may be given 
along the following lines. Given a skeletal morphism f: F — E, there is a unique 
morphism 6: f*((Q..)¢) — (Q..)¢ making the diagram 


f*(2¢) ———> 25 


POs = sa. 


commute, since f*(2) > f*(Q-~) is s>-dense and 2... is a ---sheaf. (Note 
that this morphism is not simply the restriction of the familiar comparison map 
di: f*(Qe) — Qe; the latter does not map f*(Q ..) into Q... in general.) 6 is 
easily seen to be a Boolean homomorphism (since it is homomorphic on the ~~- 
dense subalgebra f*(2¢)), and hence a morphism of sites when its domain and 
codomain are equipped with their coherent coverages; hence it corresponds to 
a geometric morphism yF — f*(yE) over F, by C2.3.4 and C2.4.3. Composing 
this with the pullback of f along yE — € yields the effect of y on a 1-cell of 
Topp; to define its effect on 2-cells, we use the fact that yE — € is a fibration 
in Top, by B4.4.11. Thus, given a 2-cell a: f — g with codomain €, we obtain a 
diagram 
PAVE eS 98 


{ 


g* (VE) 
commuting up to a (non-invertible) 2-cell, where the left vertical arrow is induced 
by the Boolean homomorphism ag__: f*(Q..) > g* (Q~) in F. And if f and 
g are skeletal, then the diagram 


yF ——> P(e) 


g*(7E) 


commutes in Yop/F, since the corresponding diagram of Boolean algebras in 
F commutes (by uniqueness of 6). So on pasting the two diagrams above, we 
obtain the effect of y on the 2-cell a. 
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A natural ‘companion’ to 4.6.8, which was also first investigated by Gleason in 
the spatial case, is the projectivity theorem for extremally disconnected spaces; 
this was developed for toposes in [515]. First, we need an algebraic equivalent 
of De Morgan’s law: given an internal lattice (or ring) A in a topos E, we say 
an ideal J of A is proper (no connection with the notion of proper map!) if it 
satisfies -(14 € J); by a mazimal ideal we of course mean a maximal proper 
ideal. An ideal J of a lattice A is said to be prime if it is proper and satisfies 


(Va1, a2: A)((a1 Aaz E€ J)>((ai € J) V (a2 € J))) 5 


primeness for ring ideals is defined similarly, with multiplication in place of meet. 
Lemma 4.6.14 Fora topos E, the following are equivalent: 


(i) E satisfies De Morgan’s law. 
(ii) For any internal distributive lattice A in E, every maximal ideal of A is 
prime. 
(iii) For any internal commutative ring A in E, every maximal ideal of A is 
prime. 
(iv) For any internal Boolean algebra B in E, the maximal ideals of B are 
exactly the prime ideals. 


Proof (i) = (ii): Let A be an internal distributive lattice in E, J a maximal 
ideal of A, and suppose given elements a), @2 satisfying (a; A a2 € J). Let K; 
(i = 1,2) be the ideal generated by J together with a;, i.e. the ideal of all 
elements of the form b V c where b € J and c < qa;. By distributivity, it is clear 
that Kı N Kə = J; so since J is proper we have ~((14 € K1)^ (14 € K2)). Since 
De Morgan’s law holds, this implies ~(14 € Kı) V >(14 € Ko), i.e. one of Ky 
or Kə must be proper. But since J C K; for i = 1,2, this implies that we have 
(J = Kı) V (J = K2), and hence (a; € J) V (az € J). So we have shown that J 
is prime. 

The proof of (i) = (iii) is identical apart from notational changes. For (ii) 
= (iv) and (iii) > (iv), we note that a Boolean algebra is both a distributive 
lattice and a commutative ring (and the two senses of the word ‘ideal’ coincide), 
so either (ii) or (iii) implies that maximal ideals in a Boolean algebra are prime. 
But the converse holds without any assumption on €: for if J is a prime ideal in 
a Boolean algebra B, then for any b € B we have (b € J) V (~b € J), and so if 
K is any proper ideal containing B, then we have 


(b € K)=-(-b € K) since K is proper 
=>-(7b € J) since JC K 
>(beJ) 


so K=J. 
(iv) = (i): Consider the internal Boolean algebra Q~. This has a unique 
proper ideal {1}, as we saw in the proof of 4.6.8, so the latter must be maximal. 
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But the assertion that L is a prime ideal of Q.. is exactly the statement of De 
Morgan’s law (restricted to ——-stable truth-values, but this is sufficient since 
-(p A q) is equivalent to ~(>-p A =7q)). O 


To deduce the projectivity theorem for toposes satisfying De Morgan’s law, 
we shall have to make use of Zorn’s Lemma in the form in which we stated it in 
4.5.14. We must therefore restrict ourselves to toposes which are localic over the 
‘classical’ topos Set, in which the axiom of choice is presumed to hold. 


Theorem 4.6.15 For a localic Set-topos E, the following are equivalent: 


(i) E satisfies De Morgan’s law. 
(ii) E is projective with respect to entire surjections in Top/Set. 


(iii) E is projective with respect to proper separated localic surjections in 


Yop/Set. 


(iv) Every compact Hausdorff locale in E which is consistent (in the sense of 
C2.4.8) has a point. 


(v) Every coherent internal locale in E which is consistent has a point. 


Proof (i)=(v): Let X be a coherent internal locale in £: that is, O(X) = I(B), 
where B is the distributive lattice of compact open sublocales of X. It is clear 
that such an X is consistent iff B is non-degenerate, i.e. satisfies —(0g = 1g). 
Also, points of X correspond to completely prime filters in O(X) (cf. C1.2.2); 
and since every element of I(B} is a join of principal ideals, it is easy to see that 
a completely prime filter of I(B} is determined by the principal ideals which it 
contains, and that in fact there is a bijection between completely prime filters 
in [(B) and (ordinarily) prime filters in B. But if B is non-degenerate then the 
poset of proper filters in B is (inhabited, and hence) inductive; so by 4.5.14 it 
contains a maximal element. And by 4.6.14 (applied to the dual lattice B°P) 
such a maximal filter is prime. 

(v) = (iv) follows easily from C4.1.13, where we showed that every compact 
Hausdorff locale is a retract of a coherent locale: clearly, the latter is necessarily 
consistent if the former is. 

(iv) = (iii): Since proper separated localic surjections are stable under pull- 
back in Yop, it suffices to prove that every such map with codomain E can 
be split. But such a map with codomain € corresponds to a consistent compact 
Hausdorff locale in €, and a splitting of it corresponds to a point of this locale; so 
the implication is immediate. And (iii) = (ii) is immediate since entire morphisms 
are localic, proper and separated. 

(ii) > (i): Suppose £ is projective. Then the Gleason cover map yE — E must 
split. But this is equivalent to saying that the internal Boolean algebra Q~ in E 
has a prime ideal, and we saw in the proof of 4.6.14 that this is equivalent to De 
Morgan’s law in E. (Alternatively, we can use 4.6.7(ii) to deduce that E inherits 
De Morgan’s law from y£.) O 
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Remark 4.6.16 In combination with 4.6.8, the result of 4.6.15 implies that 
any geometric morphism f: F — E between localic Set-toposes can be lifted to 
a morphism yF — yE. However, in view of the arbitrary choice involved in the 
lifting, we cannot hope to make the latter functorial on a larger category than 
that of skeletal maps. 


Suggestions for further reading: Johnstone (509, 510, 512, 515], Kock & 
Reyes [645], Mulvey & Pelletier [882], Niefield & Rosenthal [897, 898]. 


D4.7 Real numbers in a topos 


The familiar set-theoretic constructions of the ring of integers, the field of 
rationals and the field of real numbers starting from the semiring of natural 
numbers may all be carried out within the higher-order type theory described 
in Section D4.3, and so make sense in any topos €, provided it has a natural 
number object from which the construction can start. However, the non-classical 
nature of the internal logic of E makes itself felt at the final stage of this con- 
struction: although all sensible constructions of the integers and of the rationals 
yield the same results, there are various different constructions of the reals which 
are classically equivalent but may yield different results in a non-Boolean topos. 

Throughout this section we work in a topos with a natural number object N. 
(The constructions from N of the object Z of integers and the object Q of 
rationals, being essentially first-order, could in fact be carried out in any pretopos 
with a natural number object.) We define Z to be the quotient of N x N by the 
equivalence relation which identifies (m,n) with (m’,n’) iff m+n’ =m +n, 
and we define a monomorphism N — Z to send n to the equivalence class of 
(n,o). The arithmetic operations of addition and multiplication (cf. A2.5.4) are 
extended to Z in an obvious way, and make it into a commutative ring: in fact 
it is the free commutative ring (with 1) on no generators, and its underlying 
additive group is the free group on one generator in € (cf. 5.3.5 below). Then we 
define Q to be the field of fractions of Z, i.e. the quotient of Z x Zx (where Zx is 
the subobject of nonzero elements of Z, i.e. the complement of 0: 1 — Z — note 
that this element is indeeed complemented, since the equivalence relation we 
defined on N x N is complemented and hence Z is decidable) by the equivalence 
relation which identifies (z, w) with (z’, w’) iff z-w’ = z’-w. Once again, this 
equivalence relation is complemented, and so Q is decidable; we embed Z into 
Q by sending z to the equivalence class of (z,1). And we can again extend the 
arithmetic operations from Z to Q: for example, the sum of the equivalence 
classes of (z,w) and (z’,w’) is the equivalence class of (zw’ + z'w, ww’). The 
total ordering on N, which we defined in A2.5.13, also extends to both Z and Q 
in a straightforward manner. 


Lemma 4.7.1 In any topos with a natural number object, Q with its arithmetic 
operations is a model of the coherent theory of fields described in 1.1.7(h); and 


D4.7 REAL NUMBERS IN A TOPOS 1013 


Q with its order relation is a model of the theory Læ of dense linearly ordered 
objects without endpoints, described in 3.4.11. 


Proof It is straightforward to verify that the usual set-theoretic proofs of these 
facts are constructive, and so valid in any topos. o 


We note that the constructions used so far are all preserved by coherent 
functors, in particular by inverse image functors: since the latter also preserve 
natural number objects by A2.5.6, it follows that they preserve objects of integers 
and of rationals. In particular, in any topos of the form |C, Set] the object of 
rationals is simply the constant functor on C with value Q, and in a topos of the 
form Sh(X) it is the constant sheaf with stalk Q, i.e. the sheaf whose sections 
over an open U are locally constant functions U — Q (equivalently, continuous 
functions U — Q, where Q is given the discrete topology). 

For most of this section, we shall consider real numbers as Dedekind sections 
of the object of rationals (though we briefly discuss an alternative construction 
in 4.7.12 below). However, there are still various choices to be made: do we work 
with ‘one-sided’ or ‘two-sided’ sections, and if we use two-sided sections how do 
we express the idea that the two sides are at zero distance from each other? 
Classically, these choices do not matter, since they yield isomorphic models of 
the real numbers; but we shall see that in constructive logic they yield different 
results. 

We begin with one-sided sections. Let (A, <) be a model of the theory Lx in 
a topos E: we define an open lower section of A to be a subobject L — A such 
that the sequents 


(((£ < y) A (y ETL") Fry (CETL), 
((x ETL") Fr (3y)((£ < y) A (y E" L"))) and 
(T H (3x)(x ETL”) 
are satisfied. We say an open lower section is proper if it also satisfies 
(T H (3xr)>(z e"L)): 


the reason for separating this axiom from the other three is that, unlike them, it 
is not expressible as a coherent sequent. Clearly, we may form a subobject At of 
PA whose elements are the open lower sections of A; and we shall write A, for 
the subobject of At whose elements are the proper sections. Associated with any 
element a: 1 — A, we have a proper open lower section |a = {x: A | £ < a}; this 
defines a morphism |; A — A,. We define the object R, of lower semicontinuous 
real numbers in E to be Q]. 

We may also consider open lower sections of A as models of a propositional 
geometric theory over € (in the sense of B4.2.12, or of 1.1.7(m)), expressed in 
a language with primitive propositions of the form (a € TL”), a € A. For sim- 
plicity we shall assume that E is defined over Set, and that A is of the form 
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~* Ao for some L,.-model Ao in Set (as it is in the case of interest to us), where 
y: E — Set is the geometric morphism: then we may identify open lower sec- 
tions of A in € with geometric morphisms from € to the classifying topos of this 
propositional theory. 


Lemma 4.7.2 Let A be an L,,-model in Set, and let T denote the propositional 
theory of open lower sections of A. Then the classifying topos Set[T] may be taken 
to be Sh(Y), where Y is the locale whose frame O(Y) is generated by symbols 
ta, a € A, subject to the relations Ta A^ 7b =f (max{a, b}), 


ta=\/{Tb| b> a} 


for alla € A, and T = V{fa|a€ A}. Moreover, this locale is spatial; it 


may be identified with the space Y = A’ obtained from the (classical) Dedekind- 
MacNeille completion A of A by adding a top point oo, endowed with the topology 
generated by the sets {y EY |a < y}, a€A. 


Proof By 3.1.14 (or by B4.2.12), we know that Set[T] is localic over Set, 
and the frame corresponding to it is the Lindenbaum algebra of T. The latter is 
generated by the primitive propositions (a € "L"), a € A, subject to relations 
corresponding to the axioms of the theory; the relations in the statement of the 
lemma are a direct translation of the three coherent sequents we wrote down 
earlier, with a change in notation from (a € TL7) to fa throughout. So the first 
assertion of the lemma is a triviality. The second assertion, that the locale Y is 
spatial, is slightly less trivial; but points of Y correspond to frame homomor- 
phisms O(Y) — O(1), and hence to mappings {fa | a € A} — O(1) preserving 
the given relations; and it is easy to see that these correspond exactly to the 
points of A (the point y which corresponds to a lower section L sending 7 a to 
T iff a € L), together with the point co which sends every | a to T. Moreover, 
the topology on Y, is that generated by the sets {y € Y | a < y}; we must show 
that this is isomorphic to O(Y). 

However, the first relation in the definition of O(Y) (plus the infinite distribu- 
tive law) tells us that any element of O(Y) is equal to a join of generators; and the 
other relations tell us that we have fa < \/{Tb |b € S} (resp. T < V{Tb| 6 E€ SH 
provided that for every c > a (resp. for every c € A) there exists b € S with b < c. 
And these relations are in turn implied by the corresponding inclusions between 
subsets of Yp; so the canonical frame surjection O(Y) =» O(Yp) is bijective. O 


Note that if we omit the top point oo from the space Y of 4.7.2, we obtain 
one which is not sober. This is related to the fact that the ‘properness’ axiom 
for open lower sections cannot be formulated geometrically. 


Corollary 4.7.3 Let X be a (sober) topological space. Then, in the topos 
Sh(X), 


(i) the object RY is the sheaf whose sections over an open V C X are all lower 
semicontinuous functions from V to the set Rt = RU {oo}; 
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(ii) Ry is the sheaf whose sections over an open V are all those lower semi- 
continuous functions f: V — R which are ‘locally bounded above’, i.e. such 
that for all x € V there exists q € Q and a neighbourhood W of x such that 
f(y) <q for ally € W. 


Proof (i) By definition, sections of R} over V correspond to models of the 
geometric propositional theory of open lower sections of Q in the topos Sh(V). So 
this is immediate from the fact that lower semicontinuous functions are exactly 
the continuous functions when R* is equipped with the topology described in 
4.7.2, plus the fact that continuous functions V — R*+ correspond bijectively to 
(isomorphism classes of) geometric morphisms Sh(V) — Sh(Rt) (C1.4.5). 

(ii) follows from (i) and the fact that, for the lower section L corresponding 
to a lower semicontinuous f: V — Rt, the truth-value of (3x)}=(x € TL7) is the 
union over all q € Q of the interiors of the sets {x € V | f(x) < q}. O 


The local boundedness condition in (ii) is stronger than simply asserting that 
f never takes the value oo; for example, consider the function f defined on R by 
f(x) = 0 for all z < 0 and f(x) = 1/x for x > 0. For more information about 
the approach to real numbers via one-sided Dedekind sections, see [1001]. 


One obvious problem with one-sided sections is that they fail to inherit the 
full algebraic structure of Q: although we can add two lower semicontinuous 
functions, we do not have additive inverses, and multiplication is even more 
problematic (unless we restrict ourselves to functions taking only positive values). 
Therefore, for most applications, it is more convenient to work with two-sided 
sections; that is, with pairs (L,U) such that L is an open lower section of our 
L..-model A, U is an open upper section, L and U are disjoint (i.e. 


(2 ETL) A(z €TU")) Fe L) ) 


and ‘L and U are at zero distance apart’. However, we still have a choice about 
how we formulate the last of these conditions. If we wish our sections to be 
models of a geometric theory, we are led to use the axiom 


((£ < y) Fry (ETL) v(yeTU"))). 
However, we can also consider the pair of axioms 
(((x <y)A7(yE'U")) Fey (£ € TL ")) and 


(((£ < y) Ale ETL) Fay (y EU"); 


these are both strictly weaker than the coherent sequent above, and (in the 
presence of the other axioms) they express the idea that L is the largest open 
lower section disjoint from U, and U is the largest open upper section disjoint 
from L. We shall call a pair (L,U) a Dedekind section of A if it satisfies the 
stronger axiom above (plus the earlier axioms), and a MacNeille section if it 
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satisfies the two weaker ones. Clearly, we have subobjects Aq and A, of PAx PA 
corresponding to these two notions; in the particular case when A = Q, we shall 
write Ry and Rm. Note also that we always have Ag C Am as subobjects of 
PA x PA; and we have an embedding A — Ag given by r+ (| 2,72), as well 
as an embedding Am >> A, given by (L,U) +> L. (The latter map is monic 
because, for a MacNeille section (L,U), we can recover U from L: 


((y € TU) Ary (S2)((@ < y)A(# ETL) .) 


As with the one-sided sections, we begin our investigation of two-sided sec- 
tions by considering the classifying topos for the geometric propositional theory 
of Dedekind sections of an L..-model A in Set. For simplicity, we shall restrict 
ourselves to the case A = Q (recall that this is not as special as it seems, since 
every countable model of Lo is isomorphic to Q). 


Lemma 4.7.4 Let Tq denote the theory of Dedekind sections of Q. The classi- 
fying topos Set[T4] 1s of the form Sh(Y), where Y is the locale corresponding to 
the frame F(R) (sometimes called the frame of formal reals) which is generated 
by symbols (p,q) where p and q are rationals with p < q, subject to the following 
relations: 


(max{p), p2}, min{q, q2}) if max{pi, p2} < min{qi, 42} 
al otherwise; 


(pi, 91) A (P2, 92) = | 


(p,q) = (p,q) V (p1,4) fosn<usG 
(p,q) = Vid) |p <p <q <a} ; and 


T =\1p.a) lp <q}. 


Moreover, this locale is spatial, and it may be identified with the space R equipped 
with the usual (Euclidean) topology. 


Proof As in 4.7.2, the first part is merely an explicit description of the 
Lindenbaum algebra of Ta. The latter is generated by primitive propositions 
of the forms (p € "L7) and (q € TU) (p,q € Q); the generators (p,q) which 
we have used for F(R) are simply the conjunctions ((p € "L7) A (q € "U’)). 
(For some purposes it is convenient to allow the use of the symbol (p,q) when 
p > q; of course, in this case, it is equal to the bottom element of the frame, 
by the disjointness axiom.) We can recover the original generator (p € 'L) as 
the join V{(p, q) | q > p}, and similarly for (q € "U™); so the elements (p, q) do 
indeed generate the frame. Then the first condition in our presentation becomes 
a translation of the axioms saying that L is a lower section and U is an upper sec- 
tion; the third translates the condition that they are open, and the fourth is the 
condition that they are both inhabited. Finally, the second condition expresses 
the ‘Dedekind version’ of the condition that they are at zero distance apart. 
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The real meat of the lemma, once again, comes in the assertion that the 
locale defined by this frame presentation is spatial. Of course, the points of 
the locale may be identified with Tg-models in Set, that is with real numbers 
(Dedekind sections of Q) in the classical sense; and the topology induced upon 
them is that generated by the rational open intervals {x € R |p < « < q}, 
which are the images of the formal symbols (p,q) — that is, the usual topology 
on R. The problem is thus to show that the subframe of P(R) generated by these 
subsets satisfies only those conditions which are deducible from those in the given 
presentation of F(R). We note that the first condition in the presentation enables 
us to reduce any element of F(R) to a (possibly empty) join of generators, so as 
in 4.7.2 we are reduced to considering inequalities of the form (p,q) < V{(pi, %) | 
ie I}or T < V{(pi, ai) | 1 €I}. We shall deal only with the first of these; the 
second is similar. 

The key ingredient that we need is the Heine-Borel theorem; that is, the 
assertion that every bounded closed interval in R is compact. Given this, we 
see that if an actual interval {x € R | p < x < q} is covered by the intervals 
{x | pi < £ < qi}, then for any (p’,q’) with p < p' < q' < q we must have a finite 
subcover of the closed interval {x | p' < x < q'}, and hence we can find a finite 
sequence of indices 7,,...,in with pi, < p’, qin > g and pi}; < qi for all j. 
But these inequalities, plus the second and third conditions in the definition of 
F(R), ensure that the inequality (p,q) < \VV{(pi, qi) | i € I} holds in the latter. 

O 


The use of the Heine-Borel theorem in the foregoing proof is unavoidable. 
Although this theorem does not make any use of the axiom of choice, it does 
require the law of excluded middle; we shall see in 4.7.13 below that there are 
toposes in which it fails for the internal space of Dedekind reals. On the other 
hand, the locale Ry of formal reals (the locale corresponding to the frame F(R), 
as constructed in a topos) is constructively locally compact; more specifically, 
for any two rationals p and q, the quotient frame F([p, q]) of F(R) obtained by 
adjoining the relations (r,s) = L whenever s < p or r > q (the ‘formal closed 
interval [p,q] +’) is compact (see [520, IV 1.2], for a proof). Hence, if we construct 
the locale Ry inside the topos of 4.7.13, it will not be spatial. 


Corollary 4.7.5 Let X be a (sober) topological space. Then, in the topos 
Sh(X), 
(i) Ra is the sheaf whose sections over an open V C X are all continuous 
functions f: V > R; 
(ii) Rm is the sheaf whose sections over V are pairs Ca, of functions V — R 
satisfying 
f(z) = inf{sup{ f(y) | y € U} | U € Nz} 


and 


f(x) = sup{inf{f(y) |y EU} |U E€ Nz} 


for allz € V, where N, denotes the filter of neighbourhoods of x in V. 
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Proof (i) This follows from 4.7.4 just as 4.7.3(i) followed from 4.7.2. 

(ii) A section of Rm over V is, in particular, a pair of proper one-sided sec- 
tions of Q in Sh(V), and hence corresponds to a pair of functions f: V — R and 
f:V — R which are respectively lower semicontinuous and upper semicontinu- 
ous. The ‘MacNeille form’ of the condition that L and U are at zero distance 
apart translates easily into the conditions on f and f given in the statement; but 
these conditions imply the required semicontinuity properties of the two func- 
tions, and so we do not have to state them explicitly. O 


Of course, a pair of functions (f P as in 4.7.5(ii) corresponds to an element 
of Ra(V) C Rm(V) iff the two functions coincide (in which case they are neces- 
sarily two-sidedly continuous). On the other hand, it is easy to give examples of 
MacNeille reals which are not Dedekind reals; for example, the pair of functions 
(f, f): R =R defined by f(z) = f(z) = 0 for all x < 0, f(z) = f(x) = 1 for 
x > 0, f(0) = 0 and f(0) = 1. We shall revert to the question of when R4 and 
Rm coincide in 4.7.11 below. 

Our next task, however, is to extend the order and algebraic structures from 
Q to R. This may be done either for Ra or for Rm, by essentially the same 
constructions; although we are mainly interested in Ra, we shall give the con- 
structions for Rm, since they may then easily be restricted to Ra. If t is a term 
of type Rm or Ra, we shall write L; and U; for the terms of type PQ which 
should properly be written fst("2 (¢)) and snd("7 (t)), where i: R — PQ x PQ 
is the canonical inclusion. We deal first with the order structure: we have two 
ways of defining an order relation on MacNeille sections, the strong ordering 


((z < y) dF (Sp: Q)((p € Uz) A (p E€ Ly))) 


and the weak ordering 


((z < y) dF (Vp: Q)((p € Lr) >(p € Ly))) . 


(The apparent asymmetry in the definition of < is only apparent: because U; is 
recoverable from L,, as we noted earlier, the condition Lz C Ly is equivalent to 
U, C Uz.) It is straightforward to verify that both these relations are transitive, 
that the first is irreflexive, and that the second is reflexive and antisymmetric. 
Also, the first relation is a two-sided ideal in the second (that is, (x < y) implies 
(x < y), and (x < y < z < w) implies (x < w)); once again, this is easily verified 
from the definitions. And for variables x, y, z of type Ra (though not, in general, 
of type Rm) we have 


((£ < y) Fr,y,z (2 < z) V (z < y))) ; 


for from (x < y) and the openness of the sections defining x and y we can deduce 
the existence of rationals p < q with ((p € Uz) A (q € Ly)), and then we must 
have either (p € L+) or (q € Uz). 
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On the other hand, it is not true that (x < y) is equivalent to 
((£ < y) V (x = y)), nor do we have (Vz, y)((z < y)V(y < z)), even for Dedekind 
reals x and y. For a counterexample to the first in Sh(R), take x to be the iden- 
tity function R — R and y to be the absolute-value function; then the truth-value 
of (x < y) is the whole of R, but that of ((z < y) V(x = y)) is R \ {0}. Similarly, 
for the second we take z to be the identity function and y to be the function 
tt» —t (or the constant function with value 0). However, we do have one useful 
relationship between the two orderings: 


Lemma 4.7.6 In any topos, we have ((x < y) Fs ~(y < x)) for variables 
x,y of type Rm. Hence in particular the objects Rm and Ra are —7-separated in 
any topos. 


Proof Let x and y be MacNeille reals. Clearly, from (x < y) and (y < zx) 
we deduce (3p)((p E Uy) A (p € L,)), contradicting the disjointness axiom; so 
(x < y) implies -(y < x). Conversely, =(y < x) is equivalent to (Vp)7((p € Uy) 
A (p € Lz)), so from (q E€ Lz) we deduce (Sp)((p > q) A (p € Lz)) by openness, 
and hence (Jp)((p > q) A -(p € U,)). But the latter implies (q E€ Ly) by the 
definition of a MacNeille real; so we have verified that Lẹ C Ly, i.e. that (£ < y). 

The second assertion follows immediately from the first and A4.3.6(a), since 
the formula (x = y) whose interpretation is the diagonal subobject of Rm x Rm 
(or of Ra x Ra) is equivalent to ((x < y) A(y < x)) and hence also equivalent to 
“(a < y) V (y < 2)). o 


We shall sometimes write (x # y) for the relation ((x < y) V (y < x)) on Ra, 
and call it the apartness relation on Ra. In general, a symmetric binary relation 
# on an object X is called an apartness relation if it satisfies 


((z = y) dkay n(x #y)) 


and 
((1#y) Fry, ((1#2z)V (Y#2))) - 


(For Ra, the second of these conditions follows from the condition ((x < y) Fz,y,z 
((x < z) V (z < y))), which we established earlier.) Objects with apartness rela- 
tions have been extensively studied in the context of constructive mathematics, 
as a useful generalization of decidable objects (cf. [90, p. 22]); the fact that Rg 
comes equipped with a ‘natural’ apartness relation means that results established 
for such objects can be applied to it. 

On the other hand, neither R,, nor Rg is decidable, in general: once again, 
a counterexample to ((x = y) V —(z = y)) may be given in Sh(R) by taking 
x and y to be the identity and absolute value functions. Nor is either object a 
—-+sheaf, in general. For if we weaken the definition of a two-sided section of Q 
by replacing the positive condition that L and U should be inhabited with the 
negative condition that they should be nonempty (i.e. =(Vq: Q)7(q E 'L"), etc.), 
we obtain objects *Rg and *Rm in which Rg and Rm are ——-densely embedded 
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(and which are still ---separated, by the same proof as above), but which do 
not coincide with Ra and Rm in general; for example, in Sh(R) the function 
(x ++ 1/|z|) (undefined at x = 0) yields a global section of *Rg which does not 
lie in Rg or even in Ry. 

In general, Ra enjoys more good properties than Rm: we have already seen 
that it has an apartness relation, and we shall see shortly that it has better 
algebraic properties as well. However, there is one respect in which R,, is better, 
namely order-completeness. Of course, neither Ra nor Rm is order-complete in 
the sense in which we usually use that term, since they lack top and bottom 
elements. One way to get round this would be to consider a closed interval 
[a,b] instead of the whole real line; alternatively we can modify the definition 
of completeness. We shall say that a poset X is conditionally complete if every 
inhabited subset of X which has an upper bound has a least upper bound; this 
condition is also sometimes known as Dedekind-completeness, but since Rg does 
not satisfy it in a general topos that name would be confusing in the present 
context. 


Lemma 4.7.7 In any topos, Rm is a conditionally complete poset. 


Proof Let S be a bounded inhabited subset of Rm. To construct a least upper 
bound z = (L,U) for S, we take U to be the interior of the intersection of all 
the upper sections of the members of S, i.e. 


U = {q:Q| (Ap <a)(Vy: S)\(p € U,)} . 


For L, we might be tempted to take simply the union of the lower sections of 
the members of S, but this will not yield a MacNeille real in general, so we have 
to define it from U: 


L= {p: Q | (dq: Q)((p < q) N7(q € U))} 


Verification of most of the conditions for (L, U) to be a MacNeille real is straight- 
forward (for example, inhabitedness of L and of U follow respectively from the 
conditions that S is inhabited and that it is bounded above - note that L cer- 
tainly contains the union of the left sets of the members of S). The only one 
which requires some work is 


(p< q)A7(p € L))=>(q4€U)); 


given p and q satisfying the hypotheses of this implication, let r = (2p +q)/3 and 
s = (p+ 2q)/3, so that p < r < s < q. Then from the definition of L we obtain 
a-(r € U). But since U G U, for all y € S, we have U N Ly = O for all such y, 
and so this implies ~(r € Ly) for all such y. Hence we have (Yy: S)(s € U,), and 
so q € U, as required. 

We have also observed that L} G L for all y € S, so x is an upper bound 
for S. But if z is any upper bound for S, then its upper section U, is open and 
contained in U, for all y € S, so it must be contained in U; hence z < z. O 
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It follows straightforwardly from 4.7.7 that any bounded closed interval 
[a,bļm C Rm is a complete poset in the usual sense. However, we shall see 
in 4.7.11 below that Ra is not conditionally complete in general. (On the other 
hand, we note for future reference that Rg does have binary join and meet oper- 
ations — we shall denote them by max{z,y} and min{z, y}, as is customary — 
which are obtained simply by taking unions and intersections of the appropriate 
sections.) 

Next we consider the algebraic structure on Rg and Rm. In fact the same 
definitions work in both cases; until further notice, we shall cease to specify 
whether we are working with Dedekind or MacNeille reals, and write R to denote 
either Rg or Rm. Defining addition is straightforward: we set 


(Ly, U;) + (L2, U2) 
= ({qı + 2 | (qı © Li) A (q2 € L2)}, {q1 + G2 | (qı E€ Ui) A (G2 E U2)}) 


and verify that this is a Dedekind (resp. MacNeille) real if both (L,,U,) and 
(L2, U2) are. It is also easy to verify that this operation makes R into an abelian 
group, the additive inverse of (L,U) being ({-q | q € U},{-q | q E€ LJP 
However, multiplication is a little more tricky, because the classical subdivision 
into cases according to the signs of the numbers being multiplied will not work 
in our constructive context, where we do not have a ‘trichotomy law’ for the 
ordering. To get round this difficulty, we use an idea based on J. H. Conway’s 
recursive definition of multiplication [248], which was first suggested in [504]. We 
first define the product of a real number and a rational (that is, we make R into 
a module over Q), for which we may legitimately subdivide into cases since the 
ordering on Q satisfies the trichotomy law: 


({q-r|r Ee Lr}, {g r|rEeU:}) ifq>0 
q:x= £ ({q-r|rE€Uz} {qg-r|re L} ifq<0 
0 ifq=0. 


It is not hard to verify that this does indeed satisfy the identities for a Q-module. 
Now we exploit the idea that if q) — zı and q2 — x2 have the same sign then 
(xı — qi)(Z2 — q2) should be positive; that is, we define Lz,.z, to be 


{a: Q | (Agi, 92)((((q1 E La, ) A (q2 E€ Lez)) V ((q1 € Uz1) A (q2 € Uzz)))A 
(q+ qı: q2 E Lgy-2,4+4)-22))} 
with a similar definition for Us, r, (in which the rationals qı and q2 are taken 


from the sections on opposite sides of xı and x2). A large amount of tedious but 
straightforward verification, which we omit, allows us to conclude 


Proposition 4.7.8 The above definitions of addition and multiplication make 
R into a commutative ring. O 
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Next, it is natural to ask in what sense, if any, the real number objects 
are fields. The example of a spatial topos, in which the stalks of the sheaf of 
continuous real-valued functions are well known to be local rings but not fields 
in general, shows that we cannot hope for even Rg to be a field in the coherent 
sense defined in 1.1.7(h), although we can hope to show that it is a local ring 
in the sense of 1.1.7(g). In fact both Ra and Rm are fields in a constructively 
weaker sense: we say that a commutative ring is a residue field if it satisfies the 
nontriviality axiom ((0 = 1) + L) and the non-coherent sequent 


(>(3y) (ay = 1) Fe (z = 0)). 


(The reason for the name ‘residue field’ is that if we attempt to form the residue 
field of a local ring — that is, if we factor out the ideal of non-invertible elements — 
what we obtain is exactly a ring satisfying this axiom.) 

In order to prove that Rg and Rm are residue fields, we need to determine 
their invertible elements. The key step in this is the following: 


Lemma 4.7.9 We have 
(Vz, y: R)((z-y > 0) => (((z > 0) A (y > 0)) v ((x < O)A(y<9)))). 


Proof The hypothesis that 0 € Lz., tells us that we have either q E€ Lz and 
r E€ Ly with q-r € Lay4rz, or else q € Uz and r € U, satisfying a similar 
condition. The analysis of the two cases is similar; each proceeds by considering 
the possible signs of q and r, and we shall do only the first case. Clearly, if q > 0 
and r > 0 then we have z > 0 and y > 0, so we need only consider the sub-cases 
when one of q and r is strictly less than 0. If q < 0 = r, then the condition 
becomes 0 € Lg, which implies 0 € U,; but this contradicts 0 = r € Ly. If 
q <0 < r, then we have s € U, and t € Ly such that q-r =q-s+r-t, or 
equivalently q-(r — s) = r-t; but q - (r — s) > 0 since both q and (r — s) are 
negative, and hence t > 0. But this forces z > 0, since t € Lz, and we already 
have y > 0 since r € L}. Finally, we need to deal with the case when both q and r 
are negative. In this case, we obtain s € U, and t € U, such that q-r =q-s+r-t, 
from which we get r -t = q- (r — s) > 0 and hence ¢ < 0, and similarly s < 0. 
But this forces both z and y to be negative. 0O 


Proposition 4.7.10 
(i) In both Ra and Rm, we have 


((dy: R(x- y = 1) dF, ((x > 0) v (z < 0))). 


(ii) Ra and Rm are both residue fields. 


(iii) Ra is a local ring. 
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Proof (i) One direction follows easily from 4.7.9. Conversely, suppose z > 0; 
then we may define its multiplicative inverse to be the section 


(a: Q | (Ca < 0) V ((q > 0) A (1/q € Uz)))}, fa: Q | ((q > 0) A (1/4 E Le))}) 


(note that z > 0 implies 1/n € 'L,' for some natural number n, so that the 
upper section of this pair is inhabited). It is again straightforward to verify that 
this (is a section of the required type and) is a multiplicative inverse for z. 
Similarly if x < 0. 

(ii) follows immediately from (i) and 4.7.6. 

(iii) In Rg, we have (Vz)((z > 0) V (x < 1)), and hence by (i) we have 
(Vx)((Sy)(ry = 1) V (Ay)((1 — z)y = 1)). But this is equivalent to the local-ring 
axiom as stated in 1.1.7(g). o 


However, Rm is not a local ring in general (see 4.7.11 below). On the other 
hand, we can say a good deal more about the algebraic character of Ra: it is 
a ‘separably real-closed local ring’. This means that it satisfies the axioms of a 
coherent theory (which we shall not state here explicitly) whose models in Set are 
Henselian local rings with real-closed residue fields. (Once again, we note that the 
stalks of the sheaf of continuous real-valued functions on any topological space 
are well known to be rings of this type.) For details of the proof, see [611, 511). 


Now we revert to the question, which we deferred earlier, of when the objects 
Ra and Rm coincide. The answer turns out to be straightforward: 


Theorem 4.7.11 In a topos € with natural number object, the following 
conditions are equivalent: 


(i) E satisfies De Morgan’s law (cf. 4.6.2). 
(ii) The objects Ra and Rm coincide in E. 
(iii) Rm is a local ring. 
(iv) The relation (x < y) V (y < x) defines an apartness relation on Rm. 
(v) Ra is conditionally order-complete (cf. 4.7.7). 


Proof First suppose De Morgan’s law holds, and let x be a MacNeille real in 
€. To show it is a Dedekind real, suppose given rationals p,q with p < q; let 
r = (p+ q)/2. By disjointness of L, and U,, plus De Morgan’s law, we have 
(=(r € Uz) V 7(r € L,)); but since p < r the first disjunct implies (p € Lz), and 
the second similarly implies (q € U+). Thus (i) implies (ii); and the fact that (ii) 
implies (iii), (iv) and (v) is clear from the results already established about Rg 
and Rm. 

In the converse direction, the key idea is that if we are given a truth-value 
w: Q, we may construct a MacNeille real z(w) by setting 


x(w) = ({q: Q | ((¢ < 0)V((q < 1)A>>w))}, {4: Q | (a > 1) V((q > 0)A>w))}) - 
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It is an easy exercise to verify that z(w) is indeed a MacNeille real. However, if 
it were a Dedekind real then we should have ((0 € Lzw)) V (1 € Uzywy)), Which 
is equivalent to (“~w V ~w); since w was arbitrary, this shows that De Morgan’s 
law follows from (ii). Also, if the indicated relation were an apartness relation 
on Rm, then we should have ((0 < x(w)) V (x(w) < 1)) (since we certainly have 
(0 < z(w) < 1)), but this again reduces to the statement of De Morgan’s law. 
Similarly, if Rm is assumed to be a local ring, then either z(w) or 1 — z(w) must 
be invertible, which yields the same conclusion via 4.7.10(i). Finally, we note 
that the set 
Sw = {4: Q | (a = 0) V ((4 = 1) Aw))} 

is inhabited and bounded in Ra. But if this set has a supremum (y(w), say) 
in Ra, then we have (w F (y(w) = 1)) and (~w F- (y(w) = 0)), and so from 
((0 < y(w)) V (y(w) < 1)) we may deduce (=-w V ~w). Thus, once again, we 
obtain De Morgan’s law. 0 


We remark that the elements z(w) constructed in the proof of 4.7.11 are all 
idempotents in the ring Rm; for they satisfy ~~((x(w) = 0) v (z(w) = 1)) and 
hence ~~(x(w)? = x(w)), from which the result follows by —~~—-separatedness 
of Rm. In fact the Boolean algebra of idempotents of Rm is readily seen to be 
isomorphic to Q... (whereas, since Ra is a local ring, its algebra of idempotents 
is simply the two-element set {0,1}). This fact may be used in conjunction with 
4.6.2(vii) to give another proof that the equality Rg = Rm implies De Morgan’s 
law. Also, the failure of Rm to be an indecomposable ring, in general, may be 
exploited to give another construction of the Gleason cover yE described in the 
previous section: in fact (yE, Ra) is the Pierce representation of the ringed topos 
(E, Rm) (cf. B4.4.23). For more details, see [512]. 


Whilst we are on the subject of comparisons between different objects of real 
numbers, it seems appropriate to mention the other popular way of defining the 
reals from the rationals, via equivalence classes of Cauchy sequences. This leads 
to an object commonly called the Cauchy real number object (though ‘Cantor 
real number object’ would be more historically accurate; fortunately our notation 
Re will do for either.) When working constructively with Cauchy sequences, it is 
convenient to restrict attention to sequences that converge ‘at a uniform rate’: 
thus we define the object C C Q” of Cauchy sequences to be the interpretation 
of the term 


{f: Q™ | (Ym, n: N)((0 < m < n)>(|f(m) — f(n)| < 1/m))}. 


With this definition, we can then define the equivalence relation E C C x C of 
‘converging to the same limit’ by the term 


{(f.9):C x C | (Yn: N)((0 < n)=(If(n) — g(n)| < 3/n))}. 


It requires a little work to verify that E is indeed an equivalence relation (specif- 
ically, that it is transitive); but if we assume (f,g) € E, (g,h) € E and n > 0. 
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we have |f(n) — f(6n)| < 1/n, |f(6n) — g(6n)| < 1/2n, |g(6n) — h(6n)| < 1/2n 
and |h(6n) — h(n)| < 1/n, yielding the required inequality. The object Re is 
then defined to be the quotient of C by E; we may proceed to define the arith- 
metic operations and order relations on C (for example, we define addition by 
f+g=An. f(2n) + g(2n) to ensure that the sequence converges at the required 
rate), and check that they induce the appropriate structure on the quotient. We 
may also define a mapping h: C — Rg by the term 


Ha: Q | (An)(q < f(n) — 1/n)}, {a: Q | (An)(q > f(n) + 1/n)}) 


where f is a free variable of type C; again, it requires verification that this is 
indeed a Dedekind real, but if we are given p < q in Q then we can choose n with 
3/n < q — p, and then we must have either p < f(n) —1/n or q > f(n) +1/n. 
It is similarly straightforward to verify that we have 


(VF, g: CAF) = h(g))  ((f,9) € E)) , 


so that h induces a monomorphism R, — Ra. Moreover, this monomorphism 
preserves the algebraic and order structure. 

However, Re — Ra is far from being an isomorphism in general. The reason 
can be summarized in two words: ‘countable choice’. If we are in a context where 
the axiom of countable choice holds (in other words, a topos where the natural 
number object is projective), then we may express any Dedekind real as the 
limit of a Cauchy sequence of rationals, since for every Dedekind real x and 
every nonzero natural number n we may find a rational q with |x — q| < 1/n. 
(To do this, note first that since L, and Us are inhabited we may certainly find 
rationals p,r with p < z < r; then we may divide the interval [p,r] into finitely 
many subintervals of length at most 1/n.) But in a topos where the axiom is not 
valid, Re may be very much smaller than Rg. We give two examples to show the 
different types of behaviour that can occur: 


Examples 4.7.12 (a) Let € be the topos of sheaves on a locally connected 
locale X (or more generally any locally connected topos over Set). Then the 
inverse image functor y*: Set — E preserves exponentials (cf. C1.5.9), and so 
QN = *(QY) is the sheaf whose sections over an open U C X are simply the 
locally constant functions U — Q". Similarly, C is the sheaf whose sections 
over U are the locally constant Cauchy-sequence-valued functions on U, and 
E is also a constant sheaf; so we deduce that R. is simply the sheaf y*(R) of 
locally constant (Cauchy-)real-valued functions over X. In contrast, Ra may be 
substantially larger than y*(R), for example if X itself is R. 

(b) Let E€ be a localic Set-topos satisfying (SS), for example the topos of 
sheaves on a second countable zero-dimensional locale X. Then we saw in 4.5.16 
that we may ‘lift’ the axiom of dependent choices from Set to €, and hence 
prove internally that every Dedekind real in € is the limit of a Cauchy sequence of 
rationals; thus in such a topos we have R, = Rg. If € is the topos of 4.5.2(a), this 
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result is not surprising, since the corresponding locale has the property that every 
continuous real-valued function on it is constant; but there are other examples 
provided by 4.5.2(d), for example the topos Sh(Q) (where Q is topologized as a 
subspace of R), in which there are many continuous real-valued functions which 
are not locally constant, and so Re is very far from coinciding with y*(R) in 
this case. 


It can be shown that, for a locale X, Re coincides with y*(R) in Sh(X) 
iff, for every open U C X, the lattice of (relatively) clopen sublocales of U is 
closed under countable unions. (For a second countable locale X, this condition 
is equivalent to local connectedness.) For details of the proof, see [357]. 

The fact that, in a spatial topos, Re need not coincide with either the sheaf 
of continuous real-valued functions or the sheaf of locally constant real-valued 
functions means that it is generally regarded as less useful than Rg as a model 
of ‘what the real numbers ought to be’, even though its algebraic and order 
structure is in general rather easier to work with. Another reason for discarding 
Re is that, when we regard it as an internal topological space in € (equipped 
with the topology generated by rational open intervals, i.e. the image in P(R-) 
of the frame F(R) of formal reals defined in 4.7.4), it is not in general sober 
(cf. C1.6.8); whereas Ry, being by definition the space of points of the formal 
reals, is always sober. 


Example 4.7.13 This seems a good point at which to give our promised exam- 
ple of a topos E in which the Heine—Borel theorem fails, so that the locale Ry 
of formal reals does not coincide with the space Ra. The first such example was 
given by M. Fourman and M. Hyland [364]; we present a slightly different exam- 
ple due to A. Joyal, which constructs € as the classifying topos of a geometric 
propositional theory, namely the ‘theory of an open cover U of [0,1] with no 
finite subcover’. Actually, in order to say ‘with no finite subcover’ in a positive 
(geometric) way, we demand a stronger condition: namely, that each finite subset 
of U should have total Lebesgue measure strictly less than Ł, 

The primitive propositions of our theory T are formal expressions of the form 
((p,q) E€ U), where p and q are rationals with 07 < p < 1,0 < q < 1+ and 
0<q-p< ft (Note: for the sake of uniformity of notation, we write (07 ,q) for 
the half-open interval [0, q) = {x E R| 0 < x < q}, and similarly for (p, 1*).) For 
technical reasons, we wish our covering to be a lower set in the poset of rational 
open intervals, so we adopt the axioms 


(((p,q) € U) F ((p',q') €U)) 


whenever p < p’ < q’ < q. Next, we adopt the axioms 


(Awa) EUu)r i) 


1 
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for each finite sequence ((p1,q1),---,(Pn,Qn)) such that the Lebesgue measure 
of U; (pi, qi) C R is greater than or equal to ¿. Finally, for each real number z 
with 0 < x < 1, we adopt the axiom 


(TH \V{((p,4) €U) |p <2 <q}). 


Clearly, T has no models in Set, so its classifying topos will be of the form 
Sh(X) where X is a locale with no points; nevertheless, we shall see shortly that 
it is consistent. We shall write To for the subtheory of T defined by the first 
two groups of axioms, and Sh(Y) for the classifying topos of To; much of the 
argument which follows is concerned with determining the relationship between 
Y and its sublocale X. 

The frame O(Y) is easy to describe. Consider the poset P whose elements 
(which we shall call nodes) are formal finite conjunctions A;_,((pi,qi) € U), 
subject to the condition that UJ", (pi, qi) has Lebesgue measure less than $ — we 
refer to this condition by saying that the conjunction is consistent — and ordered 
by setting 


nm 


((r,,53) EU) < A (lpi qi) €U) 


i=1 


ha 


ii 


J 


iff, for each i, there exists j such that r; < pj < qi < s;. (Thus we move down- 
wards in the poset P by either increasing the number of terms in our conjunction, 
or enlarging the intervals to which they refer.) Then O(Y) is simply the frame 
of lower sets of P; equivalently, Sh(Y) is simply the functor category [P°P, Set]. 
Thus Sh(X) may be identified with Sh(P,T) for a suitable coverage T on P; 
equivalently, O(X) is the frame of T-ideals of P, in the sense of C1.1.16(e). 
Again, T is easy to describe: for each real number z € [0,1], each node n is 
covered by the set of all conjunctions n A ((p,q) € U) where p < x < q, and the 
extra interval (p,q) is small enough for this new conjunction to be consistent. 
Since the definition of consistency involves a strict inequality, it is clear that we 
can always find such an interval, for any node n and any z € (0, 1]; hence every 
cover in T is inhabited — equivalently, X is a dense sublocale of Y. In particular, 
this shows that X is nontrivial; so T is consistent, as we claimed earlier. 

But in fact X satisfies a much stronger condition than denseness in Y; it is 
actually a flat sublocale of Y as defined in C1.1.16(d), i.e. O(X) is closed under 
finite joins in O(Y) (which are simply unions, of course). To show this, it suffices 
by C1.1.16(e) to show that the covers in T are (upwards) directed; but this is 
easy, since if R is a cover of n and we have n A ((pi,qi) EU) E€ R for i = 1,2, 
then the node n A ((p3,q3) E U), where p3 = max{p,,p2} and q3 = min{qi, q2}, 
is an upper bound for them which lies in R (it is clearly consistent, since the 
interval (p3,q3) is contained in both (pi,qi) and (p2, q2)). 

Now we wish to identify the object Ra in Sh(X) (or at least the closed 
interval [0, iJa — Ra which is the interpretation of {x: Ra | 0 < xz < 1}). To 
do this, we first consider its global sections, that is the actual Dedekind real 
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numbers in Sh(X). We note that Y is spatial, and that it has a focal point 
in the sense of C1.5.6 (that is, a point whose only neighbourhood is the whole 
space), corresponding to the empty family of rational intervals. So the reflection 
of Y in the category of Hausdorff spaces is a singleton, and thus Dedekind real 
numbers in Sh(Y) are just constant real-valued functions on Y. We now appeal 
to Joyal’s lemma C4.1.15(ii): since the locale corresponding to the frame F((0, 1}) 
is compact Hausdorff, each morphism from X to it can be uniquely extended to 
a morphism from Y. Thus the Dedekind reals in Sh(X) which lie between 0 and 
1 are simply the constant real numbers in this interval. A similar argument with 
X and Y replaced by their open sublocales corresponding to a principal ideal 
| (n) in P (note that every principal ideal is a T-ideal) yields the same conclusion 
about the sections of [0, 1]q over any such open in X: that is, [0, 1]a, considered 
as a T-sheaf on P, is no more than the constant functor with value (0, 1] in Set. 
(Again, it is not hard to verify directly that this functor is a T-sheaf; but of 
course that on its own would not suffice to identify it with the object [0,1] as 
constructed in Sh(X).) So the generic T-model in Sh(X) really does define an 
open cover of the closed interval [0,1]q in the sense of this topos; and it clearly 
has no finite subcover. 


Remark 4.7.14 Even if Ra is not the whole of R;, it may easily be seen to 
be strongly dense in the latter, in the sense introduced in C1.1.22: whatever the 
extra covers are that we have to add to the presentation of F(R) to obtain O( Ra), 
they are certainly inhabited, since each rational open interval (p,q) contains 
points. Also, Rg is an open locale in the sense of C3.1.16, because it is spatial. 
It follows from the ‘closed subgroup theorem’ C5.3.2 that it cannot be a localic 
subgroup of Ry unless it coincides with Ry: that is, the addition operation for 
Dedekind reals, which we (implicitly) defined on the spatial product (Ra x Ra)p 
(we didn’t actually verify that it was continuous, but that is easy), cannot be 
extended to the locale product. (On the other hand, if Ra is locally compact, 
then the spatial and localic versions of Ra x Ra coincide, by C4.1.8, and so Ra 
is a localic group; this provides another proof that the Heine-Borel theorem is 
sufficient for the isomorphism Ra = Ry.) This is another reason for preferring 
Ry to Ra in contexts (such as that just given) where the two differ. 


Finally in this section, we describe a universal characterization of the 
Dedekind reals (or rather, of the closed unit interval therein), due to P. Freyd. 
We consider posets P equipped with top and bottom elements 1 and 0 which are 
DM-separated in the sense that we have 


(Vp: P)(-(0 = p) V ~(p = 1)) ; 


the reason for the name is that, if De Morgan’s law holds, this is equivalent to the 
simpler condition =(0 = 1), but in general it is strictly stronger. In particular. 
we note that this condition holds for the subobject classifier Q iff De Morgan’s 
law holds in our topos £. (It does, however, hold for the Dedekind unit interval 
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(0, 1]a C Ra in any topos, since we have (Vr: Ra)((0 < x) V (x < 1)).) We write 
dms(€) for the category of posets in E satisfying this condition (the morphisms 
being maps preserving order and the two distinguished elements). 

Classically, the ordinal sum or ‘ordered wedge’ of two posets P, Q with top 
and bottom elements is defined to be the poset obtained from the disjoint union 
of P and Q by identifying the top element of P with the bottom element of Q. 
It is well known that it can also be identified with a sub-poset of P x Q, namely 


{(p,q): Px Q| ((p=1) V(O0=q))}. 


In our constructive context, we have to modify this definition slightly: we define 
the ‘thick ordered wedge’ P 6 Q to be 


{(p,q): P x Q| (p = 1) (0 = q)) A (70 = 4) > (p= 1))} . 


Clearly, each of the two conditions in the definition of P 6 Q is implied by 
((p = 1) V (0 = q)), so the ‘classical ordered wedge’ is contained in this one: we 
can think of it as obtained from the classical ordered wedge by ‘thickening up’ 
the latter around the point (1,0) to make the two parts fit together more tightly. 
Note also that, if P and Q are equipped with apartness relations # such that 
equality is the negation of apartness (for example, if they are closed intervals in 
Ra), then each of the conditions in the definition of P @ Q is equivalent to 


“((p# 1) A (0#4)), 


since the latter is clearly equivalent to ((p#1) = (0 = q)) and hence (since 
the truth-value of (0 = q) is =—-closed) to (~7(p# 1) > (0 = q)). However, in 
general neither of the two conditions in the definition implies the other. 

Since the top and bottom elements of P ® Q are (1,1) and (0,0), it is clear 
that it is a DM-separated poset if either P or Q is DM-separated, and hence that 
the mapping (P,Q) — P Ð Q defines a bifunctor dms(€) x dms(€) — dms(£). 
We shall be concerned with the ‘diagonalization’ of this bifunctor, i.e. the functor 
P= PoP. 

Freyd’s characterization asserts that [0, 1]4 is a terminal coalgebra for this 
endofunctor of dms(€). In order to prove this, we first need to show that it is a 
coalgebra (for which the structure map [0, 1]a — [0, 1]4@[0, 1]a is an isomorphism, 
cf. Al.1.4). But since any closed interval in Ra is isomorphic to any other, it will 
suffice to prove 


Lemma 4.7.15 For any three Dedekind reals a,b,c with a < b < c, we have 
[a,c] S [a,b] ® [b, c]. 

This lemma is, of course, the reason why we had to adopt the ‘thickened’ 
definition of the ordered wedge; it fails to hold for the classical definition. 


Proof For convenience, we shall take a,b and c to be —1,0 and 1 respec- 
tively. The maps establishing the isomorphism are then easy to define: they are 
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(x — (min{z, 0}, max{x,0})) and ((y,z) + y+ z). We have to show that the 
first map does indeed take values in [—1,0] ® [0, 1], and that the two maps are 
inverse to each other. 
The first statement is easy to verify, since Rg has an apartness relation: it is 
enough to verify 
(Va: [-1, l])A((z < O)A(O<2)). 


But this is immediate from the definition of <. The identity z = min{z,0} + 
max{z,0} holds for all z, not just for those in [—1,1]. And if y < 0 < z, 
then we certainly have y < y+ z and hence y < min{y + z,0}; but if we 
had y < min{y + z,0}, then we should have y < 0, and also y < y+ z and 
hence 0 < z, which contradicts the assumption that the pair (y, z) belongs to 
[—1, 0] @ [0, 1]. Thus we have y > min{y + z,0}, and hence y = min{y + z, 0}. 
Similarly, z = max{y + z,0} whenever (y, z} € [—1,0] © (0, 1]. o 


For future reference, we note that the coalgebra structure on [0,1] which we 
obtain from 4.7.15 is given by the mapping 


x |> (min{2z, 1}, max{2z — 1,0}): [0,1] — [0,1] ® [0,1] 


(and its inverse is given by ‘averaging’: (y, z) + (y + z)/2). 

Now consider an arbitrary coalgebra (c: X — X @ X). It will be convenient 
to consider not c itself but its two coordinate projections d,u: X = X: note that 
these maps are order-preserving and satisfy 


(Yz: X)(A(dz = 1)= (0 = uz)) 
and the dual condition. We first need to verify 
(Vx: X)((0 = uz) v ((0 = uuz) A (ddz = 1)) V (dz = 1)) (*) 


For this, observe that for any x we have (=(0 = duz) V 7(dux = 1)), since 
X is DM-separated; from —(duz = 1) we may deduce (0 = uuz), and from 
~(0 = duz) we may deduce —(0 = ux) (since d preserves 0) and hence (dz = 1). 
Combining this with the similar deduction from (=(0 = udz) V 7(udz = 1)) 
yields the displayed formula above. 

To define a coalgebra homomorphism f: X — [0,1], we suppose given an 
element x of X, and proceed to construct a Dedekind real (Lz, Uz) as follows. 
We consider the binary tree T whose nodes are all formal finite strings of d’s 
and u’s (including the empty string []}), ordered by v < w iff w = sv for some 
string s; given such a string w, we write wz for the element of X that we get by 
evaluating this string at x. We also associate with each node w a dyadic rational 
Tw (that is, one whose denominator is a power of 2) in (0,1), in a straightforward 
way; namely rj] = 3 and, if w is a string of length n > 0, then ray = ry —1/2"*? 
and Tuy = Tw + 1/2"+?. (Equivalently, rẹ is the unique real number mapped 
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by w to ł in the coalgebra structure already described on [0,1].) Clearly, every 


dyadic rational in (0,1) occurs as r,, for a unique node w. 

We now construct (Lg, Uz} by the following recursive procedure. Initially Ly 
contains all rationals less than 0, and Uz all rationals greater than 1. We work 
up the binary tree examining each node in turn; by ‘examining a node’ w, we 
mean asking the two questions ‘Is 0 = uwz?” and ‘Is dwz = 1?’. If 0 = uwr, 
we put all rationals greater than r,, into the set Uz, and if dwz = 1 we put all 
rationals less than rą into Lz. Formally, we have 


Le = {q: Q | (4 < 0) V (3w: T)((dwz = 1) A (q < rw))} 


with a similar definition for Uz. 


Lemma 4.7.16 The mapping x ++ (Lz,Us} defined above is a coalgebra 
homomorphism f: X — [0, 1]a. 


Proof First we have to verify that (Ls, Ux} really is a Dedekind real (it clearly 
lies in [0, 1]). It is also clear that L, is an open lower section and U, is an open 
upper section, so we have to verify that they are disjoint and satisfy the ‘zero 
distance apart’ condition. For disjointness, we have to show that if ry < ry 
(for two nodes v,w of T), then we cannot have both 0 = uvg and dwz = 1. It 
is clear that this cannot occur if v is an ancestor of w (that is, if w = sv for 
some string s); for then the condition r, < Tw would imply that uv was also 
an ancestor of w and so 0 = uvr would imply 0 = wz and hence 0 = dwz. 
Similarly if w is an ancestor of v. So suppose neither is an ancestor of the other, 
and let z be their longest common ancestor; again since Ty < Tw, v must have 
the form sdz and w must have the form tuz for some strings s,t. Now we have 
(=(0 = wwr) V a(uvx = 1)); but from —=(uvz = 1) we can deduce —(dzx = 1) 
since the composite us preserves 1. Similarly, from —=(0 = dwz) we could deduce 
~(0 = uzz); but we cannot have both of these since the pair (dzz, uzxr) belongs 
to X ® X. So we must have either =(0 = uvr) or a(dwz = 1). 

For the ‘zero distance apart’ condition, we use the displayed formula (*) 
above: the point of this formula is that it assures us that, by the time we have 
examined all nodes of length at most n, we shall have ‘tied down’ fz to lie in 
an interval of length at most 1/2”. Thus, given rationals q and r with q < r, we 
choose n such that 1/2” < r — q, and then by examining all nodes of length at 
most n we shall succeed in establishing either (q € Lz) or (r € Uz). 

Next, we must verify that f is order-preserving. But this follows easily from 
the fact that d and u are order-preserving: if zı < z2 in X, then for any word 
w € T we have wz, < wz, and so wz, = 1 implies wx2 = 1; thus we obtain 
Lz, C Lr. A similar argument shows that f preserves 0 and 1. 

Finally, we must show that f is a coalgebra homomorphism. But it is easy 
to verify that for any node w of T we have rwa = Tw/2, and hence that f(dz) = 
min{2f(x), 1}, as required (and similarly f(uz) = max{2f(z) — 1,0}). O 
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Theorem 4.7.17 In any topos €, the Dedekind unit interval {0,1]q is a 
terminal coalgebra for the endofunctor X ++ X ® X of dms(€). 


Proof After the last two lemmas, we have only to verify that the coalgebra 
homomorphism f constructed above is unique. Suppose we have another such 
homomorphism g, and suppose we have an z: X such that (fx < gx). Choose 
a dyadic rational r such that fx < r < gz, and let w be the unique node of 
T such that r, = r. Then we have 0 = uwz since r € Uz; however, since g 
commutes with u and d, it is easy to see that we have 0 < g(uwz), since r = ry 
is the largest element of [0,1] such that uwr = 0. Thus we have contradicted our 
supposition; that is, we have proved 


(Va: X)A( fa < gz). 


Similarly, we may prove (Vz: X)7(gx < fx); so we conclude (Vx: X)(fx = gz), 
ie. f =g. O 


Remarks 4.7.18 (a) It will be observed that the order-relation on the posets 
in dms(€) does not play any essential rôle in the foregoing argument. Thus 
we could alternatively work in a category whose objects are simply objects of € 
equipped with two distinguished elements which are DM-separated, and describe 
(0, 1]q as the terminal coalgebra of the ‘thick wedge’ endofunctor of this category. 

(b) It is also interesting to note that, if we weaken the DM-separatedness 
condition to mere separatedness (that is, to the condition ~(0 = 1)), then the 
terminal coalgebra for the corresponding endofunctor of our enlarged category 
is simply the MacNeille unit interval [0, 1],,. Once again, most of the preceding 
proof goes through in this case without significant modification. 


Suggestions for further reading: Banaschewski & Mulvey [61, 62], 
Dubuc [301], Fourman [357], Fourman & Hyland [364], Kennison & 
Ledbetter [598], Negri & Soravia [890], Reichman [1001], Rousseau [1071], 
Stout [1123]. 


D5 


ASPECTS OF FINITENESS 


D5.1 Natural number objects revisited 


The definition and some of the basic properties of natural number objects in 
a topos were introduced in Section A2.5. However, there were some important 
properties which could not be developed there because they required the ability 
to interpret higher-order logic in a topos. Having developed the techniques for 
this in the previous chapter, we are now ready to return to the study of natural 
number objects and complete what we left unfinished in Section A2.5. 

Let us, for the time being, write T for the algebraic theory freely generated 
by one nullary and one unary operation (so that a T-algebra is simply an object 
A equipped with morphisms a: 1 — A and t: A — A). We recall that, if (NV, 0, s) 
is a natural number object in a topos € (that is, an initial object in the category 
of T-algebras in £), then we have two colimit diagrams 1 — N — N and 
N 3 N —> 1 (A2.5.5), and N satisfies the ‘Peano postulates’ that s is monic, o 
and s are disjoint subobjects of N, and any (o, s)-closed subobject of N is the 
whole of N (A2.5.8 and A2.5.9). Our first objective in this section is to prove 
that the converses of these results are valid. For the moment, we shall call a T- 
algebra (N, 0, s) a Lawvere natural number object if it satisfies Definition A2.5.1, 
a Freyd natural number object if the two diagrams of A2.5.5 are colimits, and a 
Peano natural number object if it satisfies the Peano postulates. 

Given a T-algebra (A,a,t), we shall say a subobject A’ — A is closed (or 
(a, t)-closed, if it is necessary to specify the structure maps) if it is a sub-T-algebra 
of A, i.e. if there exist morphisms a’ and t making 


1 1 


t 
j > Ae — A’ 


ae 


commute. And we shall say that A’ is recursive if, in addition, the pair (a’, t’) is 
jointly epic. 
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Lemma 5.1.1 Any T-algebra contains a unique smallest closed subobject, 
which is recursive. 


Proof It is easy.to see that an arbitrary (internal) intersection of sub-T- 
algebras is a sub-T-algebra (cf. 5.3.2 below), so we may obtain the smallest one 
by forming the intersection of all of them. Also, if (A,a,t) is a T-algebra, the 
subobject which is the union of the images of a and t is (a,t)-closed, so if A is 
minimal in the above sense this subobject must be the whole of A. o 


Theorem 5.1.2 For a T-algebra (N,o,s) in a topos E, the following are 
equivalent: 
(i) (N,0,s) is a Lawvere natural number object. 
(ii) (N,0,s) is a Freyd natural number object. 
(iii) (N,o, s) is a Peano natural number object. 
Proof (i) > (ii) and (i) = (iii) were proved in Section A2.5. 

(ii) = (iii): Clearly, the first of Freyd’s two conditions implies that s is monic 
and its image is disjoint from o. So we have only to verify the induction postulate. 
Suppose m: N’ — N is a (o, s)-closed subobject of N; by 5.1.1, we might as well 
assume that N’ is actually recursive. Now consider the binary relation S — 
N x N which is the symmetrization of (1, s): N — N x N, i.e. the union of this 
relation and its opposite. The assertion that N’ is (o, s)-recursive implies that 
the formula 


(Va,y: N)(((z,y) ETS) => (z ETN) (ye TN") 
is valid. Now let T — N x N be the interpretation of the closed term 
{z: N x N | (Vw: PN)((w is S-closed) = ((fst(z) € w) (snd(2) € w)))}, 
where ‘w is S-closed’ is an abbreviation for the formula 
(vz, y) (((2,y) €7 5) = ((e Ew) (ye w))) 


It is clear that T is an equivalence relation, and that it contains S; so it must 
contain the kernel-pair of the coequalizer of S =3 N. But this coequalizer is 
N — 1, since S contains the graph of s; so T is the whole of N x N. Also, since 
N' is S-closed, it is also T-closed, i.e. 


(vz, y: N)(((z,y) €°T) = (ETN NS(y ETN"); 
but since (o € TN") holds, it follows that we have 
(Vz: N)(x € TN"), 


i.e. N’ is the whole of N. 
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(iii) > (i): Suppose given a T-algebra (A, a,t). Applying 5.1.1 to N x A, we 
obtain a diagram 


| eee 7 pr eae eee 2 


(p,q) (p,q) 


sxt 
N x A <«— NxA 


where (b, u) is jointly epic. We shall show that (p,q) is the graph of a morphism 
N — A, i.e. that p is an isomorphism. 

It is clear that p is epic, since its image is a closed subobject of N. To show 
that it is monic, consider the subobject N’ — N which is the interpretation of 
the term 

{a: N | (Vy, y": B)(py = z = py')>(y=y')}; 


we shall show that N’ is (o, s)-closed, and so is the whole of N. First, o € "N" 


because the square 
1 B 
o 


1 ————_> N 
is not merely commutative but a pullback, since B is the union of b and the 
image of u, and the latter is mapped by p into the subobject s which is disjoint 
from o. Similarly, the square 


ven SEES 


B' >——> B 


A 


P p 


NS Sy 
is a pullback, where B’ is the image of u (note that we don’t yet know that u 
is monic, but it doesn’t matter) and p’ is the unique factorization of p through 
B -» B' (which exists because s is monic). Thus from (py = sz = py’) we may 


deduce 
(Ga B) (a= a) Nae =r yA Oe =e =P )), 


and hence we obtain (x €"N")>(sx € TN"). 
Thus we have verified that (N,o,s) satisfies the ‘existence’ half of Defi- 
nition A2.5.1. For the uniqueness, suppose f,g: (N,0,s) = (A,a,t) are two 
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T-algebra homomorphisms; then the equalizer of f and g is a (o, s)-closed sub- 
object of N, and so f =g. C] 


Corollary 5.1.3 A topos E has a natural number object iff it contains an object 
A equipped with a monomorphism t: A — A and a well-supported subobject B — 
A disjoint from t. 


Proof First suppose there exists a morphism a: 1 — A factoring through B. 
Then, applying 5.1.1 to the T-algebra (A, a, t), we obtain a subobject of A which 
is readily seen to be a Peano natural number object. In general, we may apply 
this argument to the object B*(A) of €/B to prove that the latter topos has a 
natural number object; but since B is well-supported the canonical geometric 
morphism €/B — E is a surjection by A4.2.7(a), and so € has a natural number 
object by A4.2.1(v). O 


For 2-valued toposes (i.e. those in which the only subterminal objects are 
1 and 0), we have an even more striking result: 


Corollary 5.1.4 Suppose E is a 2-valued topos. Then the following conditions 
are equivalent: 


(i) E has a natural number object. 


(ii) There exists a monic endomorphism f: A A in E which is not an 
isomorphism (i.e. E contains an object which is not ‘Dedekind-finite’). 


(iii) There exists an epic endomorphism g: A-» A in E which is not an 
isomorphism. 


Proof (i) > (ii) and (i) > (iii) are trivial. 

(ii) > (i): Given f as in (ii), consider f^: A4 — A^. The transpose 1 > A^ 
of 14 does not factor through f4, because f is not a split epimorphism, so the 
pullback of f4 along it is a proper subobject of 1 and hence must be 0. Hence 
we can apply 5.1.3. 

(iii) = (i): Given g as in (iii), we argue similarly using A9: A4 — A^, O 


We remark in passing that Dedekind-finite objects in general toposes have 
been studied by L. N. Stout [1127]. 


Remark 5.1.5 There are analogues of 5.1.1, 5.1.2 and 5.1.3 for list objects, as 
introduced in A2.5.15: given an object B equipped with morphisms b: 1 — B and 
t: A x B — B (where A is a given object), there is a unique smallest subobject 
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B' — B which is (b, t)-closed in the sense that we have a commutative diagram 


U 1 


ee A 


B «—— AxB 


for this subobject, the pair (b’,t') is jointly epic, and the subobject will have 
the universal property of a list object LA provided t’ is monic and its image is 
disjoint from b’. The proofs are essentially similar to those given above. 

We note that this provides an alternative, less explicit, proof of the existence 
of list objects in a topos with natural number object (A2.5.17): given an object A, 
the object (1 I A)" admits morphisms (1,)% :1 21% — (111 A)% and 


1 
Ax QIAN S22 (1A) x (01AN & (0 0A)C & (LIT A) 


whose images are disjoint, so it contains a subobject which is a list object over A. 


We saw in Section A2.5 that the natural number object comes equipped 
with a total ordering Nı — N x N. It is natural to ask whether this is in any 
sense a well-ordering. Since there exists a monomorphism 1 II 1 — N, the result 
of 4.5.7 tells us that it cannot be a well-ordering as defined in Section D4.5, 
unless the ambient topos € is Boolean; but, given this impossibility, it comes as 
close as it can. We say an ordered object (A,<) is decidably well-ordered if it 
satisfies the sentence in the internal language which asserts that every inhabited 
complemented subobject of A has a least element; and we shall see in the next 
section (5.2.10(ii)) that (N, <) has this property in any topos — in particular, it is 
well-ordered in any Boolean topos. The proof is deferred to Section D5.2, because 
it involves properties of finite cardinals; however, we note one consequence of it 
here, which extends a result already observed in Section A2.5. 


Proposition 5.1.6 Let E be a topos with a natural number object. Then 
every partial recursive function f of k variables is ‘realizable’ by a partial map 
f: NE — N in the sense that, for standard natural numbers nj = s™o in E, the 
morphism (Ti, .-. Tp): 1 — N* factors through the domain of f iff f(m,... Nk) 
is defined, and we then have 


f(™,..-,7%) = f(m... Nk). 


Proof We use the result that every partial recursive function is con- 
structible from primitive recursive functions plus a single application of the 
u-operator: specifically, for any f, there exist primitive recursive functions g 
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and h (of k + 1 variables), such that 


f(na,---, Mk) = g(ni,-.., Me, pn. (h(ni,..., ng, n) = 0)) 


where ‘yn.(...)’ means ‘the least n such that ...’ (and f(n1,...,x) is undefined 
if h(ni,... nk n) # 0 for all n). We saw in A2.5.2 how to realize primitive 
recursive functions of k + 1 variables by morphisms N*+! — N in £; so we have 
in particular such a morphism h. Now the subobject h (o) is complemented in 
N*+1\ since o is complemented in N; so we may regard it as a complemented 
subobject of (N*)*(N) in €/N*. Let m: D — N* be the interpretation of the 
sentence which asserts that this subobject is inhabited; then in €/D we have an 
inhabited complemented subobject of D* N, so its least element defines a natural 
number which, when regarded as a partial map d: N* — N, realizes the function 
pn. (h(ni,... nk, n) = 0). (Note that we do not assert that D is a complemented 
subobject of N*; in general it need not be.) Then we form the partial map 


m,d g 
D Bah neti _9 oy 


Nk 
which is the required realization of f. O 


The reader should be warned, however, that if f is (classically) a total recur- 
sive function N* — N, the partial map f: NF — N constructed in the proof of 
5.1.6 need not be a morphism N* — N in £ — its domain contains all the stan- 
dard natural numbers, but the union of the latter need not be the whole of N. 
(We shall return to this point in Section F3.4.) In this connection, we say a topos 
E with a natural number object is standard if the standard natural numbers in € 
(those of the form s”o for some external natural number n) form an epimorphic 
family of morphisms 1 — N. We saw in Section A2.5 that in any topos with 
countable copowers N is the countable copower of copies of 1, with the standard 
natural numbers as the coprojections; so all such toposes are standard. However, 
not all toposes with natural number objects are standard: 


Example 5.1.7 Let ® be a filter of subsets of the set N of natural numbers 
containing all cofinite sets, and let € be the filterpower (Set/N) (cf. A2.1.13). 
Since the projection Pp: Set/N — (Set/N)@ is logical, Ps(N*N) is a natural 
number object in €. It is clear that there are natural numbers in € which are 
not standard: for example, the number which is the image under Pg of the 
diagonal map N — N x N. But Ps maps every finite subset of N to 0, so this 
natural number is actually disjoint from every standard natural number in €. 
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Since epimorphic families are stable under pullback in any topos, the standard 
natural numbers cannot form an epimorphic family in €. 


We note that the topos of 5.1.7 is not merely nonstandard; it does not admit 
any bicartesian (that is, cartesian and cocartesian) functor to any standard topos. 
For any such functor would preserve the natural number object (and hence the 
standard natural numbers), by A2.5.6; but it would also preserve equalizers 
of pairs of morphisms 1 =3 N, and hence map the particular natural number 
constructed in 5.1.7 to a natural number disjoint from all the standard ones. On 
the other hand, we have 


Proposition 5.1.8 Let E be a small standard topos. Then there is a full and 
faithful bicartesian functor from E to a Grothendieck topos. 


Proof Let F be the topos of sheaves on € for the coverage consisting of all 
epimorphic families in €. It is easy to see that this coverage is subcanonical, and 
so the canonical functor € — F is full and faithful (cf. C2.2.15); it preserves 
finite limits, and all epimorphic families which exist in € (hence in particular it 
is a coherent functor). To prove that it preserves arbitrary coequalizers, it there- 
fore suffices to show that it commutes with the construction of the equivalence 
relation on A generated by a parallel pair B = A in £E; equivalently (since the 
reflexive and symmetric relation generated by a pair may be constructed using 
finite limits and images), to show that it commutes with the construction of the 
transitive closure of a (reflexive and symmetric) relation. 

However, in any topos with a natural number object, we may construct the 
transitive closure as an ‘internal N-indexed union’; that is, given a relation 
R= A, we may construct a relation R* 3 N*A in E/N such that the pullback 
of R» along the standard natural number s”o is the nth iterate R™ of the 
relation R, and then verify that the image of Uj R* — A x A is the transitive 
closure R of R. (Explicitly, we take R» to be the relation named by the morphism 
N — P(A x A) which is induced by the morphism 1 — P(A x A) naming the 
diagonal subobject and the morphism P(A x A) — P(A x A) which internalizes 
the operation of composing an arbitrary relation A % A with R; cf. also 5.5.8 
below.) Now since € is standard, it follows that the family of morphisms R™ = 
R is epimorphic; so it is preserved by the embedding E — F. o 


In [371], P. Freyd showed that any topos satisfying the hypotheses of 5.1.8 
in fact admits a jointly faithful family of bicartesian functors to Set. 

The definition of a natural number object tells us that morphisms whose 
domain is N (or, more generally, an object of the form A x N), may be con- 
structed recursively. However, we frequently wish to construct objects of €/N 
recursively, as well: that is, given an object A of € and a ‘procedure’ T for con- 
structing new objects from old, we wish to find an object F of E/N such that 
o*(F) = A and s*(F) & T^ (F), where T denotes the procedure T ‘applied 
fibrewise’ to objects of €/N. To assign a precise meaning to the latter informal 


1040 ASPECTS OF FINITENESS D5.1 


idea, we shall assume that T is actually a functor, and indeed an €-indexed func- 
tor E — E, where E denotes the canonical indexing of € over itself as defined in 
B1.2.2(c). We shall refer to the pair (A, T) as recursion data for constructing an 
object of €/N, and say that F satisfies the recursion data (A,T) if o*F & A and 
s*F = TN (F). For example, the generic finite cardinal C of A2.5.14 satisfies the 
recursion data (0,1 U (—)), and the list object (l: LA — N) over A, constructed 
in A2.5.17, satisfies the recursion data (1, A x (—)). 

We shall not consider the general existence question here, since in practice it 
will always be easy (as it was for the list object) to construct an explicit solution 
of any recursion problem we encounter. (The general problem is discussed at 
length in [549], where it is shown that for a general € there may be recursion 
problems which do not have a solution, but there is a large class of problems 
which do.) However, the following general uniqueness result will very frequently 
be of use to us in the next couple of sections. 


Theorem 5.1.9 Let E be a topos with a natural number object, A an object 
of E, and T:E —E an €-indexed endofunctor of E. Then there exists, up to 
canonical isomorphism, at most one object F of E/N satisfying o*(F) = A and 
s*(F) = T%(F). 


Proof Suppose F” is another solution of the recursion problem. We have to 
construct an isomorphism F © F’ in E/N; equivalently, we have to construct a 
morphism 1y — Iso(F, F’), where Iso(F, F’) denotes the object of isomorphisms 
constructed in E/N, i.e. the interpretation of the term 


{f: [FF] | (Ag: [F> FI)((va: F)(9(F(x)) = X) A (Vy: FY(F(9(y)) = y))} 


in the internal language of this topos. Now pullback functors preserve objects of 
isomorphisms (since they are logical), so we have a morphism 


la 
1 — ó Iso(A, A) © Iso(o*(F), 0*(F’)) = o* (Iso(F, F’)) . 
Similarly, since indexed functors are strong by B2.2.2, T induces a morphism 


Iso( F, F’) —> Iso(T™ (F), T^ (F’)) S s*(Iso(F, F’)) . 


But by A2.5.16 these are precisely what we need in order to construct a morphism 
ly — Iso(F, F’). 0O 


We remark that, in the above proof, the indexed category E could be replaced 
by any locally small €-indexed category (that is, any locally internal category 
over E in the sense of B2.2.1). 


Suggestions for further reading: Coste et al. [257], Freyd [371], Johnstone 
& Wraith [549], Loiseau [730], Rosebrugh [1034], Stout [1127]. 
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D5.2 Finite cardinals 


There are numerous possible notions of ‘finiteness’ for objects of a topos. Even in 
classical logic, it is well known that there are many different definitions of ‘finite 
set’, and that (at least some part of) the axiom of choice is required to prove 
all of them equivalent; in the non-classical internal logic of a topos, the range of 
possibilities is even wider, and we shall not in this work be able to explore more 
than a small fraction of those which have been considered. However, there are two 
particular notions which seem to be by far the most important for applications, 
and which we shall consider in the present section and Section D5.4 respectively. 

We devote the present section to studying finite cardinals. The definition of 
finite cardinals was given in Section A2.5: a finite cardinal in a topos € (with a 
natural number object) is an object which occurs as a pullback, along a morphism 
p:1— N, of the particular object C = (sm2: Ni — N) of E/N, the generic finite 
cardinal. As in section A2.5, we shall write [p] for the cardinal corresponding to a 
morphism p: 1 — N. The key feature of cardinal-finiteness, which makes it such 
a good notion to work with, is precisely the fact that the finite cardinals in E 
are ‘parametrized’ by the generic finite cardinal, which in a sense contains all of 
them as particular cases; thus we can prove general results about finite cardinals 
by simply proving that they hold for the generic cardinal, and then observing 
that they are stable under pullback in an appropriate sense. We shall see many 
examples of this style of argument in what follows. 

We saw in A2.5.14 that we have [o] = 0 and [sp] = 111 [p] for any p. This gives 
rise to an important ‘induction principle’ for proving facts about finite cardinals, 
which we shall use repeatedly: 


Lemma 5.2.1 LetP be a property of objects which is expressible in the internal 
language of a topos, and suppose (i) that the initial object 0 satisfies P, and 
(ii) that whenever an object A satisfies P, so does 1 II A. Then every finite 
cardinal satisfies P. 


Proof Since P is expressible in the internal language, we can construct a 
subobject N’ >=> N which is the truth-value of the assertion that the generic 
finite cardinal satisfies P in €/N. The two conditions on P imply that N’ is a 
(o, s)-closed subobject of N; so it is the whole of N. o 


Condition (ii) of 5.2.1 may of course be split into two: that 1 satisfies P, and 
that the class of objects satisfying P is closed under binary coproducts. In this 
connection, we note 


Lemma 5.2.2 For natural numbers p and q in a topos E, we have isomor- 
phisms |p + q] = [p] I [q], [p x a] = [p] x [q] and [p — q] = [p]. In particular, 
the class of finite cardinals in E is closed under finite coproducts, finite products 
and exponentials. 
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Proof Since the pullback functor g* preserves coproducts, products and expo- 
nentials, it suffices to establish the three isomorphisms for the pair of natural 
numbers (N*p,1y) in E/N. But we can do this using 5.1.9, by verifying that 
the objects on each side of the isomorphism satisfy the same recursion data. For 
example, for addition we have [p + o} = [p] = [p] I [o], [p + sq] = [s(p + q)] = 
[p +q] I 1 and fp) U [sq] S [p] I ({g] 111) = ([p} I {q}) I 1, so that both sides 
are solutions of the data o* F = [p], s*F & F I 1. Similarly, the two sides of the 
second isomorphism satisfy o* F = 0 and s*F = F II [p]; and those of the third 
isomorphism satisfy o*F = 1 and s*F = F x [p]. The second assertion of the 
lemma is immediate from the first, plus the facts that 0 = [o] and 1 & [so] are 
finite cardinals. 


We write Efe for the full subcategory of E whose objects are finite cardinals 
in €. In addition to finite products and coproducts, it is closed under equalizers 
and coequalizers: to prove this, we require a couple more lemmas. Note first that 
cardinals are decidable, since they are subobjects of N; hence the equalizer of 
any parallel pair [p] = [q] is a complemented subobject of [p]. 


Lemma 5.2.3 A complemented subobject of a finite cardinal is a finite 
cardinal. 


Proof We note first that a complemented subobject of 1 is a cardinal: if 
U — 1 has a complement V — 1, we may identify it with [p] where p is the 
unique morphism making the diagram 


U >>> | «c V 
AEE EA N S 


commute. 

Now consider the generic case. We have an object D = (N*2)° in E/N 
indexing the complemented subobjects of the generic cardinal C; we have to 
construct a morphism D — N*N whose cardinal is isomorphic to the generic 
complemented subobject of C. But constructing such a morphism is equivalent 
to constructing an element of the exponential (N* N)? in E/N; and we may do 
this by A2.5.16 using the morphism 


1 —“s N2N! Xot((N*NY ND) 


and the morphism (N*N)? 5 s*((N*N)®) = (N*N) P whose exponential 
transpose is 


ev x N*u 


(NN) DI x (NFA) x N*2 N*N x N*N ——> N*N 
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where u: 2 — N is the ‘cardinality’ of the generic complemented subobject of 1, 
constructed as above. It is then a straightforward application of 5.1.9 to verify 
that the cardinal of this natural number is isomorphic to the generic comple- 
mented subobject of C. g 


In the opposite direction, we have 


Lemma 5.2.4 The image of any morphism f : |p| — |q] between finite cardinals 
is a complemented subobject of [q]. 


Proof Fix the cardinal [q]. The assertion, for an arbitrary object A, that the 
‘image’ map [q]4 — Ql factors through the subobject 214] — Qla] is expressible 
by a sentence in the internal language of €. This sentence is trivially valid when 
A = 0, and its validity for A = 1 is just the assertion that [|q] is decidable. And if 
it holds for two objects A and B, then it holds for their coproduct, since a finite 
union of complemented subobjects is complemented. Hence by 5.2.1 it holds for 
all finite cardinals [p]. o 


Lemma 5.2.5 The coequalizer of a parallel pair of morphisms between finite 
cardinals is a finite cardinal. 


Proof Asin 5.2.3, we consider first the case of a pair f,g: 1 = [q]. By the last 
two lemmas, we have a coproduct decomposition 


EEEE 


pulling this back along g, we get a coproduct decomposition 1 = U I V. Now 
the morphisms 1 II 9’: 1 ¥ U I V — U U [r] (where g’ is the restriction of g) and 
vz: [r] + U U [r] induce a morphism [q] — U I [r], which is readily seen to be 
a coequalizer of f and g; but U is a complemented subobject of 1 and hence a 
cardinal by 5.2.3, and so U I [r] is a cardinal by 5.2.2. 

Now consider the general case: fix a cardinal [p] in E, and consider the asser- 
tion, about a variable object A, which says ‘for each pair of morphisms A = [p], 
there exists q: 1 — N and a morphism fp] —> [q] such that A = [p] — fq] is a 
coequalizer’. This assertion is expressible in the internal language of €, and it is 
clearly true for A = 0 since two morphisms 0 = [p] must be equal. Suppose it 
holds for A, and consider a pair of morphisms 1 I A = [p]. Then the coequal- 
izer of A =3 [p] can be written in the form [p] > [q], and we have to form the 
coequalizer of the two composites 1 =3 [p] —> [q]. But by the first part of the 
proof this coequalizer is a finite cardinal; so 1 H A satisfies the assertion above. 
Hence by 5.2.1 the assertion holds for all finite cardinals. gO 


Corollary 5.2.6 A quotient of a finite cardinal is isomorphic to a finite 
cardinal iff it is decidable. 


Proof One direction is immediate from the fact that finite cardinals are decid- 
able. Conversely, given an epimorphism |p] —» A where A is decidable, its 
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kernel-pair is a complemented subobject of [p] x [p] = [p?], and hence isomorphic 
to a finite cardinal by 5.2.3; but any epimorphism in a topos is the coequalizer 
of its kernel-pair, so the result follows from 5.2.5. o 


Putting together the above results, we have 


Theorem 5.2.7 For any topos E with a natural number object, the full sub- 
category Efo of finite cardinals in E is a Boolean topos. The inclusion functor 
Efc —> E preserves finite limits, finite colimits and exponentials; it is logical iff E 
is Boolean. 


Proof The existence of finite limits, finite colimits and exponentials, and their 
preservation by the inclusion functor, follow from 5.2.2, 5.2.3 and 5.2.5. By 5.2.3 
and 5.2.4, the subobjects of a cardinal [p] in Efe are precisely its complemented 
subobjects in £; so the cardinal [sso] = 2 will serve as a subobject classifier in 
Efc, and it is preserved by the inclusion functor iff € is Boolean. oO 


Examples 5.2.8 (a) Let C be a small category. Since inverse image functors 
preserve natural number objects by A2.5.6, the natural number object in [C, Set] 
is simply the constant functor C — Set with value N. Hence a natural number in 
[C, Set] is a natural-number-valued function on ob C which is constant on each 
connected component of C; and a finite cardinal is a finite-set-valued functor 
which is constant on each connected component. Thus [C, Set]je ~ (Sets)’, 
where T is the set of connected components of C. (In particular, if G is a group, 
then [G, Set] j< is equivalent to Set;, rather than (as one might perhaps have 
hoped) to the topos [G, Set] of A2.1.4; for the latter, one needs to take the 
topos [G, Set]ax s of 5.4.18 below.) 

(b) Similarly, if X is a topological space, the natural number object in Sh(X) 
is the constant sheaf N, i.e. the sheaf of sections of the local homeomorphism 
(nı: X x N — X) (where N has the discrete topology); so a natural number in 
Sh(X) is a continuous (= locally constant) function X — N. If the space of 
components of X is discrete (for example if X is locally connected), then we 
may deduce that Sh(X);. ~ (Sety)/, where J is the set of components of X. 


Next, we note some further consequences of the induction principle 5.2.1. 
Lemma 5.2.9 


(i) Finite cardinals are (internally) Dedekind-finite; that is, the canonical 

monomorphism Iso((p], [p]) — Mono([p}, [p]) is an isomorphism for all p. 

(ii) Finite cardinals are internally projective in the sense of 4.5.1(b); that is, 
the functor (—)'"! preserves epimorphisms. 

(iii) Finite cardinals are ‘decidably well-ordered’; that is, if we order |p] as a 


subobject of N , it satisfies the sentence which says that every complemented 
inhabited subobject of [p] has a least member. 
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Proof (i) The assertion that an object A is internally Dedekind-finite is 
expressible by a sentence in the internal language, and it is clearly true for 0 
since Iso(0,0) = Mono(0, 0) = 1. Suppose it is true for A, and suppose we have a 
monomorphism m: 1 I] A — 1 I A (in some slice category €/B — but, to simplify 
the notation, we shall assume B = 1). Form the pullbacks 


U: I Ay >—> 1UA << VIA 


my, m mə 


vı V2 
1 >——-> 101 A <_< A 
Since mı is monic, we see that A; is subterminal, and indeed contained in the 
complement U2 of U, in Sub(1); thus we have a monomorphism 


m 
A® A, IA >—> Up A >> A 


which must be an isomorphism by hypothesis. It now follows easily that my and 
A, >> Uz are both isomorphisms, whence m = m; II mg is an isomorphism. 

(ii) The assertion that A is internally projective is not expressible by a sen- 
tence in the internal language, since it involves quantification over arbitrary 
epimorphisms in £. However, the assertion, for a given epimorphism e: B > C, 
that e^ is epic is so expressible. It is trivially satisfied when A = 0 or A = 1; and if 
it holds for A; and Ag then it holds for A; I Ag, since (—)41¥42 x (_)41 x (_)A2, 
So it holds for all finite cardinals, and for all epimorphisms in €. 

(iii) Here we are dealing with a property of ordered objects, rather than of 
objects pure and simple; but the induction principle works in the same way. 
Clearly 0 is (decidably) well-ordered. Suppose (A, <) is decidably well-ordered, 
and consider a well-supported complemented subobject B — 1 H A, where (for 
the moment) we regard the extra element 1 as preceding all of A. We can decom- 
pose B as a coproduct U II A’, where U is a complemented subterminal object; 
the fact that B is well supported ensures that oA’ contains the complement V of 
U, and so V* A’ is a well-supported subobject of V* A in €/V. Hence we can find 
a morphism a’: V — A’ which is its least element; then the unique morphism a 
making 

U ————> 1 «c V 


U ———_> B <———__ A’ 
commute is easily seen to be the least element of B. 
Strictly speaking, we have proved that a finite cardinal is decidably well- 
ordered in the opposite of its usual ordering; if we identify [p] with the subobject 
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{x: N | sx < p}, then the extra element of [sp] = 1U [p] comes after the elements 
of [p], not before them. However, any finite cardinal is order-isomorphic to its 
opposite; for 0 this is trivial, and for a successor cardinal [sp] the isomorphism 
is given by the map p—(—). So [p] is also decidably well-ordered in its usual 
ordering. oO 


We shall prove a stronger version of 5.2.9(ii) in the next section (see 5.3.10(i)). 
Corollary 5.2.10 In any topos E with a natural number object, 


(i) the subcategory Efe satisfies (AC); 
(ii) the natural number object is decidably well-ordered. 


Proof (i) Lemma 5.2.9(ii) implies that Efe satisfies (IC), since the inclusion 
Efc — E preserves exponentials. However, as an object of Efe the terminal object 
1 S [so] is not just internally but externally projective; for if a cardinal [p] is 
well-supported, then it has a (least) element by 5.2.9(iii). So the result follows 
from 4.5.4. 

(ii) Let A — N be a well-supported complemented subobject. Since A is 
well-supported, we may assume (after pulling back along some B — 1) that it 
contains some natural number p; so AM [sp] is well-supported, and hence has a 
least element, which is clearly the least element of A. oO 


We may make the categories (€/I);-, I € ob €, into an €-indexed category 
E;, since pullback functors preserve finite cardinals. As an indexed category, Es 
is ‘small’; that is, it is equivalent to the standard indexing of an internal category, 
namely the internal full subcategory of the canonical indexing of € over itself 
generated by the generic finite cardinal (cf. B2.3.5). This internal category has 
already been exploited in B3.2.9; the following lemma is of use there, and also 
in Section D5.3. 


Lemma 5.2.11 Inverse image functors preserve exponentiation to the power 
of a finite cardinal; that is, if f: F — E is a geometric morphism, p a natural 
number in E and A an object of E, then the canonical comparison map f*(Al?!) > 
(f* A)\F"Pl is an isomorphism. 


Proof It suffices to prove that (f/N)* preserves the generic such exponential 
(N*A)°. We can do this either by recalling that Djv((N*A)°) is the list object 
LA (A2.5.17) and that list objects are preserved by cartesian functors having 
right adjoints, or alternatively using 5.1.9 and the fact that the two objects of 
F/N which we want to prove isomorphic satisfy the same recursion data. E 


In dealing with finite cardinals in slice categories of E, the following result is 
often useful. 


Proposition 5.2.12 Let f: A — |p] be a morphism of E whose codomain is a 
finite cardinal. Then f is (isomorphic to) a finite cardinal in E/[p] iff A is a 
finite cardinal in €. 
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Proof First suppose A is a finite cardinal |g], say. Then we have a pullback 
diagram 


| ee 
aP) A 


fxi 
[a] x [p] —> [p] x [p] 


in E, which we may reinterpret as a pullback 


F ——> liy 


[p]" [q] LAA [p]* [p] 
in E/[p]. Since (€/[p]}fe is closed under pullbacks by 5.2.7, it follows that f is a 
finite cardinal. 

Conversely, consider the assertion about an object B which says that, for 
every natural number r: B — N in E/B, there exists q: 1 — N such that 
Ep[r] & [q]. This assertion is expressible in the internal language, and it is 
clearly true for B = 0 and B = 1. If it holds for B, and Bo, then for any 
r = (r1,r2): Bı Il Bz > N it is easy to see that Xpg[r] = Xps, [rı] U Xz, [re], so 
it is a finite cardinal by 5.2.2. So the assertion holds for all finite cardinals, by 
5.2.1. 0O 


The next two results ‘internalize’ the idea that arbitrary coproducts may be 
constructed from finite coproducts and filtered colimits. 


Lemma 5.2.13 Let S be a topos with a natural number object, and let C and 
D be S-indexed categories with S-indexed coproducts, such that D is locally small 
and both C and D are stacks for the coherent coverage on S. If F: — D is an 
S-indexed functor preserving (fibrewise) finite coproducts, then F also preserves 
coproducts indexed by finite cardinals in S, i.e. the diagram 


I 
cf pl 


Le Er 
J 
CI F > D! 


commutes up to isomorphism whenever z: I — J is a finite cardinal in S/J. 
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Proof Since the Beck—Chevalley conditions hold for coproducts in both C 
and D, it suffices to consider the case when z is the generic finite cardinal 
C: Ni > N. (Actually, we need to consider the generic cardinal in S/J for an 
arbitrary object J; but since the hypotheses are ‘stable under slicing’ we may 
as well take J = 1.) In this case, given an object A of C™, we have to con- 
struct an isomorphism Ec F%1(A) > FN£c(A) in DN; more precisely, to show 
that the canonical morphism between these two, regarded as a morphism 1 —> 
D(XcF%1(A), FNEc(A)) in S/N, factors through the object of isomorphisms 
Iso(Xc F™1(A), FNUc(A)). To do this, we use 5.1.9 (or rather the extension of 
5.1.9 mentioned in the remark which follows it). Applying o* to this object of 
isomorphisms yields Iso(0p, F!(0c)) where Oc and Op denote the initial objects 
of C! and D! respectively, since C? ~ D? ~ 1 by the descent condition, and this 
object is isomorphic to 1 since F! preserves the initial object. A similar argument 
using the fact that F preserves binary coproducts yields the required morphism 


Iso(£cF™ (A), FNS c(A)) —> s*(Iso(EcF™! (A), FN Ec(A))) . 
o 


Proposition 5.2.14 Jn addition to the hypotheses of 5.2.13, suppose that C 
and D have, and F preserves, arbitrary S-indexed filtered colimits. Then F 
preserves arbitrary S-indered coproducts. 


Proof The hypotheses on filtered colimits are to be taken, as in Sections C3.2 
and C3.4, to refer to filtered internal categories in arbitrary slice categories S/J 
of S; equivalently, by B2.6.6, to arbitrary weakly filtered categories in S. In other 
words, the statement that F preserves filtered colimits means that, for a weakly 
filtered internal category I with mol = J, the diagram 


lI, F] 
{I,C] ——> [ID] 


limy lim; 
FY 
ci —— > p 
commutes up to canonical isomorphism. With this proviso, the hypotheses of 
the proposition are stable under slicing, so it suffices to show that for any object 
J of S the diagram 


J 
oE’ 


commutes. 
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The key idea is that we can express J as a filtered colimit of finite cardinals. 
In fact we have already done this in the proof of B3.2.9; let us recall that, in that 
example, we associated with an object J of S an Sy-torsor J given by equipping 
the list object L(J) = En((N*J)°) with an appropriate right action of Sy. 
The assertion that J corresponds to a geometric morphism S — [S;,S] whose 
inverse image sends the generic object to J becomes, in this context, that the 
colimit in S of the diagram J — S; — S (where the first factor is the discrete 
fibration, and the second is the inclusion functor) is J. Hence we may compute 
the J-indexed coproduct of an object of C7 by first computing the L(J)-indexed 
family of finite coproducts corresponding to this diagram, and then forming their 
filtered colimit. But F commutes up to isomorphism with both these operations 
(the first by 5.2.13, and the second by assumption); so it commutes with Dj, as 
required. q 


Corollary 5.2.15 Suppose C and D are cocomplete S-indered categories 
(where S is a topos with a natural number object), which are stacks for the 
coherent coverage on S, and suppose that D is locally small. Then an S-indezed 
functor F: C — D is S-cocontinuous iff it preserves finite (fibrewise) colimits and 
filtered S-indezed colimits. 


Proof This is immediate from 5.2.14 and the definition of cocontinuity 
(B1.4.13) for indexed functors. O 


Suggestion for further reading: Johnstone & Wraith [549]. 


D5.3 Finitary algebraic theories 


In this section we explore the relationship between finitary algebraic theories (in 
the sense of 1.1.7(a)) in toposes and monoids in the object classifier (cf. 3.2.1). 
We begin by recalling some facts about reflexive coequalizers (cf. A1.2.11). 


Lemma 5.3.1 Let € be a cartesian closed category, and T a finitary algebraic 
theory. Then the forgetful functor T-Mod(€) — E creates coequalizers of reflex- 
ive pairs. If in addition E has coequalizers of reflexive pairs, then the forgetful 
functor T-Mod(€) — E is monadic iff it has a left adjoint. 


Proof If h,k: A= B is a reflexive pair of T-model homomorphisms and we 
are given a coequalizer q: B — C for (the underlying morphisms of) f and g 
in E, then it follows from A1.2.12 (plus induction) that A” 3 B” — C” is a 
coequalizer for each natural number n. Hence, for each n-ary function symbol f 
of T, the morphisms f4 and fg induce a unique morphism fc: C” — C such that 
q is a homomorphism. Also, since q” is epic for each n, C satisfies any equation in 
these operations which is satisfied by B; in particular, it is a T-model. It is now 
straightforward to verify that, when we equip C with this T-model structure, 
q: B — C becomes a coequalizer for h and k in T-Mod(€). 
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The second assertion is immediate from the first and A1.1.2, since the for- 
getful functor is clearly conservative. a 


We shall also require the following properties of categories of T-models: 
Lemma 5.3.2 LetT be a finitary algebraic theory. 


(i) If E is regular (resp. effective regular), then T-Mod(€) is regular (resp. 
effective regular). 
(ii) If E is locally cartesian closed, then the E-indexed category T-Mod(€) 
defined by T-Mod(€)! = T-Mod(E//) is €-complete. 
(iii) If E is locally cartesian closed and cocartesian, then T-Mod(&) is 
E-cocomplete provided the forgetful functor T-Mod(E/I) — E/I has a 
left adjoint for every I. 


Proof (i) Since image factorizations in E are stable under finite products, 
it is clear that the image in € of any homomorphism h: A —> B of T-models 
in E can be given a unique T-model structure which makes both halves of the 
factorization of h into homomorphisms. It follows that the covers in T-Mod (£) 
are exactly the homomorphisms whose underlying €-morphisms are covers; hence 
also T-Mod(€) has images, and they are stable under pullback. For effectiveness, 
we similarly verify that if R = A is an equivalence relation in T-Mod(€), then 
the coequalizer Q of R = A in € can be given a unique T-model structure which 
makes the cover A — Q into a homomorphism; and R = A is the kernel-pair of 
A->@Q in T-Mod(€), since it is so in E and the forgetful functor creates limits. 

(ii) It is trivial that T-Mod(€) has €-indexed finite limits. For €-indexed 
products, we have simply to observe that the pullback functors 2*: E/I => €/J 
induced by morphisms x: J — I in E, and their right adjoints I}, both pre- 
serve finite limits and so lift to functors between the corresponding categories of 
T-models. 

(iii) The existence of finite colimits in the categories T-Mod(€/J), and 
their preservation by pullback functors, follows from 5.3.1 and a theorem of 
Linton [727]. The functors £,:€/J— E/I do not lift directly to functors 
between the categories of T-models, since they do not preserve finite products, 
but it follows from A1.1.3(i) that the ‘lifted’ left adjoints exist. a 


If € satisfies the hypotheses of 5.3.2(i) and (ii) and A is a particular T-model 
in €, then we note that the €-indexed category which to an object I associates 
the preorder of sub-T-algebras of I*A is €-complete, since it is reflective in the 
€-indexed slice category T-Mod(&)/A. In other words, we can form the internal 
intersection of an arbitrary €-indexed family of subalgebras of a given algebra — 
a fact which we have already exploited in 5.1.1 (and in B4.2.10). Of course, it is 
not hard to prove directly that such intersections exist. 

It is clear from 5.3.1 and 5.3.2 that we need to investigate the circumstances 
in which the forgetful functor T-Mod(€) — E has a left adjoint. There is one 
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case in which the construction of this left adjoint is particularly easy: namely, 
when T is the theory of monoids. 


Proposition 5.3.3 Let E be a topos with a natural number object. Then, for 
any object A of E, the list object LA over A has the structure of a monoid; and 
in fact LA is the free monoid generated by A. 


Proof By 5.2.2, we have [p + q] S [p] I [q], from which it follows easily that 
+*(1: LA => N) = nf (l) x 73 ()) in E/N x N, i.e. we have a pullback square 


+ 
[AXLA—=s LA 


lxl l 


Nic Nay 


We take the top edge of this square to be the multiplication of the monoid LA; 
the unit is similarly defined by the pullback square 


OA 
1 ———> LA 


ns Sy 


The fact that (LA, +4,04) satisfies the axioms of the theory of monoids follows 
straightforwardly from the fact that (N,+,0) does so. We note also that the 
construction of +4 and o4 is ‘natural’ in A, so that L becomes a functor E — 
Mon(€). 

To verify that L is left adjoint to the forgetful functor, we shall construct 
the unit and counit of the adjunction. The unit is the top edge of the pullback 
square 


5 Gen TA . 


1—"_sN 


again, this is clearly natural in A (and we note that n4 = sa(1 x 04)). To 
construct the counit, let (M,m,e) be a monoid in E; by the definition of list 
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objects, we have a unique morphism cem: LM — M such that 


OM SM 
1 ——> LM <—- Mx LM 


EM 1x €y 


M <—— MxM 


commutes. Now we have eunm = €mSm(1 x om) = m(1 x €y)(1 x om) = 
m(l x e) = lm, so we see that any morphism f: A— M from an arbitrary 
object A to the underlying object of a monoid M can be extended to a monoid 
homomorphism ey Lf: LA — LM — M. But if we had another such extension 
g, the equalizer of g and em Lf would be a submonoid of LA containing na, from 
which it is easy to verify that it would be a (04, s4 )-closed subobject and hence 
the whole of LA (cf. 5.1.5). So the extension is unique, and the adjunction is 
established. o 


Remark 5.3.4 In fact the existence of a natural number object in € is nec- 
essary as well as sufficient for the existence of a free functor E  Mon(€). We 
can see this using 5.1.3: for if (M, m,e) is a monoid freely generated by 1 (with 
g: 1— M as the unit of the adjunction), then it is easily verified that 1 H M is 
also a monoid with unit vı and multiplication 


(V1, V2, V2, V2SM) 
(10M) x(10M)21UMUMU(Mx M) -2 S 10M, 


where s is the composite 


1xg m 
M —-—> M x M —-> M, 


and that 1 H M is freely generated by the element 2e: 1 — 1 H M. So we have 
an isomorphism M 1 H M, and hence by 5.1.3 € has a natural number object 
— which is of course M itself. (Alternatively, we could argue as in the proof 
of 5.4.10 below to show that a free monoid generated by A necessarily has the 
universal property of a list object over A.) 

Variants of this argument can be given for a number of other familiar algebraic 
theories (for example, for groups}; but there are also some nontrivial theories, 
such as that of semilattices (cf. 5.4.9 below), for which the free functor exists in 
any topos. 


Theorem 5.3.5 LetE be a topos with a natural number object, and let T be a 
finitely-presented finitary algebraic theory (i.e. one whose signature has finitely 
many function symbols, and which has finitely many azioms). Then the forgetful 
functor T-Mod(€) — E has a left adjoint. 


D5.3 FINITARY ALGEBRAIC THEORIES 1053 


Proof We consider first the case when T is a free theory, i.e. one with no 
axioms. Let F = { fj, fo,..., fx} be the finite set of function symbols of T; as 
usual, we identify F with a finite copower of 1 in E£, and write a: F — N for the 
morphism indexing each operation by its arity (regarded as a standard natural 
number in €). We shall construct the free T-model on an object A of € as a 
subobject of the list object L = L(F I A). 

We note that L itself has a T-model structure induced by its monoid structure: 
given an operation f; (with a(f;) = ni, say), we define (f;),: L™ — L to be the 
interpretation of the term (fi) +21 +---+2n,, where 7 and + are as defined 
in the proof of 5.3.3, and 21,...,2n, are variables of sort L. Also, we have a 
morphism v2: A > L. 

Now let N; denote 1 H N, with the element vı: 1 —1HN denoted by L 
(and elements of N identified with their images under v2). We define p: L — NL 
to be the unique morphism making the diagram 


(hee ss 7 er A) ee 


p 1xp 


N, <— (FIUA)x Ny, 


commute, where t is the morphism defined by 


ale ify = 1 
‘eas sy ifye NandzeaA (5.1) 
) s(ytaz) ifyeN,xe€Fandar<y 
L ifyeN,xe€Fandar>y. 


(Intuitively, p is the morphism which sends a string of elements of F I A to 
n if the string can be ‘parsed’ as a sequence of n ‘well-formed words’, and to L 
if it cannot be so parsed.) It is now straightforward to verify that the pullback 


M ——> 1 


L —> Ni 


is a sub-T-model of L, and that it contains nv2: A — L; also, the restrictions 
of the (f;),: L™ — L to M™ are all monic, since if p(nvı (fi) + x) = so, then 
p(x) = n; and there is a unique way of decomposing T as a sum of n; elements 
of M. And the images of these morphisms are disjoint from each other and from 
NV2, since this is already true for L. It is also easy to see that these morphisms 
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are jointly epic, so in fact we have a coproduct decomposition M = AU M™ II 
Me U0... 0 M”. 

It can be shown without much difficulty that M is generated as a T-model 
by nv2: A — M (ie. that it is the internal intersection of all sub-T-models con- 
taining this subobject); but even without verifying this we could cut down to 
the submodel generated by A by the remarks after 5.3.2, and then proceed 
much as in the proof of 5.1.2(iii) => (i). Given a T-model B, together with a 
morphism h: A — B, we consider the smallest sub-T-model (a,b): R — M x B 
which contains the image of (g, h): A — M x B, and verify that R is the graph of 
a homomorphism M — B by showing that {x: M | (sly: R)(a(y) = x)} defines 
a sub-T-model of M containing A. Hence every such h extends to a homomor- 
phism M — B; but the equalizer of any two such homomorphisms would be a 
sub-T-model of M containing A, and so the extension is unique. 

We now prove the general case by induction on the number of axioms of 
T. Suppose that T is obtained by adding a single axiom to a theory S, and we 
already know that free S-models exist. We shall show that T-Mod(E€) is reflective 
in S-Mod(€). Suppose the extra equation in T involves n free variables; then 
for each S-model B we may construct a pair of morphisms B” = B in E corre- 
sponding to the terms on either side of the equation, such that these morphisms 
are equal iff B is a T-model. Let F denote the free S-model functor, and form 
the coequalizer q: B — C of the induced pair F(B") 3 B in SMod(€) (which 
we may do by 5.3.2(iii)). By the proof of 5.3.2(i), we know that q is epic in €, 
and hence that q”: B” — C” is epic; but since q is a homomorphism it com- 
mutes with the morphisms B” =3 B and C” = C which are the interpretations 
of the terms in the extra equation, and hence C must satisfy this equation, i.e. 
it is a T-model. But any homomorphism h: B — D from B to a T-model must 
have equal composites with B” =3 B and hence with F(B") = B; so it factors 
uniquely through q. In particular, if B is the free S-model on some object A of E, 
then C has the universal property of a free T-model on A. o 


We note that, in both halves of the proof of 5.3.5, the finite presentability 
assumption could be replaced by an ‘E-presentability’ assumption: in the first 
part, we could replace the finite set F by an arbitrary object of € equipped with 
an ‘arity’ map F — N, and in the second we could replace the single equation 
by an ‘J-indexed family of equations’. In particular, if E is cocomplete, we can 
construct free functors for arbitrary (Set-based) finitary algebraic theories; but 
in fact the assertion we have just made is more general than this, as we shall see 
below. f 


Remark 5.3.6 There is an alternative, less explicit, proof of 5.3.5, using the 
ideas of B4.2.10. Given a finitely-presented algebraic theory T and an object 
A of E, we may consider the geometric theory (A|T) over E (in the sense 
of B4.2.7) whose models consist of a T-model M together with a morphism 
from A to the underlying object of M. We may construct a classifying topos 
(p: E[(A|T)] — €) for this theory, containing a generic model (f: p*A — M), 
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by the method of B4.2.9. Then, as in the proof of B4.2.10, we may show that the 
transpose f: A —> p,M of f is a weakly initial object in the category of (A | T)- 
models in €; and, by cutting down to the internal intersection of all sub-T-models 
of p,M which contain the image of f, we may obtain an initial object of this 
category, that is a free T-model generated by A. 


Next, we note 


Lemma 5.3.7 Let f: F — E be a geometric morphism, and T an algebraic 
theory such that the forgetful functors T-Mod(€) — E and T-Mod(F) > F 
have left adjoints (for example, suppose that T is finitely-presented and that E 
(and hence F) has a natural number object). Then the diagram 


f* 


Ey 2 


* 


T-Mod(€) —> T-Mod(F) 


commutes up to natural isomorphism, where the vertical arrows denote the free 
functors. 


Proof The functors in the diagram all have right adjoints, and the diagram 
of right adjoints trivially commutes. So this is immediate from uniqueness of 
adjoints. a 


Corollary 5.3.8 Let E be a topos with a natural number object, and T a 
finitely-presented algebraic theory. For each E-topos (f:F — €), let Tr = 
(Tr,n¢, px) be the monad on F induced by the free T-algebra functor and 
its right adjoint. Then the monads TF form a Yop/E-indexed monad on the 
Yop /E-indered category (f: F > E) > F. 


Proof This is immediate from 5.3.7 and the fact that inverse image functors 
also commute with the forgetful functors T-Mod (F) — F. o 


We have seen indexed monads of the type described in 5.3.8 before. In 3.2.1 
we defined a monoidal structure on the object classifier Set[O], and observed 
that monoids for this structure correspond to Lop/Set-indexed monads. The 
same argument works over any base topos € with a natural number object: if 
E[O] denotes the object classifier over E (which exists by B3.2.9), then an indexed 
endofunctor of €-toposes corresponds to an object T = Tejo (Go) of E[O] (where 
Go is the generic object), and an indexed monad structure on the functors TF 
corresponds to a monoid structure on T. Accordingly, we now introduce the 
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following definition: 


Definition 5.3.9 Let E bea topos with a natural number object. By an (inter- 
nal) finitary algebraic theory in €, we mean a monoid in the monoidal category 
(E[O], 9, Go) of 3.2.1. If T = (T,m,e) is such a monoid, we write T-Mod(F) 
(where F is an arbitrary topos defined over €) for the category of algebras of the 
monad whose functor part is the F-component Tr of the indexed endofunctor 
of €-toposes corresponding to T. 


We have seen that, in the case € = Set, the above definition includes the 
classical notion of finitary algebraic theory as a special case. (In fact, as we shall 
see below, it is equivalent to the latter.) We next establish the analogue of 5.3.1 
for our new notion: 


Lemma 5.3.10 Let € be a topos with a natural number object. 


(i) If p:1—N is a natural number in E, then the functor (—)P): E > E 
preserves coequalizers of reflexive pairs. 
(ii) If T is an object of EJO], then for any E-topos F the functor Tr: F — F 
preserves coequalizers of reflexive pairs. 
(iii) If T is a finitary algebraic theory in E (in the sense of 5.3.9), then for 
any E-topos F the forgetful functor T-Mod(F) — F creates coequalizers 
of reflexive pairs. 


Proof (i) This is proved by the same method as 5.2.9(ii): given a particular 
reflexive coequalizer diagram A = B — C in £, the assertion that an exponential 
functor (—)* preserves it is expressible by a sentence in the internal language of 
E. This sentence is clearly satisfied when X = 0 or X = 1; and if it holds for X; 
and X then A1.2.12 ensures that it holds for X, H Xz. So it holds for all finite 
cardinals. 

(ii) Since the hypotheses are stable under change of base from E to F, it 
suffices to consider the case E = F. We recall from B3.2.9 that the object 
classifier E[O] may be constructed as the internal diagram category [E,, E], and 
that the object of objects of Ey is N. Hence by B2.3.16 E[O] is monadic over 
E/N. In particular, any object T of E[O] admits a free presentation, that is a 
(reflexive) coequalizer diagram 


R(r) Z3 R(g) ——-> T, 


where g: G — N and r: R— N are objects of ‘generators’ and ‘relations’ in 
E/N and R is the left adjoint to the forgetful functor (cf. B2.5.4(c)). But, for 
any object A of E£, the functor T > Tg(A) is the inverse image A of the 
geometric morphism A: € — E[O], and hence preserves coequalizers; so we have 
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a coequalizer diagram 


R(r)e(A) Z3 R(g)e(A) —> Te (A). 


By the construction of B3.2.9, the Es,-torsor corresponding to A has (N TAN 
for its underlying object, where C is the generic cardinal in €/N, and it follows 
easily that, for any g: G —> N, the functor R(g)¢ may be identified with the 
composite 
G* (—)is! Ec 
E ———> E/G ——> E/G ——_ > E, 

i.e. the partial product functor P(—, g) (cf. A1.5.7). And this preserves reflexive 
coequalizers, by (i) and the fact that G* and Ug have right adjoints; so the result 
for Te follows by a straightforward diagram-chase. 

(iii) is immediate from (ii) and the definition of T-Mod(F). a 


Corollary 5.3.11 Let €, T and F be as in 5.3.10(iii). Then T-Mod(F) is 
effective regular and cocartesian. 


Proof The forgetful functor T-Mod(F) — F creates limits (in particular, 
kernel-pairs); so 5.3.10(iii) enables us to lift image factorizations from F to 
T-Mod( F), and to check that they are stable under pullback and that equiva- 
lence relations in T-Mod(F) are kernel-pairs of their coequalizers. The existence 
of arbitrary finite colimits in T-Mod(F) follows from Linton’s theorem, as in 
5.3.2(iii). a 


Remark 5.3.12 In the case € = Set, the functor Tset: Set — Set induced by 
an object T of Set[O] = [Set;, Set] may be alternatively described as the left 
Kan extension of T along the inclusion Set; — Set (cf. the discussion after 3.2.1, 
or alternatively the proof of 5.3.10(ii)). Thus a &-monoid in Set[O] induces a 
monad on Set whose functor part is finitary; but this in turn can be presented 
as a (finitary) algebraic theory in the usual sense, taking the elements of T(n) as 
n-ary function symbols for each natural number n, with appropriate equations. 
Combining this with 5.3.8, we see that internal finitary algebraic theories in 
Set are (up to equivalence) the same thing as algebraic theories in the classical 
sense. However, the advantage of Definition 5.3.9 is that it enables us to give a 
unified treatment of these theories and of others whose operations and equations 
‘vary within a topos’: for example, if R is an internal ring in a topos €, we 
have a notion of R-module (that is, of f*(R)-module) in any €-topos f: F — €; 
provided £ has a natural number object, it is easy to construct free R-modules, 
by an appropriate modification of the construction in 5.3.5, and hence to express 
R-modules as the models of an internal finitary algebraic theory in E. 


If T is an object of E[O], then (—) ® T and T 8 (—) both preserve reflexive 
coequalizers as functors €[O] — E[O]: the former because it has a right adjoint by 
3.2.1, and the latter by 5.3.10(ii). From this, it follows by A1.2.12 (and the argu- 
ment in the proof of 5.3.1) that the forgetful functor Mon(E[0], 8, Go) — E[O] 
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creates reflexive coequalizers. Given that it also has a left adjoint (which may 
be constructed by an appropriate recursion — we refer to [549] for the details), 
it follows by A1.1.2 that it is monadic; and so also is the composite forgetful 
functor 


Mon(£[0], 8, Go) —> EO] ——> E/N. 


Thus we obtain the idea of a free internal finitary algebraic theory generated 
by an object g:G—WN of E/N (that is, by a G-indexed family of ‘finitary 
operations’, together with a morphism specifying their arities), and the fact that 
any internal finitary algebraic theory has a ‘free presentation’ as a coequalizer of 
a pair of morphisms between such free theories. We shall not pursue this topic 
further: the reader is referred to [549] for a more detailed account. 


Suggestions for further reading: Blass [124], Johnstone & Wraith [549], 
Lesaffre [719,720], Rosebrugh [1032, 1033], Schumacher [1099]. 


D5.4 Kuratowski-finiteness 


The notion of finite cardinal, studied in Section D5.2, provides us with a notion 
of ‘finite object’ in any topos with a natural number object, and as we have 
seen it enables us to develop a lot of ‘finitary mathematics’ inside such a topos. 
However, there is another notion of finiteness, which has the advantage that it 
can be defined in any topos (without assuming a natural number object), and 
which is more appropriate for many purposes — particularly in contexts involving 
ordered objects and lattices (cf. 5.4.26 below). This notion first appeared, in the 
context of set theory, in a paper by W. Sierpinski [1109]; but the first detailed 
investigation of it was made by C. Kuratowski [655], and it has become generally 
known as Kuratowski-finiteness (or K-finiteness for short). 

We saw in 5.2.1 that cardinal-finiteness corresponds to a certain ‘induction 
principle’: if we can prove (in the internal logic of our topos) that some assertion 
holds for the particular finite cardinals 0 = [o] and 1 = [so], and that it holds 
for a binary coproduct (= disjoint union) A I B provided it holds for both A 
and B, then it holds for all finite cardinals. Kuratowski-finiteness corresponds 
in the same way to a very similar induction principle: the only formal difference 
is that we replace the disjoint unions A H B by arbitrary binary unions A U B 
of subobjects. However, the crucial difference is that the Kuratowski induction 
principle is ‘local’; that is, it takes place inside the poset of subobjects of some 
fixed object of the topos, rather than in the topos as a whole. This is the reason 
why the Kuratowskian definition works without a natural number object: we 
can reasonably expect that the collection of all finite subobjects of some given 
object A should be ‘parametrized’ by some object K(A) of the topos (we shall 
shortly see how to construct it as a subobject of PA) - whereas the existence of 
an object F parametrizing the collection of all finite objects of the topos, for any 
reasonable definition of finiteness, will imply the existence of a natural number 
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object, since we shall have morphisms 1 — F (corresponding to the object 0) and 
F > F (corresponding to the operation A +> A II 1) whose images are clearly 
disjoint. (In fact, even if our topos € has a natural number object, we shall not 
in general be able to construct an object parametrizing all the K-finite objects 
of € in this sense.) 

The starting-point of the theory is thus the definition of the object A (A) for 
an arbitrary object A of E. 


Definition 5.4.1 

(a) Let A be an object of a topos €. We define K(A) — PA to be the sub-join- 
semilattice of PA generated by the subobject {}: A — PA: in other words, 
it is the internal intersection of all subobjects B of PA which contain {} 
and "07: 1 — PA (the name of the subobject 0 — A), and are closed under 
the binary join map U: PA x PA — PA. (Note that, in this and the next 
section, we shall generally write U rather than V for the binary join map, 
in order to distinguish it from disjunction in the internal language of £.) 

(b) We say A is Kuratowski-finite (or simply K-finite) if the top element 
Ta: 1— PA of PA factors through K(A) — PA. 


We shall see before long that (A) is indeed the object of K-finite subobjects 
of A, in the sense that a morphism 1 — PA factors through K (A) iff (the domain 
of) the subobject A’ — A which it names is a K-finite object. However, it is 
not true in general that every subobject of a K-finite object is K-finite; thus we 
do not assert that A is K-finite iff K(A) is the whole of PA. (Again, we shall 
see a counterexample before long.) On the other hand, saying that A is K-finite 
is equivalent to saying that the internal poset K(A) has a greatest element; for 
such an element must be an (internal) upper bound for the singletons of A, and 
hence it must be T4. 


Remark 5.4.2: K(A) could equivalently be defined as the smallest subobject 
of PA which contains "07: 1 — PA and is closed under the operation of ‘adding 
singletons’, in the sense that 


Q x1 


AM pA EE T E PAS SPA 


restricts to a morphism A x K(A) — K(A) (which we shall denote by 4). For 
K(A) clearly has these closure properties; conversely, if we write K’(A) for the 
smallest subobject with these closure properties, then we may show that K’(A) 
is a join-semilattice (and hence has the closure properties of K(A)), by showing 
that the subobject of K’(A) defined by the term 


{w: K'(A) | (Vw’: K"(A))((w Uw’) € "K'(A))} 


has the closure properties of K’(A) and hence is the whole of K’(A). 
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Lemma 5.4.3 
(i) The assignment A |> K(A) defines a functor E — E (in fact a subfunctor 
of the covariant power-object functor). 


(ii) The functor K preserves monomorphisms and epimorphisms. 


Proof (i) If f: A— B is a morphism of E, then the diagram 


f 


A ——— B 


{} {} 


af 
PA ——> PB 
commutes, as we saw in A2.3.6(i), and 3f (being an internal left adjoint) is a join- 
semilattice homomorphism; so it restricts to a morphism K(f): K(A) > K(B). 
Thus K is a subfunctor of the covariant power-object functor. 

(ii) The fact that K preserves monomorphisms is immediate from the fact 
that the covariant power-object functor does so (cf. A2.2.5). For epimorphisms, 
let f: A > B be an epimorphism. The image of K(A) under 3f: PA — PB is 
a sub-join-semilattice of PB, since 3f is a join-semilattice homomorphism; and 
it contains {}: B — PB by the commutativity of the diagram in the proof of 
(i) and the fact that f is epic. So it contains A(B); but the reverse inclusion is 
simply the functoriality of K. 0O 


We may now establish some of the basic closure properties of the class of 
K-finite objects. 


Lemma 5.4.4 
(i) The objects 0 and 1 are K-finite. 


(ii) Epimorphic images of K-finite objects are K -finite. 
(iii) An object which is a union of two K -finite subobjects is K -finite. 
(iv) A coproduct AU B is K-finite iff both A and B are K-finite. 


Proof (i) We have P(0) & 1, so its top and bottom elements coincide; K (0) 
contains the bottom element by definition, so it contains the top element. For 1, 
we claim that K(1) is the subobject (L, T): 1 H 1 — Q; for it contains these two 
elements of P(1) = Q (note that T is the singleton map 1 — P(1)), and they 
form a sub-join-semilattice. In particular, it contains the top element of Q. 

(ii) If f: A — B is epimorphic, then the composite If o T4: 1 > PA — PB 
equals T g; so this is immediate from 5.4.3(i). 

(iii) Similarly, for a monomorphism m: A» B, we have Imo T4 ="m', 
so if B is the union of m and m’: A’ — B where A and A’ are K-finite, then 
both “m? and m’? factor through K(B). But since K(B) is a join-semilattice, 
so does the name of their union, i.e. T p. 
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(iv) One direction follows immediately from (iii). Conversely, suppose A ll B 
is K-finite. It is easily seen that the square 


All B ———> All 


{} ({},50") 


Py, 
P(A U B) ———3 PA 


commutes, since both ways round correspond to the subobject 
(4,14): A= (AUD B) x A. But Pr is a join-semilattice homomorphism since 
pullback preserves unions of subobjects; and since K(A) is a semilattice it 
contains the element "07. Hence Pv; maps K(A Il B) into K(A), and so 
Ta = Pr o T ane factors through K(A). O 


Corollary 5.4.5 
(i) K(A) is ‘the object of K-finite subobjects of A’; i.e., given a subobject 
m: A = A, A is K-finite iff “m7: 1 — PA factors through PA. 
(ii) A subterminal object is K -finite iff it is complemented. 
(iii) For any two objects A and B, we have K(AU B) = K(A) x K(B). 


Proof (i) If A’ is K-finite, then "m™ = Jm o T 4 factors through K(A) by 
the argument of 5.4.4(iii). For the converse, we observe that the assertion ‘A is 
K-finite’ is expressible by a sentence in the internal language of €, so we may 
construct ‘the object of K-finite subobjects of A’ as a subobject K (A) of PA, by 
evaluating the truth-value of this sentence for the generic subobject of PA*(A) in 
€/PA. Then parts (i) and (iii) of 5.4.4 ensure that K(A) is a sub-join-semilattice 
of PA and that it contains {}: A — PA; so it contains K(A). 

(ii) follows easily from (i) and the fact, which we observed in the proof of 
5.4.4(i), that A (1) ¥ 1111 is the classifier for complemented subobjects. 

(iii) similarly follows from (i) and 5.4.4(iv): morphisms I —+ K(A UI B), for 
an arbitrary I, correspond to K-finite subobjects of J*(AU B) in E/I, and these 
correspond in turn to pairs of K-finite subobjects of J*(A) and J*(B). oO 


We note that part (ii) of 5.4.5 justifies the assertion, made earlier in this 
section, that not all subobjects of K-finite objects need be K-finite. By 5.4.4(iv), 
we know that complemented subobjects of K-finite objects are always K-finite; 
however, a general K-finite object may have K-finite subobjects which are not 
complemented. (We shall return to this point in 5.4.17 below.) Note also that, 
in conjunction with the fact (5.4.4(ii)) that the support of any K-finite object 
is K-finite, 5.4.5(ii) tells us that it is true in the internal logic of € that ‘any 
finite object is either empty or inhabited’. This is characteristic of the fact that 
many classical modes of reasoning, whilst invalid for arbitrary objects of a topos, 
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may nevertheless be safely employed for finite objects. As a further example, we 
sketch the proof of the ‘dual Frobenius rule’ (cf. 1.3.9). 


Lemma 5.4.6 For any (first-order) formulae ọ and w such that @ does not 
involve the variable y, the sequent 


((Vy)(PV Y) Fz ($ V (Yy)y)) 


is valid in the internal logic of a topos, provided (the object interpreting) the sort 
of the variable y is a K -finite object. 


Proof Let A be the product of (the objects interpreting) the sorts of the 
variables in 7, and let B be the sort of y. Then @ and w are respectively 
interpreted as subobjects X ~— A and Y >> Ax B. We need to show 
that Yx ((X x B) UY) < X U Yr,(Y) in Sub(A) (in fact we have equality 
here, since the reverse inclusion is true in general). It is clear that we may 
form a subobject L of PB which is ‘the object of subobjects B’ such that 
Yr ((X x B’)U(Y N(A x B'))) < X UVa, (Y N(A x B’))’. Now L clearly 
contains “0? and singletons, and it is closed under binary unions since Sub(A) 
is a distributive lattice. Hence K(B) < L; in particular the top element of PB 
belongs to L, and so the result is established. 0O 


A similar argument shows that, in a topos where De Morgan’s law is satisfied 
(cf. 4.6.2), the sequent 


(a(Vy)¢ Fe (Ay)>¢) 


is valid for any ¢ with free variables in the string r, y, provided the sort of y is 
K-finite. Indeed, this may be taken to be yet another equivalent way of expressing 
De Morgan’s law in a topos, since it reduces to the usual formulation of the latter 
when the sort of y is 2. 

Although a subobject of a K-finite object need not be K-finite, the following 
rather technical lemma (due to T. Coquand [249]) provides a partial substitute 
which is of use in several contexts. 


Lemma 5.4.7 Let A be an object of a topos E, and let A,, A2, A3 be subobjects 
of A such that Ay < AgUA3 and A, is K-finite. Then it is locally true that A, is 
contained in the union of two K -finite subobjects of Ag and A3, i.e. there exists 
B-»1in€ and K-finite subobjects AL — B* A2, Ay — B* A3 in E/B such that 
B* A, < A% U AS. 


Classically (i.e. in a Boolean topos), this result would be a triviality, because 
we could simply take B = 1 and A, = Aı N A; (j = 2,3). However, using 5.4.14 
below it is easy to give examples in non-Boolean toposes where we really need 
to localize, i.e. there do not exist suitable K-finite subobjects with B = 1. 
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Proof It suffices to show that the sentence 


(Vu, v,w)(((u Ee "K(A))A(u < vVUw))> 
(wv < v) A w € "K(A)7) A (u < 0’ Uw))) 


(where all variables are of type PA) is valid in the internal language of €, since 
applying this twice will yield the result in the statement. To do this, we consider 
the subobject L of PA defined by the term 


{u | (Vu, w)((u < vu w)> (Av’)((u' < v) A (v ETK(A)) A (u < v' U w)))}, 


and show that it has the closure properties of 5.4.2. It clearly contains "07, since 
if u = "07 then we can take v’ to be "07 as well. If u € L and u’ = {xr} Uu, 
then from (u’ < v U w) we deduce the (local) existence of a finite v’ < v with 
u < v'Uw, and also ((x € v) V(x € w)). If (x € v) holds, we define v” = {x}Uv’, 
and if (x € w) we define v” = v’; in either case, it satisfies the conditions. O 


As well as finite coproducts, the class of K-finite objects is also closed under 
finite products. However, the proof of this fact is a little more complicated: in 
effect, it involves a ‘double induction’. 


Proposition 5.4.8 A product of two K -finite objects is K -finite. 
Proof Let 64.3: PA x B — P(A x B) be the name of the relation 


€a xB >> PAxAxBxB2=PAxBxAxB 


(where the monomorphism B — B x B is the diagonal). In ‘elementwise’ terms, 
64,5 sends (A’,b) to A’ x {b}; in fact it is a strength for the covariant power- 
object functor, in the sense of B2.1.4. Since the product of a K-finite object and a 
singleton is clearly K-finite, 04,8 restricts to a morphism K(A) x B— K(Ax B), 
which is a strength for the functor K. Hence, if A is K-finite, we have a diagram 


(Ta, lB) 04B 
B ——— >> K(A) x B —> K(Ax B) 


{0 


Pro 
Poem P(AxB) 


which is readily seen to commute, since both ways round name the same subob- 
ject of B x A x B. But Pr is a join-semilattice homomorphism, so the pullback 
of K(A x B) along it is a sub-join-semilattice of PB, and we have just seen that 
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it contains {}: B — PB; so it contains K(B). Thus if B is also K-finite, we 
deduce that the composite 


TB Pro 
1 ———> PB ——> P(A x B) 


factors through K(A x B); but this is precisely the top element of P(A x B). 
O 


We have seen that K is a functor E — E, and the factorization of the singleton 
map A» PA through K(A) clearly yields a natural transformation 7 from the 
identity functor to K. It is no great surprise that 7 is the unit of a monad 
structure on K; what is perhaps more unexpected is that the algebras for this 
monad turn out to be something very familiar. 


Proposition 5.4.9 For any A, K(A) is the free semilattice generated by A. 
In particular, the forgetful functor from the category SLat(€) of internal semi- 
lattices in E to E is monadic, even if E does not have a natural number 
object. 


Proof By definition, K(A) carries a semilattice structure, and K(f) is a 
semilattice homomorphism for each morphism f: A — B of E, since it is a restric- 
tion of the semilattice homomorphism 3f: PA — PB. So we can regard K as a 
functor E > SLat(€). 

Now let B be an internal semilattice in E£. The equalizer of the semilattice 
operation V: B x B — B and the second projection is a partial order on B; we 
may construct the object C — PB of subobjects of B having least upper bounds 
for this ordering, and since least upper bounds are unique when they exist, they 
define a morphism C — B. We claim that K(B) < C, so that this restricts to 
a morphism eg: K(B) — B. Clearly, C is a sub-join-semilattice of PB, since 
the zero element of B is a least upper bound for the zero subobject, and the 
least upper bound of the union of two members of C is the join of their least 
upper bounds; but the least upper bound of a singleton is its unique member, 
and so {}: B — PB factors through C. Thus the claim is established; moreover, 
it follows from the proof that €p is a semilattice homomorphism. 

Now suppose we are given an arbitrary morphism f: A — B, where B has a 
semilattice structure. Then the composite 


K(f) €B 
K(A) ——> K(B) ——> B 

is a semilattice homomorphism; and its composite with ņa is f since €gnp is 
the identity on B. But if we had two semilattice homomorphisms K(A) 3 B 
extending f, their equalizer would be a sub-semilattice of K(A) containing the 
image of ņa, and hence would be the whole of K(A). Thus the adjunction is 
established. 

Finally, the monadicity of the forgetful functor SLat(€) — € follows 
from 5.3.1. o 
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We may use 5.4.9 to deduce further properties of K-finiteness from known 
results about finitary algebraic theories. For example, it follows immediately 
from 5.4.9 and 5.3.7 that inverse image functors commute up to isomorphism 
with the functor K. But in fact a stronger result is true: arbitrary bicartesian 
(that is, cartesian and cocartesian) functors between toposes commute with K, 
even if they do not have right adjoints. Before proving this, we need an alternative 
universal characterization of K(A), which stands in the same relation to 5.4.9 as 
list objects do to free monoids. 

Given an object A of a topos €E, let us write T4 for the internal algebraic 
theory freely generated by one nullary operation and an A-indexed family of 
unary operations, so that a T,-algebra is simply an object B equipped with 
morphisms b: 1 — B and t: Ax B— B. (Thus a list object LA, if it exists, 
is an initial object of the category T4-Mod(€).) We shall say that a T4- 
algebra (B,b,t) is idempotent (resp. commutative) if it satisfies the sequent 


(T Fay (t(x, t(x, y)) = t(x, y))) (resp. 
(T Fray (t(x, t(2',y)) = t(x’, t(£,y))))); 


and we shall write S4 for the theory of idempotent commutative T 4-algebras. 
Clearly, the structure maps "07: 1 — K(A) and o4: A x K(A) > K(A) of 5.4.2 
make K(A) into an S,-algebra. 


Proposition 5.4.10 For any A, (K(A),°0',o,) is an initial object of the 
category S4-Mod(€). 

Proof Suppose given an arbitrary S4-algebra (B, b, t). Consider the transpose 
t: A— B® of t; let A — B® denote its image. Since (B,b,t) is idempotent, 
it is clear that A is contained in the subobject Idem(B) — B® of idempotent 
endomorphisms of B, defined by the term 


{f: BY’ | (fof=f)} 


(where we are using f og as a shorthand for the term Ax.app(f, app(g,z))). We 
define successively smaller subobjects T, S of B® by the terms 


{f: B® | ((f €Idem(B)") A (v9)((9 € 4) > (fog = 9°f)))} 


and 
{f: BP | (f ET) A (Yg) (g ET )>(f °g = 9° f)))} 


respectively. Since (B, b, t) is commutative, it is easy to see that we have A<s 
in Sub(B¥); but since the members of S commute with each other they are 
closed under composition, i.e. S is a sub-monoid of B®. By construction, it is 
also idempotent and commutative, so it is a semilattice. Hence by 5.4.9 we may 
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extend the composite f: A-» Å >— S uniquely to a semilattice homomorphism 
f: K(A) — S. Now the composite 


f 1xb 
Ki Ay a pe pe Se 


is easily seen to be a homomorphism of S,-algebras; and it is unique, since the 
equalizer of two such homomorphisms would be a sub-S,4-algebra of K(A), and 
hence would be the whole of K(A) by 5.4.2. 


In fact S4-Mod(€), like SLat(€), is monadic over E even if E does not 
have a natural number object; the free S,-algebra on an arbitrary object B is 
simply K(A) x (1 H B). However, we shall not need this; instead, we turn to a 
characterization of K(A) which is similar to Freyd’s characterization of natural 
number objects and list objects: 


Corollary 5.4.11 Let A and B be objects of a topos. Then B = K(A) iff there 
exist morphisms b: 1 — B and t: A x B — B for which the coherent sequents 


(T Fay (t(z, t(z, y)) = t(x, y))) 
(T Fry (t(x, t(x, y)) = t(2’, t(x, y)))) 
((t(z,y) = b) Fay L) 
((t(x, t(x’,y)) = t(2', y)) Faey ((£ = 2’) v (t(x, y) = y))) 


and 
(T Fy ((y = b) V (Gx) (t(x, y) = y))) 


are satisfied (where x and y are variables of types A and B respectively), and the 
diagram 


t 
AxB -S B> 


n2 
is a coequalizer. 


Proof If B= K(A),then'0': 1— B andthe g4: A x B — B are easily seen 
to satisfy the given conditions. In particular, the last displayed sequent expresses 
the fact, which we noted after 5.4.5, that any K-finite object is either empty or 
inhabited. To verify the last condition, let R =< B be the equivalence relation 
generated by the parallel pair (o4,72) (i.e. the kernel-pair of its coequalizer); 
then it is easy to see that the R-equivalence class of 0" satisfies the closure 
conditions of 5.4.2, and is thus the whole of K(A). 

Conversely, suppose B satisfies the conditions. The first two say that (B, b, t) 
is an S,-algebra, so by 5.4.10 we have a unique homomorphism f: K(A) > B. 
But the last two conditions imply that B satisfies the analogue of Peano’s fifth 
postulate: that is, any subobject B’ ==> B which is a sub-S,-algebra must 
be the whole of B. (The proof of this is essentially the same as the proof 
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of (ii) = (iii) in 5.1.2.) Now we define a morphism g: B — PA by the term- 
in-context y.{x: A | (¢(x, y) = y)}. Clearly, we have (T F- (g(b) = "07)) and 
(T Fay (g(t(x, y)) = {x} Ug(y))) by the third and fourth displayed sequents, so 
the subobject B’ — B defined by the term {y: B | (gly) E "K(A)")} is a sub- 
Sa-algebra, and hence we may regard g as a morphism B — K(A). Moreover, 
it is clearly an S,-algebra homomorphism; and the composite fg (resp. gf) is 
equal to the identity, since in each case the equalizer of the composite and the 
identity is a sub-S,4-algebra of B (resp. K(A)). O 


Corollary 5.4.12 Let F:E— F be a bicartesian functor (for example, an 
inverse image functor). If A is a K -finite object of E, then F(A) is K-finite in F. 


Proof By 5.4.11 (or by 5.4.9 and 5.3.7, if F is an inverse image functor), 
we have an isomorphism F'(K(A)) = K(F(A)); and this is easily seen to be an 
isomorphism of internal posets. But F(K(A)) clearly has a greatest element if 
K(A) has. oO 


An alternative proof of 5.4.12, in the case when the toposes € and F have 
natural number objects, may be deduced from A2.5.7 and the fact (3.2.10) that 
K-finite objects are exactly the models of a o-coherent theory K; but the latter 
identification depends on 5.4.13 below, as we remarked in 1.2.15(k). 

The converse of 5.4.12 does not hold even if F is conservative. It follows from 
5.4.12 that if A is a K-finite object in a functor category [C, Set], then A(c) is a 
finite set for each object c of C; but this condition is not sufficient for K-finiteness 
of A, as we shall see in 5.4.14 below. 

An important consequence of 5.4.9 is that it enables us to determine the 
precise relationship between K-finiteness and cardinal-finiteness in the case when 
E has a natural number object. 


Theorem 5.4.13 Let € be a topos with a natural number object, A an object of 
E. Then A is K-finite iff it is locally a quotient of a finite cardinal, i.e. iff there 
exist B —> 1 in E, a natural number p in E/B and an epimorphism |p] > B*(A). 


Proof First we observe that finite cardinals are K-finite: this is an immediate 
consequence of 5.2.1 and 5.4.4(i) and (iii). Hence, by 5.4.4(ii), any quotient of 
a finite cardinal is K-finite; and since K-finiteness is equivalent to the validity 
of a sentence in the internal language of €, it is reflected by conservative logical 
functors such as B*. This proves the right-to-left implication. 

For the converse, we observe that, since the algebraic theory of semilattices 
is a quotient of that of monoids (or, if you prefer, because S4 is a quotient 
of T4), the free semilattice K(A) on an object A must be a quotient of the 
free monoid L(A) = Ey(N*(A)°) constructed in 5.3.3. Since the epimorphism 
L(A) — K(A) is a monoid homomorphism, it is easy to see that its ‘elementwise’ 
description must be as the operation which sends a morphism [p] — A to its 
image considered as a K-finite subobject of A. In particular, if A is K-finite and 
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we form the pullback 


p 


TA 


L(A) ——> K(A) 


then the natural number p in €/B and the morphism [p] — B*(A) which corre- 
spond to the left vertical map in this diagram have the required properties. O 


Example 5.4.14 Let C be a small category. Then an object A of the functor 
category |C, Set] is K-finite iff each A(c), c € ob C, is a finite set, and each A(f), 
f:c— d in C, is surjective. For we saw in 5.2.8(a) that the finite cardinals in 
[C, Set] are finite-set-valued functors which are constant on each connected com- 
ponent of C. Clearly, if A is a quotient of such a functor, then all the A(f) must 
be surjective; and this property is ‘reflected by localizations’, so the necessity 
of the condition is established. For the sufficiency, observe that if the condition 
holds then, for each object c of C, H*(A) is a quotient of H*([n]) in [C, Set]/H,, 
where n is the number of elements in A(c) (and H, denotes the functor C (c, —)); 
and the coproduct of all the He has global support. 


Remark 5.4.15 In contrast to finite cardinals, K-finite objects are not inter- 
nally projective in general; nor is exponentiation to the power of a K-finite object 
preserved by inverse image functors (cf. 5.2.9(ii) and 5.2.11). For a counter- 
example to both, let € = [2, Set] be the Sierpiński topos, and consider the 
object A = (2 — 1), where n denotes an n-element set. It is clear that we have 
an epimorphism [sso] > A, and hence that A is K-finite; but (—)^ does not 
preserve this epimorphism, since [sso] and A^ respectively look like (2 — 2) 
and (4 — 1). Also, if c: Set — [2, Set] is the closed point of £, then c* fails 
to preserve the exponential [sso]4. It is true in the Sierpiński topos that B4 is 
K-finite provided both A and B are (and it is true in any topos with a natural 
number object that an object of the form Bl?! is K-finite if B is, by an easy 
induction using 5.4.8), but in general the class of K-finite objects can fail to 
be closed under exponentials: for a counterexample, take € to be the topos of 
diagrams of shape (èe « e — e) in Set, and consider the exponential [sso]4, 
where A is the K-finite object (1 — 2 — 2). 


In general, the K-finiteness of an object A neither implies nor is implied by 
that of its power object PA. It is easy to give examples of toposes in which Q = 
P(1) is not K-finite: one such is the functor category whose subobject classifier 
is displayed before A1.6.10. On the other hand, the nontrivial subterminal object 
U = (0 — 1) in the Sierpiński topos [2, Set] is not complemented and hence not 
K-finite, but PU is isomorphic to 1111 and hence K-finite. In contrast, we have: 


Proposition 5.4.16 An object A is K-finite iff K(A) is K-finite. 
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Proof First suppose A is K-finite. We may form the subobject L — PA 
of subobjects A’ > A such that K(A’) is finite, as the interpretation of an 
appropriate formula in the internal language of €. Our aim is to show that L isa 
sub-join-semilattice of PA and contains {}: A — PA. The proof that it contains 
0 and singletons is easy, since we know that K(0) = 1 and K(1) = 1U 1 are 
both K-finite. So suppose we have subobjects A’ and A” such that K(A’) and 
K(A") are both K-finite. Then the epimorphism A’ U A” —> A’ U A” induces 
an epimorphism K(A’ II A”) = K(A' U A”), by 5.4.3(ii); but AK(A’ I A”) S 
K(A’) x K(A”") by 5.4.5(iii), and so it is K-finite by 5.4.8. Hence K(A’ U A”) 
is K-finite by 5.4.4(ii). Thus L contains K(A), and in particular T 4 factors 
through it; i.e. (A) is K-finite. 

Conversely, suppose K(A) is K-finite. We note that the counit map 
€x(a): K(K(A)) — K(A) is order-preserving (since it is a semilattice homomor- 
phism) and (split) epimorphic; so it must map the greatest element of K(K(A)) 
to a greatest element for K(A). So A is K-finite. O 


The operation A —> K(A) thus acts something like a ‘power-object functor’ 
for the subcategory Ex s of K-finite objects of E. Of course, it does not in general 
have all the properties of the power-object functor in a topos: in particular, 
it does not normally carry the structure of a contravariant functor, since the 
pullback of a K-finite subobject B’ — B along a map A — B need not be 
K-finite, even if A and B both are. However, if we restrict our attention to 
objects which are decidable (i.e. have complemented diagonal) as well as K-finite, 
this problem can be overcome: 


Lemma 5.4.17 If A ıs decidable, then every K-finite subobject of A 1s 
complemented; in fact K(A) < 24 as subobjects of PA. 


Proof 24 is a sub-join-semilattice (in fact a sublattice) of PA, since finite 
unions of complemented subobjects are complemented. The assertion that A 
is decidable says precisely that {}: A — PA factors through 2“, so the second 
assertion of the lemma is immediate from the definition of K (A). The first follows 
from applying the second to global elements of PA. o 


If A is K-finite, then applying the proof of 5.4.4(iv) to the generic comple- 
mented subobject of A in €/(24) yields the information that 24 < K(A) in 
Sub( PA); so if A is both decidable and K-finite then K(A) coincides with 24. 
We may thus conclude: 


Theorem 5.4.18 For any topos E, the full subcategory Eaxs of decidable 
K-finite objects of E ıs a Boolean topos. In particular, if E itself ıs Boolean, 
then Egy 1s a topos. 


Proof A product of two K-finite objects is K-finite, as we saw in 5.4.8; and 
a product of two decidable objects is decidable, by A1.4.15. Since 1 is both 
decidable and K-finite, it follows that Eag ¢ is closed under finite products in £. 
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It is also closed under equalizers, since the equalizer of two morphisms A = B is 
a complemented subobject of A if B is decidable (and is therefore K-finite if A 
is), and any subobject of a decidable object is decidable. Thus €4x is cartesian. 
Also, monomorphisms in Eggy coincide with those in E; so every subobject in 
Eaxys is complemented, and hence K(A) & 24 has the universal property of a 
power object for A in Eak f, provided it lies in this subcategory. We already know 
it is K-finite, by 5.4.16; so we need only prove it is decidable. 

For this, we argue inductively as usual to prove that, for any K-finite A, 
24 is decidable. We may form the subobject L — PA of subobjects A’ — A 
for which 24’ is decidable; this clearly contains 0 and singletons since 2° = 1 
and 2 are decidable, and it is closed under binary unions since Q4’UA" admits 
a monomorphism to 244” œ 24” x 24”, corresponding to the epimorphism 
A’ ll A” > A’ UA”. So K(A) < L, and hence if A itself is K-finite T4 factors 
through L. 

This completes the proof that Eqgxs is (a weak topos, and hence by A2.3.4) 
a topos. The second assertion follows since in a Boolean topos every object is 
decidable. O 


Example 5.4.19 Let C be a small category. Combining 5.4.14 and A1.4.16, we 
see that a functor A: C — Set is decidable and K-finite as an object of [C, Set] 
iff A(c) is finite for each object c of C and A(f) is bijective for each morphism f 
of C. Thus we see that the topos [C, Set]axs is equivalent to the topos [G, Setp] 
(cf. A2.1.5), where G is the groupoid reflection of C, i.e. the category obtained 
by freely adjoining inverses for all the morphisms of C. In particular, if C itself 
is a groupoid, then [C, Set]ax,s = [C, Set p]. 


More generally, by combining 5.4.13 and 5.2.6 we see that, in a topos with 
a natural number object, an object is decidable and K-finite iff it is locally 
isomorphic to a finite cardinal. 


Remark 5.4.20 It is possible for Exs to be a topos even if £ is not Boolean, 
but (perhaps surprisingly) this cannot happen unless it coincides with Eag s. In 
fact it can be shown that the following conditions on a topos E are equivalent: 
(i) Exy 1s a topos. 
(ii) Every K-finite object of € 1s decidable. 
(iii) Every subobject of a K-finite object of E 1s K-finite. 
(iv) The (external) lattece Subg(1) of subterminal objects of E is Boolean. 
) 


(v) Exs 1s closed under the formation of equalizers in £. 


= 


We omit the proof, which can be found in [543]. 


The fact that subobjects of K-finite objects are not in general K-finite may 
be ‘remedied’ by modifying the definition in either of two ways. We could sim- 
ply consider the class of objects which occur as subobjects of K-finite objects 
(cf. 5.4.20(iii)); however, this yields a condition which is not local (i.e. an object 
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A of E may be such that B*A is a subobject of a K-finite object of E/B for 
some B — 1, without A being a subobject of any K-finite object of £). A better 
approach is to replace the subobject {}: A — PA in the definition of K (A) by 
the partial-map classifier A, regarded (as in A2.4.10) as ‘the object of subobjects 
of A having at most one element’. 

_ Formally, we define K(A) to be the sub-join-semilattice of PA generated by 
A> PA. 


Lemma 5.4.21 K(A) is downwards-closed as a subobject of PA. 
Proof For any subobject B — A, the mapping (—)N B: PA — PA is a join- 


semilattice homomorphism (since PA is a distributive lattice) and it maps A 
into itself since A is downwards-closed. So it maps K(A) into itself. Since this is 


true for all B, K(A) is downwards-closed. o 


Thus we see that, for a given A, the top element ' A’ of PA factors through 
K(A) iff the latter is the whole of PA; we say that A is K-finite if these conditions 
hold. The elementary properties of the functor K, and of K-finite objects, may 
be developed along the same lines as those of K in the earlier part of this section; 
but it is quicker to use 


Proposition 5.4.22 An object A is K-finite iff it is locally a subobject of a 
K -finite object. 


Proof Since K(A) is generated by A as asemilattice, it must be an epimorphic 
image of the free semilattice K (A). It is easy to see that in ‘elementwise’ terms 
the epimorphism K(A) > K(A) sends a K-finite subobject B = A to the 
subobject B’ — A defined by the pullback 


BSS 4 . 


B>—> A 


for this operation is a join-semilattice homomorphism, and has the right effect 
on the generators. So, if Al: 1 > PA factors through K(A) —> PA, then by 
forming the pullback : 

c ——> K(A) 


rA)? m 
1 ——> K(A) 


we obtain a K-finite subobject B of C*(A) for which the corresponding B’ is the 
whole of C*(A). Conversely, if A is a subobject of a K-finite object B, then the 
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image of the map B > A corresponding to the partial map (A >—> B, 14) defines 
an element of K(A) whose image under K(A) > K(A) is A’. And since K- 
finiteness is a local property by the way in which it is defined, we may conclude 
that any object which is locally a subobject of a K-finite object is A-finite. O 


In particular, if € has a natural number object, then an object of € is K-finite 
iff it is locally a subquotient of a finite cardinal, by 5.4.13 and 5.4.22. 

K-finiteness, like K-finiteness and cardinal-finiteness, corresponds to an 
induction principle: if P is a property of subobjects of A (expressible by a for- 
mula in the internal language of £) such that P holds for all subsingletons (that 
is, all subobjects U — A whose domains are subterminal objects), and P holds 
for a union Bı U B2 provided it holds for By and B2, then P holds for all K-finite 
subobjects of A. In practice, this induction principle is less useful than that asso- 
ciated with K-finiteness, since it is hard to ‘get the induction started’; i.e. the 
class of properties which hold for all subsingletons is uncomfortably restricted. 
However, we note one case where it may be applied: 


Proposition 5.4.23 The category Ex, of K-finite objects of E is a logical 
subtopos of E iff Q is K -finite. 


Proof Clearly, if Ez ș is a logical subtopos then it must contain Q; but 2 is 


injective by A2.2.6, so if it is K-finite then it is (locally, and hence globally) 
K-finite. 

Conversely, suppose 2 is K-finite. It follows easily from 5.4.22 that Ez 
closed under finite limits (and finite colimits) in E; so we need only prove a 
a power object PA is K-finite (equivalently, K- finite) if A is K-finite. For this 
we use the induction principle mentioned above: consider the sentence which 
asserts about a subobject B of A that PB is K-finite. It holds for subsingletons, 
since if B is subterminal then PB is a retract of P1 = Q, by A2.2.5. And if 
B = B; U Bo, then PB is a retract of P(B, U B2) = PB, x PB, so the result 
follows from 5.4.8. c 


Corollary 5.4.24 IfE ıs a topos whose subterminal objects form a generating 
family (for ezample, a localic Set-topos), then exe is a topos uf Q is K-finite 
am E. 


Proof One direction follows from 5.4.23. Conversely, suppose € f is a topos; 
let Q’ denote its subobject classifier. The generic monomorphism in €; Ky is a 
monomorphism in £, so it is classified by a morphism Q’ — Q; but since every 
subterminal object of £ is in €z p it follows easily from the hypothesis that this 
morphism is an isomorphism. So Q is K-finite in £; as in the proof of 5.4.23, this 
implies that it is K-finite. O 


In general, however, Eg ys may be a topos without being a logical subtopos 
of €. Examples may be constructed using 5.4.20 above, or alternatively using 
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the following: 


Example 5.4.25 IfE is a functor category [C, Set], then a functor A: C — Set 
which is K-finite as an object of € must take finite sets as values, though its 
transition maps need not be either monic or epic. However, in general not every 
finite-set-valued functor on C is K-finite as an object of [C, Set], even if C itself 
is finite. For if C contains a parallel pair (f,g) such that f # g but fh = gh 
for some h, then it is clear that any K-finite object A of [C, Set] must satisfy 
A(f) = A(g); so this condition is inherited by any subobject of a K-finite object, 
and also by any K-finite object (the latter because, if B is a well-supported 
object such that B*(A) is a subobject of a K-finite object, then B(c) must 
be nonempty, where c is the domain of h). For finite categories C, this is the 
only obstruction to K-finiteness: of C is a finite category, then |C, Set]; , may 
be identified with [C/R, Sets], where R ıs the congruence on C generated by all 
pairs (f,g) for which there exists h with fh = gh. In particular, [C, Set] z sisa 
topos for any finite C, although if the congruence R is nontrivial it need not be 
a logical subtopos of [C, Set]. 

To prove this, consider first the case when C has an initial object (so that 
R consists of all parallel pairs in C). If A: C — Set; is a functor respecting the 
congruence R, we define 


A(c) = ( aa) =, 
H, / 


for each c € ob C, where =, is the equivalence relation which identifies z € A(d) 
with y € A(e) iff there exist f: d  c and g: e > c such that A(f)(x) = A(g)(y). 
(The fact that A respects R ensures that this is indeed an equivalence relation.) 
It is easy to see that if there exists a morphism c — d in C then =, C =q; so the 
assignment c ++ A(c) may be made into a functor C — Sets whose transition 
maps are surjective, and which is therefore K-finite by 5.4.14. Moreover, =e 
cannot identify distinct elements of A(c), so we have an injective map A(c) — 
A(c), which is easily seen to be the c-component of a natural transformation 
A — A. So we have expressed A as a subobject of a K-finite object of [C, Set]. 

For the general case, let c be an arbitrary object of C and let H, denote 
the functor C (c, —). The total category of the discrete opfibration corresponding 
to H, (as in A1.1.7) is the co-slice category c\C, which has an initial object; 
moreover, if a functor A respects the congruence R on C, then H7(A), regarded 
as a functor on c\C, respects the corresponding congruence on the latter. So if 
A also takes finite sets as values, we see that H(A) is a subobject of a K-finite 
object of |C, Set]/H.; and since the coproduct of all the He, c € ob C, is well- 
supported it follows that A is locally a subobject of a K-finite object. So the 
result follows from 5.4.22. 

Note in particular that if every morphism of C is epic, then the congruence 
R is trivial, and so [C, Set]; , = [C, Set] is a logical subtopos of [C, Set]. (Of 
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course, it is easy to see that in this case Q has surjective transition maps, and 
is therefore K-finite.) However, the reader should be warned that the quotient. 
category C/R may not have the property that all its morphisms are epic; it may 
be necessary to iterate the quotient construction a finite number of times before 
we arrive at a category C/R with the latter property. Moreover, the objects 
of |C, Set] which are globally expressible as subobjects of K-finite objects are 
exactly the finite-set-valued functors which respect. this larger congruence R; so, 
as we claimed earlier, the latter property is not equivalent to K-finiteness. For 
more details, see [543]. 

For an infinite category C, there are further restrictions on the objects of 
[C, Set] z F For example, if C is the ordered set of natural numbers, then it 
is easy to see that the functor A defined by A(n) = {0,1} for each n, with 
each transition map A(n) —> A(n + 1) sending both 0 and 1 to 0, is not K- 
finite. Moreover, [C, Set] z yf ÌS not a topos in this case, since it lacks a subobject 
classifier. (Recall that we pictured the subobject classifier of [C, Set], for this 
particular C, before A1.6.10. The object A which we have just defined occurs 
naturally as a subobject of Q; in fact it is the Higgs object of the topos (C, Set], 
as defined before 4.5.10.) 


In 5.2.14 and 5.2.15 we saw one way in which preservation of finite colimits 
may be broken down into preservation of finite colimits and of filtered colimits. 
The notion of K-finiteness gives us another way of doing this, which is simpler 
in that it does not require the topos to have a natural number object; however, 
it works only for indexed preorders. The reason is that, if we are dealing with 
an indexed category C which is not a preorder, then an object A in a fibre C7 
of C will not in general satisfy A = A + A; so, if we are given a morphism 
z: [p] — J to J from a finite cardinal [p], with (K-finite) image m: J’ — J, then 
the coproducts 4),)2*A and Xj-m"*A will in general be different. But if C is an 
indexed preorder, then they coincide; so we can forget about the L(J)-indexed 
family of all morphisms from finite cardinals to J, and work instead with the 
poset AK (J) of K-finite subobjects of J. Thus we have 


Proposition 5.4.26 Let S be a topos, C and D two cocomplete S-indexed 
preorders which are stacks for the coherent coverage on S, and F:C — D an 
S-indezed functor. 


(i) If F preserves (fibrewise) finite joins, then it preserves joins indexed by 
K-finite objects of S, i.e. the diagram 


I 
cE sp 
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commutes up to isomorphism whenever z:I— J is a K-finite object 
of S/J. 

(ii) If F preserves finite joins and S-indezed directed joins, then it ıs 
S-cocontinuous. 


Proof (i) For simplicity, we consider the case J = 1. Given an object A of 
C!, we may construct in S the subobject L — PI indexing those subobjects 
I’ — I such that the square commutes for A restricted to C7; the fact that C 
and D are stacks and F preserves fibrewise finite joins easily implies that L is a 
sub-join-semilattice of PI, and it contains singletons since the vertical edges of 
the diagram reduce to identity maps if I’ is a singleton. So K(J) < L, and in 
particular the top element "I? of PI factors through L. 

(ii) Again, we deal with the case when J = 1. For any I, K(J) is a directed 
internal poset in S, because it is a join-semilattice; and we may compute the join 
of an J-indexed family A € ob C! by first computing the joins of all its K-finite 
subfamilies, and then forming the directed join of the latter. So if F commutes 
with joins of both types, then it commutes with arbitrary joins; but this suffices 
for cocontinuity, since we are dealing with preorders. oO 


Suggestions for further reading: Acuña-Ortega & Linton [14], Blass [132], 
Freyd [373], Johnstone & Linton [543], Kock et al. [635]. 


D5.5 Orbitals and numerals 


We observed at the start of the previous section that, in a topos without a natu- 
ral number object, we cannot hope to have an object parametrizing all the ‘finite’ 
objects of the topos, for any reasonable notion of finiteness. Nevertheless, it does 
seem reasonable to expect that we can find some non-local notion of ‘counting’ 
in such toposes. The theory of numerals, introduced by P. Freyd [373], pro- 
vides such a notion, which relates well to K-finiteness (and to cardinal-finiteness, 
when the topos does have a natura] number object). In this account, we shall 
approach it via the theory of orbitals, which was developed more recently (but 
independently) by J. Bénabou and B. Loiseau [106]. 

The starting-point of the theory is an idea which we have already exploited 
in 5.1.1. As in Section D5.1, we shall write T for the algebraic theory freely 
generated by one nullary and one unary operation. 


Definition 5.5.1 By an orbital in a topos E, we mean a T-algebra (A, a, t) 
which ‘satisfies Peano’s fifth postulate’ internally, i.e. satisfies the sentence 


(Vw: PA)((w is (a, t)-closed) > (w =" A”)). 


In this terminology, then, 5.1.1 says that every T-algebra contains a unique 
orbital (and that in an orbital the morphisms a and t are jointly epic). Hence we 
do not have to worry about interpreting the displayed sentence internally; if it is 
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externally true that a T-algebra (A,a@,t) has no proper (a, t)-closed subobjects, 
then the orbital which it contains must be the whole of A. 


Lemma 5.5.2 
(i) Let (A,a,t) and (B,b,u) be orbitals in a topos E. Then there exists at 
most one T-algebra morphism A — B, and if such a morphism ezists it is 
an epimorphism (in €E). 

(ii) If (A,a,t) is an orbital and f:(A,a,t) » (B,b,u) is an epimorphism 
(in E) and a T-algebra homomorphism, then (B,6,u) is an orbital. 

(iti) If we write (A,a,t) > (B,b,u) when there exists a morphism of T-algebras 
(A,a,t) > (B,b,u), then the resulting preorder Orb(E) has finite joins and 
finite nonempty meets; and it has a greatest element iff E has a natural 
number object. 

(iv) If we define an E-indexed category Orb(€) by Orb(€)/ = Orb(E/T), then 
this category has E-indezed colimits; and ıt has E-indered lmits af E has 
a natural number object. 


Proof (i) Given two T-algebra morphisms f,g: A = B, their equalizer is an 
(a, t)-closed subobject of A, and so must be the whole of A; similarly, the image 
of any homomorphism f must be a (b, u)-closed subobject of B. 

(ii) Pullback along f maps (b, u)-closed subobjects to (a, t)-closed subobjects, 
and it preserves properness of subobjects. 

(iii) The terminal object 1 of T-Mod(€) is clearly an orbital, and provides 
a least element of Orb(€). To form the join (A,a, t) U (B, 6, u) — we shall denote 
joins and meets in Orb(€) by U and N rather than V and A, for reasons which 
will become clear later - we observe that (A x B,(a,b),t x u) is a T-algebra, 
and so contains an orbital A U B which admits morphisms to both A and B. 
Moreover, if (C,c,v) is another orbital which does this, then the image of the 
induced morphism C > A x B is (an orbital, and hence) equal to AU B. So 
AUB is a least upper botind for A and B in Orb(€). To form the meet of A and 
B, consider the pushout 


AUB A 


B ANB 


in E. It is easily seen that AN B can be given a unique T-algebra structure such 
that the morphisms A > AM B and B > AN B are homomorphisms, and it 
is an orbital by (ii). But any pair of morphisms (A — C, B — C) where C is 
an orbital must factor through this pushout, since the category of orbitals is a 
preorder; hence AN B is a greatest lower bound for A and B. Finally, if N exists 
then it is initial in the category of T-algebras and hence (since we have reversed 
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the arrows) a greatest element in Orb(&); conversely, a greatest element, if it 
exists, admits a unique homomorphism to each orbital and hence (by 5.1.1) to 
each T-algebra, so it is a natural number object. 

(iv) Since the property of being an orbital is defined by a sentence in the 
internal language, it is preserved by logical functors such as pullback functors; 
so the preorders Orb(&/I) do indeed fit together to form an €-indexed preorder. 
We may similarly observe that the process of cutting down a T-algebra to its 
orbital, which provides a right adjoint to the inclusion Orb(€)°? — T-Mod(€), 
commutes with pullback functors; so Orb(€)°? is coreflective in the €-indexed cat- 
egory of T-algebras, and hence inherits €-completeness (and €-cocompleteness, 
if € has a natural number object) from the latter. O 


Now suppose given an object A equipped with an endomorphism t: A —> A. 
We may form the object C(A, t) — PA of t-closed subobjects of A (i.e. those 
A’ >= A satisfying A’ < t*(A’)). This subobject is clearly closed under arbi- 
trary (internal) intersections in PA, and so the inclusion C(A, t) — PA has 
a left adjoint, giving rise to a closure operation c: PA — PA. We shall write 
O: A — PA for the composite of c with the singleton map {}: A — PA, and 
call it the orbit map associated with t. Clearly, for any a: 1 — A, the subobject 
of A named by the composite Oa is an orbital (with distinguished element a). 

We define a binary relation < on A by setting x < y iff O(y) < O(2); i.e. 
<>» Ax Ais the pullback along O x O of the opposite of the usual order-relation 
on PA. We also write x < y for tz < y. By the cyclic part C(A) of A, we mean 
the subobject {z: A | z < z}; we say (A,t) is acychc if its cyclic part consists 
entirely of fixed points of t, i.e. C(A) — A is the equalizer of t and 14. 


Lemma 5.5.3 With the above notation, 


(i) < ıs a preorder on A. 

(ii) If f: (A,t) > (B, u) is a morphism of objects-with-an-endomorphism, then 
f is an order-preserving map (A, <) — (B,x). In particular, t is an 
order-preserving map (A, <) > (A, 3). 

(iii) We have (Vz, y: A)((z < y) ((z < y) v (z = y))). 

(iv) If (A,a,t) ıs an orbital, then ıt satisfies (Vz : A)(a < z) and 
(Yz, y: A)((z Sy) V (y $2). 

(v) (A, t) ıs acyclic iff < is a partial order on A. 


Proof (i) It is clear that < inherits reflexivity and transitivity from the 
ordering on PA. 

(ii) It is also clear that if f satisfies the hypothesis, then, for any z: A, Of(z) 
is the image of O(z) under f, since it is an orbital with distinguished element 
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f(z). In other words, the diagram 
f 
A ——> B 


O O 


sf 
PA ——> PB 
commutes. So the result follows from the fact that 3f is order-preserving. The 
second assertion follows from the fact that t commutes with itself. 

(iii) It is again clear that z < y is equivalent to y € O(z); but since O(z) is 
an orbital, this is in turn equivalent to ((x = y) V (Sz)((z € O(z)) A(tz = y))) by 
5.1.1. From the second half of this disjunction, we deduce (tz < y) (i.e. (x < y)) 
since t is order-preserving; so we have proved the left-to-right implication in (iii). 
But the right-to-left implication follows trivially from (i), since x < tz. 

(iv) The first assertion is clear since Oa is the whole of A. For the second, 
consider the subobject A’ — A which is the interpretation of 


{x: A | (Vy: A)((x < y) V (y X z))}; 


we shall show that A’ is (a, t)-closed, and hence the whole of A. Clearly a factors 
through A’, by the result just established. Suppose (x € A’); then for any y 
we have ((z < y) V (y < 2)). But by (iii) the first possibility is equivalent to 
((x = y) V (tx < y)); and both (x = y) and (y < z) imply (y x tz). So we have 
shown that ((x €' A" )=> (tz €°A)). 

(v) Suppose (A, t) is acyclic. Then from ((z < y) A (y < z)) we may deduce 
((x = y) V ((@ <x y) A(y < 2))), by (iii), and hence ((x = y) V (z ~ 2)), ie. 
((x = y) V(x €'C(A)")). But from (x € "C(A)") we deduce (xz = tz) and hence 
(O(x) = {x}) since {x} is t-closed; so this possibility also leads to (x = y). So 
we have shown that < is antisymmetric, and hence a partial order. Conversely, 
suppose ~ is antisymmetric; then from (x < x) we deduce ((x < tz) A (tz < 2)), 
and hence (x = tz). So C(A) is contained in the equalizer of t and 14; but the 
reverse inclusion always holds. g 


We define a numeral to be an acyclic orbital. This is not Freyd’s original 
definition; he took the partial order <, as well as the morphisms a and t, as part 
of the primitive structure of a numeral. However, the following lemma shows 
that Freyd’s definition is equivalent to the one just given. 


Lemma 5.5.4 Let (A,a,t) be an orbital, and suppose given a partial order < 
on A such that t ıs order-preserving and a ıs the least element of A. Then < 
coincides with <; in particular, (A,a,t) is a numeral. 


Proof First we observe that we have (Vz: A)(x < ta), since the subobject of 
A which is the interpretation of {x: A | x < tz} is (a, t)-closed by the hypotheses 
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on <. It is now clear that (x < y) implies (x < y), since {y: A | z < y} isa 
t-closed subobject of A containing z, and hence contains O(x). For the converse, 
consider the subobject A’ — A which is the interpretation of 


{x: A | (Vy: A)((x < y)=> (z < y))}; 


we shall show that A’ is (a,t)-closed, and hence the whole of A. Clearly, a 
factors through A’, since Oa is the whole of A; so suppose (x € TA’), and 
let (tz < y). Since (x < tx), we have (x < y) and hence (x < y), so by 
5.5.3(iii) we have ((x = y) V (x < y)). But from the first possibility we deduce 
(tz < x), whence (x = tx) by antisymmetry of <, and so (tx < y); and the 
second possibility is exactly the statement that (tz < y). Thus we have shown 
that ((x € TA’) > (tz € "A’))), ie. A’ is t-closed. o 


Given an arbitrary orbital (A, a, t), we may consider the equivalence relation 
æ on A which is the intersection of < and its opposite. The quotient A = A/~ 
clearly inherits a T-algebra structure from A; and it is an orbital since we have 
an epimorphism A —> A. Since it also carries a partial order (the image of <) 
satisfying the conditions of 5.5.4, we deduce that it is a numeral. Moreover, any 
morphism from A to a numeral must factor through A — A, since it preserves 
< and hence %; so we have proved 


Corollary 5.5.5 The sub-preorder Num(E) of numerals in E is coreflective in 
Orb(E). o 


Moreover, the coreflection can be made into an E-indexed coreflection, as 
in the proof of 5.5.2(iv). Thus the €-indexed preorder Num(€) enjoys the same 
completeness properties as Orb(€) (cf. 5.5.2(iv)). 

Next, we prove a result which is the analogue for numerals of Freyd’s 
characterization of natural number objects (5.1.2). 


Proposition 5.5.6 Let (A,a,t) be a T-algebra in a topos E, and < a partial 
order on A such that t is order-preserving and a is the least element of A. Then 
(A,a,t) is a numeral iff it satisfies the following conditions: 


(i) (Vz: A)(a < tx); 

(ii) (a, t) is jointly epic; 

(iii) < is a total order; 

(iv) (Yz, y: A)(((x < y) A (y < tz)) > ((e = y) V (y = tz))); 

(v) the coequalizer of t and 14 is A > 1. 
Proof First suppose (A, a,t) is a numeral. Then we have already seen that (i) 
holds in the proof of 5.5.4, (ii) holds by 5.1.1, and (iii) holds by 5.5.3(iv). For 
(iv), we observe that (x < y) implies ((z = y) V (tz < y)) by 5.5.3(iii) and 5.5.4, 
and that ((tz < y)A(y < tz)) implies (y = tz) since < is antisymmetric. Finally, 
for (v), suppose we have a morphism f: A — B satisfying f = ft. Then f is a 
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T-algebra morphism (A, a,t) — (B, fa, 1g), so its image is the orbit O( fa) — B, 
which is clearly {fa}. Thus f factors through A — 1 (and the factorization is 
unique since A — 1 is split epic). 

Conversely, suppose the five conditions hold. Let A’ — A be the orbit named 
by Oa; we shall argue as in the proof of 5.1.2(ii) = (iii) to show that A’ is the 
whole of A. First of all, we claim that 


(Vz,y: A)(((x Sy)A(YETA))>(z ET A")); 
to prove this, let A” =— A’ be the interpretation of 
{y: A’ | (Va: A)((x < y)=> (z € TA))}. 


We claim that A” is (a, t)-closed, and hence the whole of A’. Clearly, a factors 
through A” since (x < a) implies (x = a) by antisymmetry of <. If (y € A”) 
and (x < ty), then we have either (x < y) or (y < x) by (iii); the first of these 
implies (x € ' A’), and the second implies ((y = x) V (x = ty)) by (iv), from 
which we again deduce (x € " A’"). Next, we claim that 


(Vz: A)((x ETAN) (tr €°A"))); 


the left-to-right implication is clear since A’ is t-closed, so suppose (tz € ' A’"). 
From (ii), we deduce ((tz = a) V (Sy: A’)(tx = ty)); but (tz = a) implies 
(a < x < tx = a) and hence (x = a), so (x € TA’). Given y such that 
((y €A’') A (tax = ty)), we have ((x < y) V (y < 2)) by (iii); the first possibility 
implies (x € A’) by the argument already given, and the second implies (y < x < 
tx = ty), whence ((y = x) V (x = ty)) by (iv), and either of these possibilities 
implies (x € TA’). 

Thus we have shown that A’ is S-closed, where S — A x A is the sym- 
metrization of the graph of t (i.e. the union of the graph and its opposite). 
Arguing exactly as in the proof of 5.1.2(ii) > (iii), we now deduce that it is T- 
closed, where T is the kernel-pair of the coequalizer of S = A. But T is the whole 
of A x A by (v); so, since A’ contains the element a, it must be the whole of A. 

OD 


Corollary 5.5.7 Any bicartesian functor between toposes preserves numerals. 


Proof Such a functor preserves the validity of the conditions of 5.5.6 (note 
that the first four of them are expressible as coherent sequents, and are thus 
preserved by arbitrary coherent functors). O 


Let A be an object of a topos equipped with a reflexive relation R: A A. 
Then the composite of R with the universal relation € 4; PA % A (cf. A3.4.5) 
defines a morphism r: PA — PA, which corresponds to the operation of compos- 
ing an arbitrary relation B + A with R. It is clear that r is order-preserving (for 
the usual ordering on PA) and inflationary (i.e. it satisfies (Yw: PA)(w < rw)). 
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So if we fix any particular subobject A’ — A and cut down the T-algebra 
(PA,° A,r) to an orbital, the latter will be a numeral by 5.5.4. Let us write 
B for this numeral, and consider the subobject A” — A which is the internal 
union of the B-indexed family of subobjects named by B — PA. We claim that 
A” is the R-closure of A’, that is the smallest subobject satisfying A’ < A” and 


(Vz, y: A)(((x €T AN) A ((z,y) ET R')) > (y € TA). 


For A” clearly has this closure property (since B is r-closed), and contains A’ 
since A’? factors through B. Conversely, if C — A is R-closed in the above 
sense, then "C7: 1 — PA is a fixed point of r; hence if we also have A’ < C then 
{w: PA |A" < w < "CO7} is a sub-T-algebra of (PA,"A’",r) and so contains 
B, i.e. (Vw: B)(w < °C”). 

We may now deduce, for toposes not necessarily having natural number 
objects, the analogue of a result which we proved for toposes with natural number 
objects back in A2.5.7: 


Proposition 5.5.8 LetT:E — F be a coherent functor between toposes. Then 
the following conditions are equivalent: 


(i) T is cocartesian (equivalently, preserves coequalizers); 
(ii) T preserves R-closures of subobjects, for any reflexive relation R; 
(iii) T preserves transitive closures of reflexive relations. 


Proof (i) => (ii): Let R — A x A be a reflexive relation in E, A’ — A 
an arbitrary subobject and A” > A its R-closure. Since a coherent functor 
preserves composition of relations, it sends A” to a TR-closed subobject of 
TA containing TA’; we have to show that it is no larger than the TR-closure 
C — TA of TA’. But we constructed A” above as the image of v: S — A, where 
(u,v): S —> Bx A is the relation corresponding to B — PA; so TA” is the 
image of Tv: TS — TA. Now consider 


{y: TB | {z:TA| (y,z) ETS} < °C }; 


it is not hard to verify that this is a (T" A’', Tr)-closed subobject of TB, but TB 
is a numeral by 5.5.7 and so it is the whole of TB. Hence TA” < C, as required. 

(ii) > (iii): Given a reflexive relation R — A x A, let B = Ax A and let 
Ř —> B x B be the relation defined by the term 


{(x,y,2,t): A? | (æ = z) A ((y,t) € "R°))} . 


If we take B’ — B to be the diagonal subobject A — A x A, then it is clear 
that the R-closure of B’ is just the transitive closure of R. 

(iii) => (i): Any coherent functor between toposes preserves coequalizers of 
equivalence relations; and it preserves the construction of the reflexive and sym- 
metric relation generated by a parallel pair of morphisms in €, since the latter 
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may be constructed using finite limits, finite coproducts and images. Hence if 
(iii) holds, T preserves the construction of the equivalence relation generated by 
a parallel pair. O 


So far in this section, we have not had much to say about finiteness as such. 
We now make good this deficit. 


Lemma 5.5.9 Let (A,a,t) be an orbital, and let C(A) denote the cyclic part 
of A. 


(i) (A,a,t) ts a natural number object iff C(A) = 0. 
(ii) A is K-finite iff C(A) is well-supported. 

(iii) The equalizer of t and 14 is a subterminal object. 
(iv) (A,a,t) is a K-finite numeral iff C(A) = 1. 


Proof (i) The left-to-right implication is clear, since the preorder < on N must 
coincide with the usual partial order by 5.5.4. Conversely, suppose C(A) = 0. 
Then the image of t is disjoint from a, since (tz = a) would imply (tz < x), i.e. 
(x E "C(A)"). So by 5.1.2 it suffices to prove that t is monic. Suppose (tz = ty); 
by 5.5.3(iv) we have either (x < y) or (y < x). If the former holds, then we 
have (x < y < ty = tz); but (A,t) is clearly acyclic, and so by 5.5.6(iv) we 
have ((x = y) V (y = tz)). But (y = tr) would imply (y € "C(A)"), which is 
impossible, so we have (x = y). The argument if (y < x) is similar. 

(ii) First suppose A is K-finite. We shall say that a subobject A’ — A is 
bounded if (Sy: A)(Vz: A’)(x < y); clearly, we may construct a subobject 
L — PA indexing the bounded subobjects of A. This subobject clearly contains 
0 — A’ and singletons; and it is closed under binary unions by 5.5.3(iv). So 
K(A) < L and hence A is bounded, i.e. (Ay: A)(Vx: A)(x < y). Substituting ty 
for x in this formula, we deduce that any such y must be cyclic. 

For the converse, we first assume that (A, t) is acyclic. Suppose (y € "C(A)”); 
then {x: A | (x < y)} defines an (a,t)-closed subobject of A (since (x < y) 
implies (tz < ty = y)) which must be the whole of A, i.e. y must be an upper 
bound for A. Now consider the subobject A’ — A defined by 


{y: A | {z: A | (x < y)} is K-finite} ; 


we shall show that A’ is (a,t)-closed, and so any bounded subobject of 
A (in particular, A itself) is K-finite. Clearly a factors through A’ since 
{x: A | (x <x a)} ={a}; and from 5.5.6(iii) and (iv) we obtain 


{z: A | (x 3 ty)} = {2: A | (£ < y)} U {ty}, 


whence (Vy: A)((y € TA’) > (ty €"A’))). 
Unfortunately, if (A,t) contains nontrivial cycles, the above argument will 
not work, since the sets {x: A | x < y} need not be K-finite in this case. For 
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such an A, we consider the morphism Ê: K(A) -+ K(A) given by 


(x(a) K(t)) U 
A) ——————-> K(A) x K(A) —> K(4A), 


and write O: A > P(K(A)) for the composite of the singleton map A > K (A) 
and the orbit map for t. We claim first that we have 


(Wx: A)(Vw: K(A))((w € O(2)) => (Sy: A)(éw = wu {ty})) . 


We prove this, for a given z, by induction over w in O(z): clearly, for w = {x} 
we may take y = z, and if w = tw’ and y’ satisfies the condition for w’, then ty’ 
satisfies it for y. Similarly, we may prove 


(Vz, y: A)(Vw: K(A))(((w € O(z)) A (y € w)) > (O(z) = w u O(y))) . 
The key idea is now to consider 
{w: K(A) | ((w € O(z)) A (Wy: A)(((y € w) A (ty = )) > (w = O(z))))} 


where x is a free variable of type A. Once again, we may show that this set 
contains {x} and is t-closed, so it is the whole of O(x). We deduce that 


((ay: A)((y € O(z)) A (ty = z)) > (O(a) is K-finite)); 


but we know that (3x: A)(tx < x), and this particular x must equal ty for some 
y E€ O(x). Now since Oa =" A", we have 


(aw: K(A))((w € O(a)) A (x € w)) 


for this particular z, but we then have A’ = w U O(z). So A is the union of 
two K-finite subobjects, and hence K-finite. 

(iii) Suppose (x = tx); then, as in the proof of (ii), we deduce that x is an 
upper bound for A, i.e. (Vy: A)(x € O(y)). But if we also have (y = ty) then 
O(y) = {y}; so any two fixed points of t must be equal. 

(iv) If A is a K-finite numeral, then C(A) is well-supported by (ii) and 
subterminal by (iii); so it is isomorphic to 1. Conversely, if C(A) = 1, then 
A is K-finite by (ii). Also, it is clear that if (x € "C(A)7) then we also have 
(tz €"C(A)"); so C(A) = 1 implies that any cyclic element x satisfies (x = tr). 

o 


Thus, in a 2-valued topos, every orbital is either K-finite or a natural number 
object. In the opposite direction, it is clear that every finite successor cardinal can 
be given the structure of a numeral: we can ‘truncate’ the successor morphism 
on N to yield a morphism [sp] — [sp] with the required properties. However, 
not every quotient of such a cardinal can be given a numeral structure or even 
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an orbital structure. For example, let C be the finite category with four objects 
a,b,c,d and three non-identity morphisms a — b,a — c,a — d, and let A € 
ob |C, Set] be the functor defined by A(a) = {0,1,2}, A(b) = A(c) = A(d) = 
{0,1} with A(a — b), A(a — c) and A(a — d) being surjections which identify 
the pairs (1,2), (0,2) and (0,1) respectively. Then it is easily seen that there is 
no way of defining a transitive ‘successor map’ on A(a) which is compatible with 
all three transition maps. (If we delete one of the three ‘legs’ of this diagram, 
we obtain a K-finite object which admits an orbital structure but no numeral 
structure.) 

But the class of K-finite numerals does contain more than just the finite 
cardinals: 


Lemma 5.5.10 
(i) If E has a natural number object, then Num(E) is equivalent to the opposite 
of (the standard indexing of) the internal poset of s-closed subobjects of N. 


(ii) Even if E does not have a natural number object, Num(E)°? is cartesian 
closed. 


Proof (i) For any numeral (A,a,t), we have a unique T-algebra morphism 
g: N — A, and A is characterized up to isomorphism by the set {n: N | g(n) = 
g(sn)}. Conversely, given an s-closed subobject B — N, the pushout 


B>——> N 


oB >——>A 


inherits a numeral structure from N; and since this square is also a pullback by 
A2.4.3, we can recover B as {n: N | g(n) = g(sn)}. Thus we have an order- 
reversing bijection between isomorphism classes of numerals in € and s-closed 
subobjects of N; and the same argument works in E/I for any J, so we have an 
equivalence of €-indexed categories. 

(ii) It suffices to prove that, given any numeral (A,a,t), the preorder 
{B € ob Num(€) | B < A}°? is cartesian closed. But, as in (i), we may iden- 
tify these numerals up to isomorphism with subobjects of A which are t-closed 
(equivalently, which are upwards closed for the relation <) and contain the sub- 
object {x: A | (x = tx)}. These clearly form a sublattice of Sub(A); they do not 
form a sub-Heyting-algebra, but they do have an implication defined by 


(BC) = {zx: A | (vy)(((z < y) A(YETB)) > (yETC))}. 
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In the situation of 5.5.10(i), it is clear that the K-finite numerals correspond 
to the inhabited s-closed subobjects of N, and the inhabited finite cardinals 
correspond to the s-closed subobjects with least elements — equivalently, to the 
complemented ones. Thus, for example, whereas we saw in 5.2.8(b) that cardinals 
in a spatial topos Sh(X) correspond to continuous (= locally constant) functions 
f: X — N, the finite numerals in this topos correspond to (everywhere-nonzero) 
upper semicontinuous functions, i.e. those for which {x € X | f(x) < n} is open 
for each n. (Explicitly, a numeral A corresponds to the function whose value at 
x € X is the number of elements in the stalk Az.) 

There is another important connection between numerals and K-finiteness. 
Given an arbitrary object A of a topos, consider the relation R — PA x PA 
defined by the term 


{z: PA x PA | ((fst(z) = snd(z)) v (3x: A)(snd(z) = fst(z) U {x}))}. 


Clearly, R is reflexive, and so we can use the ideas described before 5.5.8 to con- 
struct the R-closure of "07: 1 — PA; but, by 5.4.2, the latter is precisely K (A). 
Thus we have a numeral (n(A),0,s) (which we shall call the numeral associ- 
ated with A) and a relation ¢: n(A) % K(A) satisfying the coherent sequents 
($(0, w) AFw (w ="0")) and 


($(sn, w) AF nw ($(n, w) V (Bw: K(A)) (Sx: A)(o(n, w’) A (w = (w u {x}))))) 


(where n and w are variables of types n(A) and K(A) respectively). The idea is 
that n(A) ‘counts the number of elements required to cover each finite subobject 
of A’. 
Proposition 5.5.11 Let A be a K-finite object. Then 
(i) n(A) is K-finite; 
(ii) n(A) is characterized up to isomorphism in Num(E€) by the sequents above 
together with 
(p(n, A") {Fn (n = sn)); 
(iii) if A underlies a numeral (A,a,t), then n(A) is isomorphic to the numeral 


(A,a,t)* with underlying object 1 II A and structure maps v:1 — 10A 
and v(a,t): 1UA->1UA. 


Proof (i) We first establish the validity of 
(Vn: n(A))(Vw, w: K(A))((O(n, w) A (w < w)) > O(n, w’)) 


by showing that the set of n for which the condition holds is a (o, s)-closed 
subobject of n(A). Clearly, the condition holds for o since "0 ' is the least element 
of K(A). Suppose it holds for n; let B — A be a K-finite subobject for which 
we have ¢(sn," B7) and B’ — A a K-finite subobject contained in B. Then we 
have either ¢(n,"B") or B = B” U {a} for some a: 1 — A and some K-finite 
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B” such that ¢(n," B”7). In the first case, we clearly have ¢(n," B’) and hence 
o(sn,* B"). In the second case, things are more complicated: because the union 
B” U {a} is not necessarily disjoint, the intersections B’ N B” and B’N {a} need 
not be K-finite. However, we can prove 


(vw: K(B))((w <°B") v (3w': K(B))((w! < B”) A (w = w V{a}7))) 


by showing that the object of K-finite subobjects of B satisfying this condition 
is a join-semilattice and contains singletons; so we have either B’ < B” or there 
exists a K-finite C < B” such that B’ = C U {a}. In either case, we deduce 
ġ(sn," B’), as required. 

Now since A is K-finite, there exists n such that ¢(n," A”) holds. But this n 
satisfies (Vw: K(A))@(n, w), from which it follows that n = sn; so n(A) has an 
s-fixed point, and is thus K-finite by 5.5.9(ii). 

(ii) If we have ¢(n," A”), then by (i) we have n = sn. Conversely, if n = sn, 
then n is the <-largest element of n(A), and so ¢(n,' A”) follows from the fact 
that the mapping n> {w: K(A) | O(n, w)} is order-preserving. 

Conversely, if (B,b,t) is any numeral equipped with a relation y: B % K(A) 
satisfying the sequents before the statement of the proposition, then the name 
ry: B - PPA ofthe composite relation B + K(A) — PA is easily seen to bea 
morphism of T-algebras, where PPA is given the T-algebra structure ({"0'},r) 
used in the definition of n(A). So its image is precisely n(A). The additional 
condition in the statement of (ii) says precisely that, as a morphism B — n(A), 
r reflects the cyclic element; but since we have 


(Vy, z: B)((y = 2) V (y x z) V (z < y)) 


by 5.5.3(iii) and (iv), this easily implies that "%7 is injective and hence an 
isomorphism. 
(iii) Let |; A — K(A) be the morphism defined by the term 


fy: Aly x2} 


where z is a free variable of type A (note that the sets of this form are indeed 
K-finite, by the proof of 5.5.9(ii)). Now we claim that 


(Va: A)(Sn: n(A))(o(n, |x) A (Vm: n(A))(O(m, |x) (n < m))) ; 


we prove this by showing that the subobject of those x for which the condition 
holds is (a, t)-closed. The condition holds for x = a, since we have |a = {a}, and 
hence ¢(so, | a) (but not (0, |a)). Suppose x = ty and that the condition holds 
for y; let n be the <-least element of n(A) satisfying ¢(n, | y). Since |z =| yU{z}, 
it is clear that we have ¢(sn, | x); also, any m for which (m, | x) holds also 
satisfies ¢(m, | y) and hence n < m. And if $(n, | x) holds, then we must have 
|x =|y, whence ty = y, so | y is the whole of A and hence sn = n. Thus we see 
that sn is the <-least element of n(A) satisfying (sn, | x). 
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Since the least n associated with x by the above formula is clearly unique, we 
have thus defined a morphism f: A — n(A), which we may extend to a morphism 
f: 111A — n(A) by sending the extra element to o. From the construction, it 
is clear that f is monic, and that it is a morphism of T-algebras, where 1 II A is 
given the structure maps in the statement of the proposition; so by 5.5.2(i) it is 
an isomorphism. O 


Following on from 5.5.11(i), we define an object A of a topos to be R-finite if 
n(A) is K-finite. (R stands for ‘Russell’; intuitively, A is R-finite if there exists a 
finite bound for the sizes of its K-finite subobjects.) We note some of the basic 
properties of this finiteness notion: 


Lemma 5.5.12 
(i) An object A is R-finite iff there exists a K -finite numeral (B,b,t) and a 
relation %: B % K(A) satisfying the sequents (y(b, w) dFw (w = "0?)) 
and 


(W(ty, w) AF yw (Yy, w)V(Sw": K(A))(Ax: A)(o(y, w')A(w = (w'u{e}))))). 


(ii) Any subobject of an R-finite object is R-finite. 

(iii) Any K-finite object (cf. 5.4.22) is R-finite. 

(iv) A numeral is R-finite iff it is K -finite. 

(v) Bicartesian functors between toposes preserve R-finiteness. 


Proof (i) One direction is immediate by taking B = n(A). Conversely, if y sat- 
isfies the conditions, then as in the proof of 5.5.11(ii) we see that "w’: B — PPA 
induces a morphism of numerals B — n(A), so n(A) inherits K-finiteness from B. 

(ii) If A’ is a subobject of A, we may identify K(A’) with the sub-semilattice 
of K(A) defined by the term {w: K(A) | (Vz: A)((z € w) > (x € "A))} 
It is then easy to see that the relation @: n(A) + K(A) restricts to a relation 
n(A) % K(A’) having the properties in (i); so A’ is R-finite. 

(iii) By (ii) and 5.5.11(i), every subobject of a K-finite object is R-finite. But 
R-finiteness is clearly a local property, so the result follows from 5.4.22. 

(iv) is immediate from 5.5.11(iii) (whose proof did not in fact make use of 
the K-finiteness of A). 

(v) follows easily from (i), since bicartesian functors preserve K -finite numer- 
als by 5.5.7 and 5.4.12. o 


Example 5.5.13 The converse of 5.5.12(iii) is false. Let W +— Q be the Higgs 
object, as constructed in Section D4.5: it follows easily from the first displayed 
formula in the proof of 4.5.12 that W satisfies 


(vz, y, z: W)((x = y) V (y = 2) V (x = 2)), 


from which it follows that any inhabited K-finite subobject of W is a quotient 
of 2, and hence that n(W) < 3 in Num(£); so W is R-finite in any topos. But 
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we identified the Higgs object of the functor category [N, Set] (where N is the 
ordered set of natural numbers) at the end of 5.4.25, and we observed that it is 
not K-finite. 


Remark 5.5.14 . We may now develop an ‘arithmetic of numerals’ which paral- 
lels and extends the arithmetic of finite cardinals in a topos with natural number 
object. We have already defined a successor operation (A,a,t) + (A,a,t)t on 
numerals; the corresponding ‘predecessor operation’ (—)~ sends (A,a,t) to the 
image of t, equipped with the structure maps ta and the restriction of t to its 
image. Because numerals are necessarily well-supported, we have a slightly awk- 
ward ‘shift’ between cardinals and numerals: we have to regard 1 as the numeral 
version of zero (it is, of course, n(0)), and 1 I 1 = n(1) as the numeral version 
of one. We define the sum (A,a,t) @ (B,b,u) to be n(A V B)~, where AV B is 
the pushout 


Pee Y 


A — > AVB 


note that A V B is K-finite if A and B are, since it is a quotient of AU B. 
The morphisms 14 xb: A= Ax1—-+AxBandaxlg:B=1xB-AxB 
induce a monomorphism AV B > A x B, and we may thus define the product 
(A,a,t) ® (B,b, u) to be n(A A B)~, where AA B is the pushout 


AVB>—>AxB 


1 ——> AAB 


(again, this is K-finite if A and B are, since it is a quotient of A x B). We can 
even define truncated subtraction: given numerals A and B; the least numeral C 
satisfying C ® B > A is given by n(D)~, where D is the ‘kernel’ of the T-algebra 
epimorphism A > AN B, i.e. the s-closed subobject of A which corresponds to 
AN B as in the proof of 5.5.10(ii). Note that D need not be K-finite if A is, 
since it need not be complemented in A; however, n(D) is still K-finite, since D 
is certainly a subobject of a K-finite object. 

It is not hard to verify that the operations on numerals thus defined satisfy 
(most of!) the usual laws of arithmetic, and that if E does have a natural number 
object then the mapping p +> [sp] = n(|[p]) is a ‘homomorphism’ € (1, N) > 
Num(€). However, some familiar identities involving — fail to hold for numerals, 
because of the non-Boolean nature of their arithmetic: for example, on the subset 
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{A € Num(€) | A < n(1)}, the operations @ and 
with the lattice operations U and N, and hence + 
implication on this preorder (which is equivalent 
of 5.5.10). So, for example, n(1)~(n(1)—A) ne 
A < n(1). We shall leave the further exploration 
to the reader. 
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(a) Notation for categories 


Notation 


Ab 


reg 


Bisep(C, J, K) 


BrE 
Bool 
ByE 


CMon(C) 
CoCart(C) 
Cont(G) 

Conts (G) 


Cs(E) 


INDEX OF NOTATION 


Meaning 


Category of abelian groups 

Syntactic allegory of (a regular theory) T 
Category of biseparated functors on 

(a bisite) (C, J, K) 

Lower bagdomain of E 

Category of Boolean algebras 

Upper bagdomain of £ 

Indexed category generated by (an internal 


category) C 


Internal full subcategory of C generated by A 
Free cartesian category on C 

Category of small categories and functors 
Kleisli category of (—) x B 

Category of complete Boolean algebras 
Category obtained by splitting idempotents 


in E 


Subcategory of exponentiable objects of C 
Category of Choquet spaces 

Category of complete join-semilattices 
Category of complete lattices 

Category of commutative monoids in C 
Free cocartesian category on C 

Category of continuous G-sets 

Category of continuous actions of (a localic 
groupoid) G 

Subcategory of coarse objects of 

(a quasitopos) E 

Syntactic category of T 

Décalage of C 

Simplicial category 


Introduced 


Al.3.6 
D1.4.15 
C2.2.13 


B4.4.16 
D3.4.12 
B4.4.21(c) 
B2.3.3 


B2.3.5 
B4.2.4(f) 
A1.5.1 
A1.5.4 
C1.1.21 
A1.1.8 


A1.5.1 
A2.6.2(b) 
C1.1.3 
C1.6.9 
C1.1.8 
B3.2.6 
A2.1.6 
C5.2.11 


A2.6.11 
D1.4.1 


B2.5.4(b) 
B2.3.2 
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Desc(C, R) 
Desc(£.) 
Div 

DLat 
doFib 


Dom 
EdKf 


Efe 
Eff (C) 


jSLat 
k-Tors(D) 
LH 


Loc 
Lop(€) 
Map(A) 
Mat(L) 
Mat; (L) 
ME 


Mf 
Mon 
mSLat 
Num(&) 
On 


INDEX OF NOTATION 


Category of descent data for R in C 
Category of descent data for £. 
Category of divisible abelian groups 
Category of distributive lattices 
Category of small categories and discrete 
opfibrations 

Category of domains 

Subcategory of decidable K-finite objects 
of € 

Subcategory of finite cardinals in £ 
Effectivization of C 

Subcategory of K-finite objects of £ 
Subcategory of K-finite objects of € 
Category of enumerated sets 
Filterquotient of £ modulo ® 

Category of f-extracts 


Category of indexed families of objects of C 


Category of finite families of objects of C 
Category of Fréchet spaces 

Category of frames 

Category of L-valued fuzzy sets 
Category of finite L-valued fuzzy sets 
Gleason cover of (a topos) £ 

Category obtained by glueing along F 
Category of groups 

Category of Heyting semilattices 
Category of posets and interval fibrations 
Inductive completion of C 

Category of join-semilattices 

Category of k-torsors on D 

Category of spaces (or locales) and local 
homeomorphisms 


- Category of locales and continuous maps 


Poset of local operators on £ 

Category of maps in (an allegory) A 
Allegory of L-valued matrices 

Allegory of finite L-valued matrices 
Topos of measures on £ 

Category of manifolds and smooth maps 
Category of monoids 

Category of meet-semilattices 

Preorder of numerals in £ 

Ordered class of ordinals 


B1.5.1 
B3.4.12 
D2.4.6 
C1.1.3 
B2.5.3 


C1.1.3 
D5.4.18 


D5.2.7 
A3.3.10 
D5.4.16 
D5.4.25 
A2.1.11(k) 
A2.1.13 
C3.3.8 
B1.4.16 
B1.4.17 
A2.6.2(c) 
C1.1.1 
A2.6.2(e) 
A2.6.2(e) 
D4.6.8 
A2.1.12 
Al.1.5 
Al.5.11 
C2.1.13 
C4.2.1 
C1.1.3 
D2.3.3 
A1.2.7, C1.3.2 


C1.2.1 
A4.5.10 
A3.2.1 
A3.2.2(b) 
A3.2.2(b) 
B4.5.8 
A2.1.11(e) 
Al1.3.4 
C1.1.3 
D5.5.5 
A2.1.5 


Sections A and B are in Vol. 1; Sections C and D are in Vol. 2 


Orb(€) 
Ord; 
Ord sm, 


PE 
PFrm 
Pos(C) 
Pos% (C) 
Poset 
Pt(£) 
Rect(D,C) 
Reg(C) 
Rel(C) 
sen({£) 
scens(£) 
Sep(C, T) 


sep, (£) 
Set 
Sete 
Set f 
Set fe 
Set fm 
Set, 
Set(L) 


Set 
Sh(C, T) 


She(X) 
sh;(€) 
Shs(C,T) 


INDEX OF NOTATION 


Preorder of orbitals in £ 

Category of finite totally ordered sets 
Category of finite totally ordered sets and 
injections 

Topos of probability measures on £ 
Category of preframes 

Positivization of C 

co-positivization of C 

Category of partially ordered sets 
Category of points of £ 

Rectangular diagram category 
Regularization of C 

Allegory of relations in C 

Sierpinski cone on £ 

Sierpinski cone on £ over S 

Category of separated functors on (a site) 
(C,T) 

Subcategory of j-separated objects of € 
Category of sets and functions 
Category of countable sets 

Category of finite sets 

Category of finite sets and surjections 
Category of finite sets and injections 
Category of k-small sets 

Category of [-valued sets 


Category of sets and injections 
Category of sheaves on (a site) (C,T) 


Category of G-equivariant sheaves on X 
Subcategory of j-sheaves in £ 


Category of S-valued sheaves on an internal 


site 


Category of sheaves on (a space or locale) X 


Symmetric Giraud frame on C 
Category of L-structures in C 

Category of simplicial objects of S 
Category of sketches 

Category of spatial locales 

Category of sober spaces 

Category of spaces and continuous maps 
Preorder of subobjects of A 

Category of torsion-free abelian groups 
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D5.5.2 
B1.1.7 
C3.5.9(d) 


B4.5.12 
C1.1.3 
Al.4.5 
Al1.4.19 
C1.1.3 
C4.3.3 
B1.3.12 
A1.3.9 
A3.1.2 
A2.1.12 
C3.6.3(f) 
A2.6.2(d), 
C2.1.2 
A4.4.3 
Al.1.5 
A1.5.13 
A1.4.6 
C3.5.9(c) 
A2.1.11(h) 
A2.1.2 
A3.3.13, 
C1.3.3 
A2.1.5 
A2.1.10, 
C2.1.2 
A2.1.11(c) 
A4.4.3 
C2.4.1 


A2.1.8, C1.3.1 
B4.5.4 
D1.2.1(b) 
B2.3.2 
D2.1.1(b) 
C1.2.3 

C1.2.3 

Al.2.5 

Al.3.1 

A13.6 
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T-Mod(C) 


T-Mod(C)e 


Tors(C, £) 
Unif(G) 


INDEX OF NOTATION 


Category of T-models in C 


Ditto, with elementary morphisms 
Category of C-torsors in £ 
Category of uniformly continuous G-sets 


(b) Notation for 2-categories 


Notation 


BooLlog 
BooTop 
BTop 
Cart 
CART 
Cat 

CAT 
Cats 
Cat(S) 
CATs 
Caus 
cSibs 
Cocomps 
Cobh 
Comp. 
DIMTop k 
DTop/S 
bibs 
SILT s 
GOFrms 
LocPress 
LocTop/S 
Log 
Logn 
LTop 
PreTLop 
Profs 
prop/S 
PTop/s 
Reg 
tLorss 
sgibs 
Span(C) 


Meaning 


Boolean toposes and logical functors 
Boolean toposes and geometric morphisms 
Toposes and bounded geometric morphisms 
Small cartesian categories and functors 
(Possibly) large cartesian categories 

Small categories and functors 

(Possibly) large categories and functors 
Locally internal categories over S 

Internal categories in S 

S-indexed categories 

Finite Cauchy-complete categories 

Cloven fibrations over S 

Cocomplete locally small S-indexed categories 
(Small) coherent. categories and coherent functors 
Complete locally small S-indexed categories 
De Morgan toposes and skeletal maps 
Diagram toposes over S 

Fibrations over S 

S-indexed categories with filtered colimits 
Giraud frames over S 

Locally presentable S-indexed categories 
Local toposes over S 


‘Toposes and logical functors 


Ditto, with natural number objects 

Toposes and localic morphisms 

(Small) pretoposes and coherent functors 
Internal profunctors in S 

Pointed bounded S-toposes 

Ditto (lax version) 

(Small) regular categories and regular functors 
Profunctors which are right torsors 

Split fibrations over S 

Bicategory of spans in C 


D1.2.12, 
D2.1.1(c), 
D4.3.13 
D1.2.12 
B3.2.3 
A2.1.7 


Introduced 


D4.3.14(e) 
D4.6.12 
B3.1.11 
A4.1.10 
B1.2.2(d) 
A4,1.4 
B1.1.1 
B2.2.1 
B2.3.1 
B1.2.1 
C2.2.23 
B1.3.6 
B4.5.1 
D3.5.5 
B4.5.3 
D4.6.12 
B3.2.8(c) 
B1.3.6 
C4.3.3 
B4.5.1 
B4.5.2 
C3.6.14 
D4.3.13 
D4.3.19(d) 
A4.6.12 
D2.2.5 
B2.7.1 
C3.6.13 
C3.6.14 
D2.2.3 
B3.2.8(e) 
B1.3.6 
A3.3.8 
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Gpan(C) 
Top 
Tops 
Tops 
V-Cat 


INDEX OF NOTATION 


Local poset reflection of Gpan(C) 
Toposes and geometric morphisms 
S-indexed toposes 

Toposes and skeletal morphisms 
V-categories 


(c) Miscellaneous notation 


Notation 


Meaning 


Partial-map representer for A 

(External) axiom of choice 
Pullback-stability condition for a coverage 
Semidirect. product of C and D 


Closed local operator (or nucleus) corresponding 


to U 

Closed sublocale complementary to U 
Theory of decidable objects 

Theory of infinite decidable objects 
Axiom of dependent choices 
Localization of £ at s 

Colocalization of £ at s 

Left adjoint of pullback for subobjects 
Internal version of A, 

Exterior of (a local operator) j 

Graph of f (as a relation) 

Right adjoint of pullback for subobjects 
Internal version of V+; 

Poset of ideals of A 

Internal axiom of choice 

Object of idempotent endomorphisms of B 
Internalization of (a locale map) f 
Interior of (a local operator) 3 

Object of isomorphisms from A to B 
Theory of K-finite objects 


Object of K-finite subobjects of A 
Local character condition for a coverage 
List object over A 

Left Kan extension along f 

Right Kan extension along f 

Theory of dense linearly ordered objects 
Maximal sieves condition for a coverage 
Numeral of A 
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A3.3.8 
A4.1.1 
B3.1.2 
D4.6.12 
B2.1.1 


Introduced 


A2.4.6 

D4.5.4 

A2.1.9, C2.1.1 
B2.5.9, C2.5.3 
A4.5.3, 
C1.1.16(b) 
C1.2.6(b) 
D3.2.7 
D3.4.10 
D4.5.16 
C3.6.12 
C3.6.19 
A1.3.1 

A2.2.1, A2.3.6 
A4.5.19 
A3.1.3, B2.7.4 
A1.4.10 
A2.3.10 
C1.1.3 

D4.5.1 
D5.4.10 
C1.6.1 
A4.5.19 
D5.1.9 
D1.2.15(k), 
D3.2.10 
C1.1.3, D5.4.1 
C2.1.8 
A2.5.15 
A4.1.4, B2.3.20 
A4.1.4, B2.3.20 
D3.4.11 
C2.1.8 
D5.5.11 
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Frame of nuclei on A 

Frame of B-fibrewise-closed nuclei on A 
Natural number object. 

Theory of objects 


Theory of inhabited objects 

(Weak) subobject classifier 

Classifier for j7-closed subobjects 

Open local operator (or nucleus) corresponding 
to U 

Frame of opens of (a space or locale) X 
Power-object of A 

Ordered power-object of A 

Partial product of A and f 

(Covariant) 2-categorical partial product 
Right adjoint of pullback functor 
Quasi-closed local operator corresponding to U 
Object. of Cauchy real numbers 

Object of Dedekind real numbers 

Locale of formal real numbers 

Object of lower semicontinuous real numbers 
Object of MacNeille real numbers 

Scott topology on A 

Left adjoint of pullback functor 

Singleton map 

Supports split 

Object. of sieves on C 

Term obtained by substitution 

1Ui1 

Higgs object 

Smallest dense sublocale of X 

Dissolution of (a locale) X 

Class of monomorphisms classified by D 
Space of points of (a locale) X 
Interpretation of ġ in M 


C1.1.15 
C1.1.22 
A2.5.1 
B4.2.4(a), 
D3.2.1 
C5.2.8(c) 
A1.6.1, A2.6.1 
A4.4,2 
A4.5.1, 
C1.1.16(a) 
A2.1.8, C1.2.1 
A2.1.1, A3.4.5 
B2.3.8(a) 
A1.5.7 
B4.4.13 
A1.5.3, B1.4.2 
A4.5.21 
D4.7.12 
D4.7.4 

D4.7.5 

D4.7.2 

D4.7.4 

C4.1.3 
A1.2.8, B1.4.4 
A2.2.2 

D4.5.1 

C2.4.1 

D1.1.5 
C3.4.12, D4.5.7 
D4.5.10 
C1.2.6(c) 
C1.2.13 
A4.5.11 
C1.2.2 
D1.2.3, D1.2.6, 
D4.1.5 
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Adjoint (in a 2-category) B1.1.2 
weak B1.1.8 
Adjoint retract B1.1.9 
Algebra (for algebraic theory) D1.2.15(a) 
free D5.3.5 
Algebra (for an endofunctor) A1.1.4 
Allegory A3.2.1 
distributive A3.2.13 
division A3.4.1 
effective A3.3.9 
geometric A3.2.12 
positive geometric A3.2.12 
positive union A3.2.11 
power A3.4.5 
pre-tabular A3.3.6 
syntactic D1.4.15, D4.3.9 
tabular A3.2.3 
union A3.2.11 
Apartness relation D4.7.6 
Arithmetic operations A2.5.4 
on numerals D5.5.14 
on R D4.7.8 
Arity D1.1.1 
Associated sheaf functor A4.1.8, 
C2 2.6 
Associated split fibration B1.3.10 
Atom C3.5.7 
Atomic proposition D1.1.1 
Atomic system D3.4.13 
Atomic topos C3.5.1 
Axiom 
distributive D1.3.1(#) 
Frobenius D1.3.1(i) 
geometric B4.2.7(b) 
identity D1.3.1(a) 
logical D1.3.1 
non-logical D1.1.6 
of choice D4.5.1(8), D4.5.4 
of theory D1.1.6 


A4.4.4, 


Bag of T-models B4.4.15 
Bagdomain 
lower B4.4.17 
upper B4.4.21(c) 
Basis (for locale) C2.2.4(b) 
Beck-Chevalley condition Al.4.11, A2.2.4, 
A4.1.16, B1.4.2 
weak A4.1.16 
Beck-Chevalley morphism C2.4.16 
stable C2.4.16 
weak C2,4.16 


Beth definability theorem D3.5.2 
Bicategory B1.1.2 
Bimodule (between internal categories) B2.7.1 
Biseparated functor C2.2.13 
Bisite C2.2.13 
Bound (for topos over S) B3.1.7 
pre- D3.2.3 


C-monoid D4.2.15 
Cantor—Bernstein Theorem D4.1.11 
Cantor diagonal argument D4.1.8 
Capitalization D1.5.3 
Cardinal 
finite A2.5.14, D5.2.1 
generic finite A2.5.14 
limit power A2.1.2 
Cartesian closed category A1.5.1 
locally Al.5.3 
properly A1.5.1 
Category 
accessible D2.3.1 
balanced Al.1.1 
Barr-exact A1.3.6 
Boolean coherent A1.4.10 
cartesian A1.2.1 
cartesian closed A1.5.1 
Cauchy-complete A1.1.10 
coherent A1.4.1 
concrete A1.1.5 
continuous C4 2.7 
co-slice A1.1.6 
cowell-powered A1.4.17 
disjunctive B1.5.10, D1.3.6 
effective regular A1.3.6 
enriched B2.1.1 
essentially small B2.3.4 
exact A1.3.6 
generated by a protocategory A1.1.5 
geometric A1.4.18 
Grothendieck B1.3.1 
Heyting A1.4.10 
ind-small C4.2.18 
indexed B1.2.1 
internal B2.3.1(a) 
«-accessible D2.3.1 
k-filtered D2.3.1 
«-Heyting D3.1.19 
Karoubian A1.1,10 
lextensive B1.5.10, D1.3.6 
locally cartesian A1.2.6 
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locally cartesian closed A1.5.3 
locally internal B2.2.1 
locally presentable B4.5.2, D2.3.6 
locally small Ai.1.1, B1.3.12 
logical A1.4.1 
partial D3.1.8 
positive coherent A1.4.4 
properly cartesian closed A1.5.1 
regular A1.3.3 
o-coherent A1.4.19 
simply connected A1.2.9 
sketchable D2.3.8 
slice A1.1.6 
smal] Al.1.1 
*-autonomous C1.1.7 
stiff B1.3.11 
strongly connected A4.6.9, D3.1.7 
structured over B Al.1.5 
suitable for B D4.1.4 
syntactic D1.4.1, D4,2.4, D4.3.4, D4.4.7 
y-enriched B2.1.1 
well-copowered A1.4.17 
well-powered Aj.4.17, B1.3.14 
Centre (of local morphism) C3.6.2 
Characteristic morphism A1.6.1 
Choice, axiom of D4.5.4 
dependent D4.5.16 
external D4.5.4 
internal D4.5.1(b) 
Classifying map (of subobject) A1.6.1 
Classifying topos B4.2.1(b), D3.1.1 
Boolean D3.4.3 
for atomless Boolean algebras D3.4.12 
for cartesian theory D3.1.1 
for coherent theory D3.1.9 
for decidable objects D3.2.7 
for dense linearly ordered objects D3.4.11 
for geometric theory B4.2.9, D3.1.12 
for infinite decidable objects D3.4.10 
for K-finite objects D3.2.10 
for objects B3.2.9, D3.2.1 
for partial equivalence relations on N 
C5,2.8(c) 
for partial surjections N + NN D4.1.9 
for propositional theory B4.2.12 
for regular theory D3.1.4 
for subobjects of | B3.2.10 
for subterminal objects B3.2.11 
Cleavage B1.3.4 
normalized B1.3.4 
Closed Subgroup Theorem C5.3.2 
Closure (of subtopos) A4.5.20 
Closure operation 
universal A4.3.2 
Coadjoint retract B1.1.9 
Co-bag of T-models B4.4.21(b) 
Cocomplete object (in 2-category) B1.1.16 
injective B1.1.16 


linearly B1.1.16 
pointwise B1.1.16 
Coherent category Al.4.1 
Boolean A1.4.10 
positive Ai.4.4 
Colocalization C3.6.19 
Comma object B1.1.4(c) 
Comonad 
cartesian A4.2.1 
idempotent B1.1.9(c) 
Comorphism of sites C2.3.18 
Comparison Lemma C2.2.3 
Completeness Theorem 
classical D1.5.4, D1.5.10, D1.5.14, D3.1.16 
conceptual D3.5.9 
for cartesian logic D1.4.6, D1.5.1 
for coherent logic D1.4.11, D1.5 10 
for first-order logic D1.4.11, D1.5.14, 
D3.1.18 
for geometric logic D1.4.11, D3.1 16 
for -calculus D4.2.6 
for pA#-calculus D4.4.9 
for regular logic D1.4.11, D1.5.4 
for r-calculus D4.3.11 
for 7 L-calculus D4.3.19(6) 
functional D1.3.12, D3.1.3 
Composition predicate A1.1.5 
Comprehension scheme B1.3.12 
Comprehensive factorization B2.5.11 
Condition (PCC) B4.4.4 
Cone 
discrete D2.1.1 
in sketch D2.1.1 
lax B1.1.6 
weighted B1.1.3 
Constant symbol D1.1.1 
Context D1.1.4, D4.4.2 
canonical D1.1.4 
suitable D1.1.4 
Continuous G-set A2.1.6 
uniformly A2.1.7 
Continuous map (of locales) C1.2.1 
closed C3.2.1 
minimal surjective D4.6.8 
open C1.5.3 
perfect C3.2.9 
proper C3.2.5 
separated C3.2.9 
triquotient C3.2.7 
Coproduct 
disjoint A1l.4.4 
disjoint S-indexed B1.4.11 
quasi-disjoint A1.5.14 
S-indexed B1.4.4 
stable S-indexed B1.4.10 
Core (in allegory) A3,3.1 
Coreflexive pair A1.2.10 
Cosieve A1.6.6, A4.5.2 
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Cotensor B1.1.4(b) 
Cover A1.3.2 
Coverage A2.1.9, C2.1.1 
canonical C2.1.11 
chaotic C2.2.14(a) 
coherent A2.1.11(b), C2.1.12(d) 
dm- C3.2.18 
Grothendieck C2,1.8 
induced C2.2.2 
pre-canonical C2.1.12(d) 
regular A2.1.11(a@), C2.1.12(b) 
rigid C2.2.18(b) 
separated 2.1.11 
sifted C2.1.3 
subcanonical C2.1.11 
Zariski A2.1.11(f), D3.1.11(a@) 
Covering family A1.3.2, A2.1.9, C2.1.1 
Curry—Howard isomorphism D4.2.1 


De Morgan’s law D4.6.2 
strong D4.6.6 
weak D4.6.4 
Décalage B2.5.4(b) 
Dedekind section D4.7.4 
Definability (in fibration) B1.3.15 
Definability Theorem D3.5.1 
Derivation D1.3.1 
Descent condition B1.5.2 
effective B1.5.2 
Descent data B1.5.1 
Descent morphism B1.5.5 
Diagram 
constant B2.5.4(a) 
in directed graph D2.1.1 
in indexed category B2.3.11 
lax B1.1.6 
weighted B1.1.3 
Diaconescu's Theorem B3.2.7 
Diers completion D3.2.8 
Direct image A4.1.1(a@) 
Dissolution (of locale) C1.2.13 
Distribution (on topos) B4.5.8 
Distributor B2.7.1 
Doctrine B1.1.11 
Domain C1.1.3 
Double coset D3.4.1 


Edwards criterion C3.4.4 
Effectivization A3.3.10 
Eilenberg-Moore object B1 1.7 
Embedding (of structures) D1.2.10(c) 

elementary D1.2.10(b) 
Epimorphism 

extremal Ai.3.2 

regular A1.3.2 

strong A1.3.2 


Equifier B1.1.4(f) 

Equivalence (in allegory) A3.3.1 

Equivalence (in 2-category) B1.1.2 
strong B1.1.2 

Equivalence (of categories) A1.1.1 
weak Al.j.1 

Equivalence (of theories) D1.3.7(b) 
Morita D1.4.9, D1.4.13 

Equivalence relation A1.3.6 
effective A1.3.6 
strong C2.2.13 

Equivalence sieve C3.4.5 

Espace étalé C1.3.3 

Btale completion C5.3.16 

Etale morphism C1.3.3 

Étendue C5.2.4 

Excluded Middle, law of D1.3.3 

Exponential ideal A1.5.10 

Exponential transpose A1.5.1 

Exponential variety C5.4.1 

Exterior (of subtopos) A4.5.19 


Factorization theorem 
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closed inclusion-totally connected C3.6.17(d) 


comprehensive B2.5.11 
direct image—inverse image A4.1.12 


for locally connected morphisms C3.3.5 


for pre-geometric morphisms A4.1.13 

for tidy morphisms C3.4.15 

hyperconnected-—localic A4.6.5 

open inclusion~local C3.6.3(f) 

pure—entire C3.4.13 

surjection-inclusion A4.2.10 
Fibration B1.3.4 

associated split B1.3.10 

cartesian B1.4.1 

cloven B1.3.4 

continuous C2.5.6 

discrete B1.3.11, B2.5.1 

in 2-category B4.4,1 

interval C2.1.13 

locally small B1.3.12 

of sites C2.5.6 

satisfying (PCC) B4.4.4 

split B1.3.4 

stiff B1.3.11 

well-powered B1.3.14 
Field D1.1.7(h) 

coherent D1.1.7(h) 

residue D4.7.9 
Filter 

completely prime C1.2.1 

neighbourhood C1.2.1 
Filterpower A2.1.13 
Filterquotient construction A2.1.13 
Finite cardinal A2.5.14, D5.2.1 

generic A2.5.14 
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Finite object 
Dedekind- A1.6.10, D5.1.4, D5.2.9 
K- D5.4.22 
Kuratowski- D1.1.7(k), D5.4.1(b) 
Russell- D5.5.12 
Sierpiński- D5.4.1 
Fixset C1.1.13 
dense C1.1.16(c) 
Formula D1.1,3, D4.1.3 
atomic D1.1.3, D4.1.3 
cartesian D1.1.7(m), D1.3.4 
closed D1.1.3 
coherent D1.1.3 
complete D3.4.13 
disjunctive D1.3.6 
first-order D1 1.3 
geometric D1.1.3 
Horn D1.1.3 
in context D1.1.4 
provably equivalent D1.3.7(a) 
regular D1.1.3 
T-complete D3.4.13 
well-formed D1.1.3 
Frame A3.2.2(b), C1.1.1 
coherent C2.4.3 
consistent C2.4.9 
Giraud B4.5.1 
internal C1.6.1 
nontrivial C2.4.9 
Frame-valued set A3.3.13, C1.3.3 
complete C1.3.6 
Free topos D4.3.14(a) 
with natural number object D4.3.19(d) 
Frobenius reciprocity A1.3.3, A1.5.8 
Function symbol D1.1.1, D4.1.1 
Functional completeness D1.3.12, 
D4.2.10 
Functional extension (of theory) B4.2.7(a) 
Functor 
accessible B3.1.12 
between 2-categories B1.1.2 
bicartesian D5.1.8 
biseparated ©C2.2.13 
cartesian A1.2.1 
cartesian closed A1.5.2 
cocontinuous (between sites) C2.3.18 
cofinal B2.5.12 
coherent Ai.4.4 
conservative A1.1.1 
continuous (between sites) C2.3.1 
continuous (indexed) B1.4.13 
cover-creating C3.5.10 
cover-preserving C2.3.1 
cover-reflecting C2.3.18 
direct image A4.1.1(a), C1.4.3 
essentially surjective Al.1.1 
faithful A1.1.1 
flat C2,3.7 


D3.1.3, 
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final B2.5.12 

finitely covering D3.5.7 

full on subobjects D3.5.6 

Heyting A1.4.10 

indexed B1.2.1 

initial B2.5.11 

interna] B2.3.1(b) 

inverse image A4.1.1(a), C1.4.1 

«-continuous D2.3.4 

«-Heyting D3.1.19 

lax B1.1.2 

left exact Al.2.1 

locally internal B2.2.1 

logical A2.1.1 

monadic A1.1.2 

regular A1.3.3 

&-continuous B1.4,13 

separated A2.6.4(d), C2.1.2 

strict B1.1.2 

strong B2.1.4 

sub-cartesian-closed ©3.1.1 

sub-logical C3.1.1 

tripleable A1.1.2 

2-initial B3.4.11 

V-enriched B2.1.1 
Fundamental Theorem of topos theory A2.3.2 


G-set A2.1.4 

continuous A2.1.6 

effective B3.2.4(b) 

transitive B3.2.4(b) 

uniformly continuous A2.1.7 
G-torsor B3.2.4(b) 
Generating family A1.2.4 
Generator A1.2.4 
Geometric category 41.4.18 

oo-positive A1.4.19 
Geometric construct B4.2.5 

coherent B4.3.8 
Geometric morphism A4.1.1(a@) 

atomic C3.5.1 

Beck-Chevalley C2.4.16 

bounded B3.1.7 

closed C3.2.3 

connected C1.5.7 

descent C5.1.1 

embedding A4.2.9 

entire B4.4.21(a) 

essential] A4.1.5 

inclusion A4.2.9 

lax descent C5.1.8 

local C3.6.1 

localic A4.6.1 

locally connected C3.3.1 

hyperconnected A4.6,3 

hyperlocal B4.4.19 

minimal surjective D4.6.8 
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open C1 5.4, C3.1.7 

partial A4.1.13 

pre-descent ©5.1.1 

proper C3.2.12(a) 

pure C3.4.12 

S-essential B3.1.1 

S-indexed B3.1.2 

separated C3.2.12(b) 

skeletal D4.6.9 

stably closed C3.2.26 

surjection A4.2.6 

tidy C3.4.2 

totally connected C3.6.16 

weak Beck—Chevalley C2.4.16 
Geometric quotient B4,2.7(b) 
Geometric theory B4.2.7(c), D1.1.6 

dualizable B4.3.2 

propositiona] B4,2,12,D1.1.7(m) 
Geometric transformation A4.1.1(b) 
Giraud frame B4.5.1 

symmetric B4.5.4 
Giraud’s Theorem B3.3.4, B3.3.11, C2.2.8 

little C2.1.11 
Gleason cover D4.6.8 
Glued topos A2 1.12 
Glueing construction A2.1.12 
Gédel’s Incompleteness Theorem D4.3.19(d) 
Graph 

directed D2.1.1 

of morphism A3.1.3 

undirected D3.2.7 
Grothendieck construction A1.1.7, B1.3.1 
Grothendieck coverage C2.1.8 
Grothendieck topology A2.1.9 
Grothendieck topos C2.2.9 
Groupoid 

interna] B2.3.12(c) 

localic C5.2.11 


Heine-Borel Theorem D4.7.4 
Heyting algebra A1.5.11 
irreducible A2.6.4(e) 
Heyting category A1.4.10 
Heyting semilattice A1.5.11 
Higgs object D4.5.10 
Hyperconnected-localic factorization A4.6.5 


Idea] 
exponential A1.5.10 
for coverage C1.1.16(e), C2.3.21 
in poset B3.2.4(d), C1.1.3 
maxima] D4.6.14 
prime D4.6.14 
proper D4.6.14 
Idempotent A1.1.8 
(in 2-category) B1.1.9 
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split A1.1.8 
symmetric A3.3.2 
Idempotent monad B1.1.9 
associated A4.3.11 
Identifier B1.1-4(e) 
Image A1.3.1 
direct Ad.1.1 
inverse A4.1.1 
Inclusion (geometric) A4.2.9 
Indexed category B1.2.1 
cartesian B1.4.1 
complete B1.4.2 
essentially small B2.3.4 
locally small B1.3.12 
well-powered B1.3.14 
Indexing 
canonical B1.2.2(c) 
naive B1.2.2(a) 
Inductive completion C4.2.1 
oo-pretopos A1.4.19 
S-indexed B3.3.9 
Inserter B1.1.4(d) 
Interior (of subtopos) A4.5.19 
Internal category B2.3.1(a) 
connected B2.5.6 
discrete B2.3.12(a) 
filtered B2.6.2(a) 
indiscrete B2.3.4 
weakly filtered B2.6.2(6) 
Internal full subcategory B2.3.5 
Internal groupoid B2.3.12(c) 
Internal language D1.3.11, D4.1.8 
Internal poset B2,3.8 
complete B2.3.9 
conditionally complete D4.7.7 
Dedekind-complete D4.7.7 
directed B2.6 2 
DM-separated D4.7.15 
Internal site C2.4.1 
Internalization C1.6.1 
Interpolation property C4.1.2, C4.2.15 
Interpretation (in structure) D1.2.3, D1.2.6, 
94.1.5 
Brouwer—Heyting-Kolmogorov D1.2.2 
Tarski D1.2.1 
Interval (in poset) C2.1.13 
Inverse image A4.1.1(a) 
Inverter B1.1.4(e) 
Iterator D4.1.2(ix) 


Joyal’s Lemma C4.1.15 


Kan extension A4.1.4 

internal B2.3.20 
Kock—Zoéberlein monad B1.1.11 
Knaster~Tarski Theorem D4.1.10 
KZ-monad B1.1.11 


Sections C and D are in Vol. 2 


[66] 


L-valued matrix A3.2.2(b) 
L-valued set A3.3.13 
A-calculus D4.2.1 
ettended D4.2.16 
typed D4.2.1 
untyped D4.2.16 © 
Language 
first-order D1.1.1 
higher-order D4.1.1 
internal D1.3.11, D4.1.8 
Lattice 
continuous C4.1.1(c) 
distributive C1.1.3 
stably continuous C4.1.10 
Lawvere—Tierney topology A4.4.1 
Lax functor B1.1.2 
Lax limit B1.1.6 
normalized C2.3.15(d) 
Laz pullback B1.1.4(c) 
Leibniz equality D4.3.2 
Limit 
conica] B1.1.3 
indexed B1.1.3 
lax B1.1.6 
weighted B1.1.3 
Limit power cardinal A2.1.2 
Lindenbaum algebra D1.4.14 
for Dedekind sections of Q D4.7.4 
for injections A >> N C1.2.9 
for paradoxical open cover of [0, 1) D4.7.13 
for partial surjections N +» N" D4.1.9 
for surjections N > A C1.2.8 
List object A2.5.15, D4.1.4 
Local character condition C2.1.8 
Loca] homeomorphism A1.2.7, C1.3.2, C3.3.4 
Local operator A4.4.1 
closed A4.5.3 
dense A4.5.20 
double-negation A4.5.9 
open A4.5,.1 
pullback A4.5.14(e) 
quasi-closed A4.5.21 
Locale C1.2.1 
coherent C2.4.3 
compact C1.5.5 
connected C1.5.7 
discrete C1.6.4 
exponentiable C4.1.9 
extremally disconnected C1.5.10, D4.6.3(6) 
Hausdorff C1.2.17 
injective C4.1.6 
internal C1.6.1 
local €1.5.6 
locally coherent C4.1.13 
locally compact C4.1.7 
locally connected C1.5.8 
locally metastably compact C4.4.7 
locally stably compact C4.1.11 
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open C3.1.16 
regular C1.2.17 
second countable D4.5.2(d) 
spatial C1.2.3, C1.6.5 
stably locally compact C4.1.11 
strongly compact C3.4.1 
strongly Hausdorff C1.2.17 
strongly zero-dimensiona]l D4.5.2(d) 
Tuy- C1.2.17 
Localic groupoid C5.2.11 
algebraically connected €5.3.7 
atomic C5.2.11 
étale-complete C5.3.16 
open C5,.2.11 
proper C5.2.11 
Localic reflection A4.6.12 
Localization (of cartesian category) A4.3.1 
Localization (of topos) C3.6.12 
Logic 
cartesian D1.3.4 
classica] D1.3.3 
coherent D1.3.1 
disjunctive D1.3.6 
first-order D1.3.1 
infinitary D1.3.1 
K-cartesian D2.3.6 
K-firat-order D3.1.19 
regular D1.3.1 
Logical category A1.4.1 
Logical functor A2.1.1 
Logos A1.4.10 
Los's Theorem D2.4.11 
Lowenheim—Skolem Theorem D2.3.10 


MacNeille completion C2.5,8(e) 
MacNeille section D4.7.4 
Mal’cev operation D4.2.13 
Map (in allegory) A3.1.3 
Martin-Léf type theory D4.4.1 
Measure (on topos) B4.5.8 
probability B4.5.12 
Model (of sketch) D2.1.1(c) 
Model (of theory) B4.2.1(a), D1.2.10(d), 
D4.1.5 
conservative D3.2.6 
finitely presented D2.4.1 
free D5.3.5 
functionally complete D3.1.3 
generic B4.2.1(b), D1.4.5, D3.1.1 
term C3.6.3(c) 
ultraproduct of D2.4.11 
Modification B1.1.2 
Modular Jaw A3.1.5 
Monad (in 2-category) B1.1.7 
idempotent B1.1.9 
KZ- B1.1.11 
symmetric B4.5.6 
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Monoid 
free D5.3.3 
Morleyization D1.5.13 
Morita equivalence D1.4.9, D1.4,13 
for localic groupoids C5 3.1 
Morphism 
cartesian B1.3.3 
characteristic Al 6.1 
descent B1.5.5 
effective descent B1.5.5 
elementary D1.2.10(a@) 
étale C1.3.3 
fully monic B1.1.19 
geometric A4.1.1 
horizontal B1.3.1 
of allegories A3.2.10 
of diagrams B2.3.11 
of fibrations B1.3.6 
of profunctors B2.7.1 
of sites C2.3.1, C2.3.7 
of sketches D2.1.1(b) 
of structures D1.2.1 
pre-descent B1.5.5 
pre-geometric A4.1.13 
prone B1.3.3 
supine B1,3.3 
vertical B1.3.1 
pA-caleulus D4.4.1 
cartesian D4.4.5 
ful] D4.4.5 


Name (of relation) A2.1.1, A3.4.5 
Natural number object A2.5.1 
Freyd D5.1.1 
Lawvere D5.1.1 
Peano D5.1.1 
standard D5.1.7 
weak D4.2.12 
Natural transformation 
indexed B1.2.1 
internal B2.3.1(c) 
lax B1.1.2 
locally internal B2.2.1 
pseudo- B1.1.2 
strong B2.1.6 
V-enriched B2.1.1 
Negation (of subobject) A1.4.13 
Nerve (of internal category) B2.3.2 
Nucleus C1.1.13 
closed C1.1.16(b) 
dense C1.1.16(c) 
double- negation C1.1.16(c) 
fibrewise closed C1.1.22 
fibrewise dense C1.1.22 
flat C1.1.16(d) 
open C1.1.16(a) 


strongly dense C1.1.22 

weakly closed C1.1.22 
Numeral D5.5.4 

K-finite D5.5.9 


Object 
choice D4.5.5 
coarse A2.6.11 
cocomplete B1.1.16 
coherent D3.3.4 
compact D3.3.2 
consistent D3.4.5 
decidable A1.4.15 
decidably well-ordered D5.1.6 
Dedekind-finite Al.6.10, D5.1.4 
exponentiable A1.5.1 
finitely presentable C4.2.1, D2.3.1 
generic D3.2.1 
Higgs D4.5.10 
indecomposable A1.1.10 
internally projective D4.5.1(b) 
irreducible (for coverage) C2.2.18(a) 
«-presentable D2.3.1 
K-finite D5.4,22 
Kuratowski-finite D1.1.7(k), D5.4.1(6) 
list A2.5.15, D4.1.4 
natural number A2.5.1 
of components B2.5.5 
of generators B3.1.7 
of morphisms B2.3.1(a) 
of objects B2.3.1(a) 
orderable C5.4.8 
power A2.1.1, A3 4.5 
projective A1l.1.10, A1.3.8 
quasi-initial A1.5.14 
R-finite D5.5.12 
real number D4.7.4 
separated A4.3.4 
simplicia] B2.3.2 
solid A2.6.6 
stable D3.3.4 
strict (initial) Al.4.1 
subterminal A1.5.10 
supercoherent D3.3.10 
supercompact D3.3.10 
thin A2.6.6 
well-ordered D4.5.9 
well-supported A1.3.8 
Object classifier B3.2.9, B4.2.9, D3.2.1 
Opfibration B1.3.7(a) 
discrete B2.5.1 
in 2-category B4.4.1 
Oplax functor B1.1.2 
Oplax limit B1.1.6 
Orbital D5.5.1 
acyclic D5.5.3 
K-finite D5.5.9 
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Ordered wedge D4.7.15 
thick D4.7.15 
Ore condition A2.1.11(h) 


Partial map A2.4.6, A3.4.10 
representable A2.4.6 
Partial product A1.5.7 
contravariant B4.4.13 
covariant B4 4.13 
Partial surjection A4.6.1 
Peano postulates A2.5.9, D4.3.17(c), D5.1.1 
Pierce representation B4.4.23 
Pitts’s Theorem C2.3.17 
Point (of locale) C1.2.1 
focal C1.5.6 
Point (of topos) C2.2.12 
Poset 
complete B2.3.9 
conditionally complete D4.7.7 
continuous C4.1.1(b) 
Dedekind-complete D4.7.7 
directed B2.6.2 
DM-separated D4.7.15 
inductive D4.5.14 
internal B2.3 8 
Posite C1.1.16(e) 
Positivization A1.4.5 
Power object A2.1.1, A3.4.5 
weak A2.6.1 
Pre-bound D3.2.3 
Pre-descent condition B1.5.2 
Preframe C1.1.3 
Pre-geometric morphism A4.1.13 
Pre-logos A1.4.1 
Preorder A1.1.5 
Presheaf A2.1.8, C1.3.1(a) 
flat B3.2.3 
Pre-stack B1.5.2 
Pretopology A2.1.9 
Pretopos Al 4.8 
oo- A1.4.19 
o- Al.4.19 
Product 
S-indexed B1.4.2 
Profunctor (internal) B2.7,1 
unit B2.7.2 
Profunctor comonad C4,2.16 
Progenitor B3.1.7 
Propositions-as-types paradigm D4.2.1 
Protocategory A1.1.5 
Protomorphism A1.1.5 
Pseudo-complement A1.4.13 
Pseudofunctor B1.1.2 
strictly unital B1.1.2 
Pseudonatural transformation B1.1 2 
Pullback-stability theorem 
for atomic morphisms C3.5.12 


for coherent morphisms C2.4.7 

for connected locally connected morphisms 
C3.3.15 

for connected tidy morphisms C3.4.15 

for exponentiable morphisms B4.3.5 

for fibrations B4.4.3 

for hyperconnected morphisms B3.3.7, 
C2.4.11 

for local maps C©3.6.7(iv) 

for localic morphisms B3.3.6 

for locally connected morphisms C3.3.15 

for open maps C3.1.11, C3.1.23, C3.1.27 

for proper maps C3.2.6, C3.2.21 

for totally connected morphisms C3.6.18 

for tidy morphisms C3.4.7 

Pure-entire factorization C3.4.13 


Quasitopos A2.6,1 
Grothendieck ©2.2.13 
solid A2.6.5 


Real number object 

Cantor D4.7.12 

Cauchy D4.7.12 

Dedekind D4.7.4 

formal D4.7.5 

MacNeille D4.7.4 

semicontinuous D4.7.2 
Rectangular diagram category B1.3.12 
Recursion data D5.1.9 
Recursive function 

partial D5.1.6 

primitive A2.5.4 
Recursor D4.2.11(a) 
Reflection Ai.i.1 
Reflector Al.1.1 

cartesian A4.3.1 
Reflexive pair A1.1.2 
Regular category A1.3.3 

capital A1.3.8 

effective A1.3.6 

quasi-effective C2.2.13 
Regularization A1.3.9 
Relation A1.3.6, A2.1.1, A3.1.1 

apartness D4.7.6 

entire D4.5.5 

equivalence A1.3.6 

name of A2.1.1 

opposite A3.1.3 

reflexive A1.3.6 

symmetric A1.3.6 

transitive A1.3.6 
Relation symbol D1.1.1, D4.1.1 
Ring 

indecomposable B4.4.23 

local D1.1.7(g) 
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reduced D3 1.11(b) 

separably real-closed local D4.7.10 

von Neumann regular D3.1.11(b) 
Rule of inference D1.3 1 

cut D1.3.1(a) 

equality D1.3.1(b), D4.2.2 

of -calculus D4.2.2 

of AN-calculus D4.2.11(b) 

of pr-calculus D4.4.5 

of r-calculus D4.3.1 

substitution D1.3.1({a) 

weakening D1.3.1(a) 
Ryll-Nardzewski’s criterion D3.4.6 


Scone A2.1.12 
Scott topology C4.1.3 
Section 
Dedekind D4.7.4 
lower D4.7.1 
MacNeille D4.7.4 
Semilattice C1.1.3 
complete join- C1.1.3 
continuous C4,1.1(c) 
free C1.1.3, D5.4.9 
Heyting A1.5.11 
join- C1.1.3 
meet- C1.1.3 
Sentence D1.1.3 
Separating family A1.2.4, B2.4.1 
Separator Al.2.4, B2.4.1 
strong A3 4.10 
Sequent D1.1.5 
cartesian D1.1.7(m), D1.3.4 
coherent D1.1.5 
derivable D1.3.1 
disjunctive D1.3.6 
first-order D1.1.5 
geometric D1.1.5 
Horn Di.1.5 
provable D1.3.1 
regular D1-1.5 
satisfied in M D1.2.12(a) 
Sequent calculus D1.3.1 
Sheaf 
for a closure operation A4.3.4 
G-equivariant A2.1.11(c) 
on a locale C1.3.1(6) 
on a site A2.1.9, C2.1.2 
on a space A2.1.8 
Sheafification A4.4.4 
Shift (of simplicial object) B2.5.4(b) 
Sierpiński cone A2.1.12, C3.6.3(f) 
Sierpinski space A2.1.12, C1.1.4 
Sierpiński topos B3.2.11 
Sieve A1.6.6, A4.5.5, C2.1.3 
closed A2.1.10 
connected C3.3.10 


covering C2.1.3 
dm- C3,2.18 
effective-epimorphic C2.1.11 
equivalence C3.4.5 
Sifted colimit A1.2.12 
Sifted coverage C2.1.3 
o-pretopos Aj.4,19 
%-structure D1.2.1, D4.1.4 
homomorphism of D1.2.1 
Signature D1.1.1, D4 1.1 
canonical D1.3.11, D4.1.7 
first-order D1.1.1 
higher-order D4.1.1 
à- D4.2.1 
AN- D4.2.11 (a) 
uà- D4.4.2 
propositional D1.1.7(m) 
T- D4.3.1 
7 L- D4.3.16 
Simplicial identities B2.3.2 
Simplicial object B2.3.2 
2-truncated B3 4.11 
Simplicial topos B3.4.11 
Singleton (in frame-valued set) C1.36 
Singleton map A2.2 2 
Site A2.1 9, ©2.1.1 
atomic C3.5.8 
consistent C2,4.8 
essentially smal] C2.2.3 
internal C2.4.1 
local C3.6.3(d) 
locally connected C3.3.10 
of definition C2.2.15 
small C2.1.1 
standard A2.1.11(a), C2.1.11 
syntactic D3.1.1 
Sketch D2.1.1(a) 
coherent D2.1.2(g) 
colimit D2.1.2(8) 
coproduct D2.1.2(c) 
disjunctive D2.1.2(e) 
finitary D2.1.2(d) 
finite limit D2.1.2(b) 
finite product D2.1.2(b) 
geometric D2.1.2(h) 
inductive D2.1.2(c) 
limit D2.1.2(b) 
linear D2.1.2(a) 
mixed D2.1.2(d) 
product D2.1.2(5) 
projective D2.1.2(b) 
regular D2.1.2(f) 
o-coherent D2.2.9 
Slice category Al 1.6 
Sobrification C1.2.3 
Sorgenfrey plane D4.5.2(e) 
Sort D1.1.1, D4.1.1 
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Soundness Theorem D1.3.2, D4.2.3, D4.3.3, 
D4.4.6 
Source-target predicate A1.1.5 
Space 
Choquet A2.6.4(b) 
coherent D3.3.14 
extremally disconnected C1.5.10 
Fréchet A2.6.4(c) 
injective C4.1.6 
local C1.5.6 
sober C1.2.3 
Tp- C1.2.5 
Span (in a category) A3.3.8 
Stack B1.5.2 
Stone algebra D4.6.1 
Strength (of functor) B2.1.4 
Structure (for signature) D1.2.1, D4.1.4, 
D4,4.3 
Subcategory 
definable B1.3.16 
dense C4.2.18, D2.3.2(iii) 
dense (for coverage) C2.2.1 
full Al.1.1 
internal full B2.3.5 
reflective A1.1.1 
replete Al.1.1 
Subclosure C1.2.5 
Subfibration B1.3.16 
definable B1.3.16 
Sublocale C1.2.5 
closed €1.2.6(b) 
complemented C1.2.11 
dense C1.2.6(c) 
fibrewise closed C1.2.14 
fibrewise dense C1.2.14 
open C1.2.6(a) 
Submersion C1.6.7 
Subobject A1.3.1 
closed A4 3.2 
complemented A1.4.10 
dense A4.3.2 
effective C3.4.5 
generic Al.6.1 
negation of A1.4.13 
widespread A1.6.11 
Subobject classifier A1.6.1 
weak A2.6.1 
Subquotient A4.6.1 
Substitution (in formal language) D1.1.4 
Substitution Property D1.2.4, D1.2.7, D4.1.6 
Substructure D1.2.10(c) 
Subtopos A4.3.9 
Boolean A4.5.21 
closed A4.5.3 
closure of A4.5.20 
dense A4.5.20 
exterior of A4.5.19 
geometric A4.3.9 
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interior of A4.5.19 

logical A4.3.9 

open A4.5.1 

quasi-closed A4.5.21 
Superdense ©1.5.2 
Support of an object A1.3.8 
Supports split D4.5.1(a) 
Surjection (geometric) A4.2.6 
Surjection-inclusion factorization A4,2.10 
Syntactic category 

for cartesian theory D1.4.1 

for coherent theory D1.4.10 

for first-order theory D1.4.10 

for geometric theory D1.4.10 

for \-theory D4.2.4 

for wA-theory D4.4.7 

for regular theory D1.4.10 

for r-theory D4.3.4 


Tabulation A3.1.1, A3.2.3 
7-calculus D4.3.1 
7L-calculus D4.3.16 
Tensor (in 2-category) B1.1.4(b) 
Tensor condition B4.4.5 
Term (in formal language) D1.1.2, D4.1.2 
closed D1.1.3 
in context D1.1.4 
à- D4.2.1 
Term constructor D4.1.2 
Term declaration D4.4.2 
Theory B4.2.1(a), D1.1.6 
No-categorical D3.4.7 
algebraic D1.1.7(a), D5.3.1 
cartesian D1.1.7(m), D1.3.4 
coherent 34.3.8, D1.1.6 
complete D3.4.7 
disjunctive D1.3.6 
equivalent D1.3.7(b) 
first-order D1.1.6 
geometric B4.2.7(c), D1.1.6 
Horn D1.1.6 
internal algebraic D5.3.9 
à- D4.2.2 
model-complete D3.4.9 
pà- D4.4.4 
of objects B4.2.4(a), D3.2.1 
over & B4.2.1 (a) 
propositional B4.2.12, D1.1.7(m) 
regular D1.1.6 
o-coherent D2.2.9 
T- D4.3.3 
TL- D4.3.16 
Topological group 
coherent D3.4.1 
nearly discrete A2.1.6 
pro-discrete A2.1.6 
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Topology 
Grothendieck A2.1.9 
Lawvere-Tierney A4.4.1 

Topos A2 1.1 
atomic C3.5.1 
bounded over S B3.1.7 
classifying B4.2.1(b), D3.1.1 
coherent D3,3.1 
compact C3.2.12(a) 
defined over S B3.1.1 
exponentiable B4.3.1, C4.4.1 
free D4.3.14(a), D4.3.19(d) 
glued A2.1.12 
Grothendieck C2.2.9 
grouplike C3.6.9 
Hausdorff C3.2,12(b) 
having enough points C2,2.12 
hyperlocal B4.4.19 
injective C4.3.1 
Jénsson-Tarski A2.1.11(i) 
local C3.6.2 
locally decidable C5.4.1 
locally orderable C5.4.8 
measure B4.5.8 
quasi-injective C4.3.9 
regular D3.3.1 
S-indexed B3.1.2 
Schanuel A2.1.11(h) 
Sierpiński B3.2.11 
simplicial B3.4.11 
standard D5.1.7 
strongly compact C3.4.1 
symmetric B4.5.4 
totally connected C3.6.16 
2-valued A2.1.13 
weak A2.1.1 
Zariski A2.1.11(f) 

Torsor B3.2.3 
k- D2.3.3 

2-category B1.1.1 
locally ordered A3 1.2 

2-cell B1.1.1 

2-functor B1.1 2 

Type D1.1.1, D4.1.1 
basic D4.1.1 
dependent product D4.4.1 
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dependent sum D4.4.1 
equality D4.4.2(g) 
function D4.1.1 
list D4.1.1 
Mal'cev D4.2.15 
power D4.1.1 
product D4.1.1 
unit D4,4.2(f) 
Type constructor D4.1.1 
Type declaration D4.4.2 


Ultrapower A2.1.13 

Ultraproduct D2.4.11 

Union (of subobjects) A1.4.1 
effective A1.4.3 

Unit (in allegory) A3.2.8 

Universal closure operation A4.3.2 
proper A4,4.2 
strict A4.4,.2 


V-category B2.1.1 
V-embedding B2.1.1 
V-enrichment B2.1,2 
\Y-functor B2.1.1 
y-natural transformation B2.1.1 
Variable D1.1.2 

bound D1.1.3 

free D1.1.3 


Wavy arrow C4.2.12 
Way below C4.1.1(a) 
Weak adjoint B1.1.8 
Weak equivalence A1.1.1 
of localic groupoids C5.3.12 
Weak topos A2.1.1 
Weak subobject classifier A2.6.1 
Weakening Property D1.2.4 
Wf D1.1.3 


Yoneda profunctor B2.7.2 


Zorn’s Lemma D4.5.14 
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